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As we have prepared an updated edition our first priorities 
are to preserve, and to enhance, the qualities that have made 
previous editions so successful. In particular, we adopt the 
viewpoint of an applied mathematician with diverse interests 
in differential equations, ranging from quite theoretical to 
intensely practical—and usually a combination of both. Three 
pillars of our presentation of the material are methods 
of solution, analysis of solutions, and approximations of 
solutions. Regardless of the specific viewpoint adopted, we 
have sought to ensure the exposition is simultaneously correct 
and complete, but not needlessly abstract. 

The intended audience is undergraduate STEM students 
whose degree program includes an introductory course in 
differential equations during the first two years. The essential 
prerequisite is a working knowledge of calculus, typically a 
two- or three-semester course sequence or an equivalent. While 
a basic familiarity with matrices is helpful, Sections 7.2 and 7.3 
provide an overview of the essential linear algebra ideas needed 
for the parts of the book that deal with systems of differential 
equations (the remainder of Chapter 7, Section 8.5, and 
Chapter 9). 

A strength of this book is its appropriateness in a 
wide variety of instructional settings. In particular, it allows 
instructors flexibility in the selection of and the ordering of 
topics and in the use of technology. The essential core material 
is Chapter 1, Sections 2.1 through 2.5, and Sections 3.1 through 
3.5. After completing these sections, the selection of additional 
topics, and the order and depth of coverage are generally at 
the discretion of the instructor. Chapters 4 through 11 are 
essentially independent of each other, except that Chapter 7 
should precede Chapter 9, and Chapter 10 should precede 
Chapter 11. 

A particularly appealing aspect of differential equations 
is that even the simplest differential equations have a direct 
correspondence to realistic physical phenomena: exponential 
growth and decay, spring-mass systems, electrical circuits, 
competitive species, and wave propagation. More complex 
natural processes can often be understood by combining and 
building upon simpler and more basic models. A thorough 
knowledge of these basic models, the differential equations 
that describe them, and their solutions—either explicit solutions 
or qualitative properties of the solution-is the first and 
indispensable step toward analyzing the solutions of more 
complex and realistic problems. The modeling process is 
detailed in Chapter 1 and Section 2.3. Careful constructions 
of models appear also in Sections 2.5, 3.7, 9.4, 10.5, and 
10.7 (and the appendices to Chapter 10). Various problem sets 
throughout the book include problems that involve modeling 
to formulate an appropriate differential equation, and then 
to solve it or to determine some qualitative properties of its 
solution. The primary purposes of these applied problems are 
to provide students with hands-on experience in the derivation 
of differential equations, and to convince them that differential 
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equations arise naturally in a wide variety of real-world 
applications. 

Another important concept emphasized repeatedly 
throughout the book is the transportability of mathematical 
knowledge. While a specific solution method applies to only a 
particular class of differential equations, it can be used in any 
application in which that particular type of differential equation 
arises. Once this point is made in a convincing manner, we 
believe that it is unnecessary to provide specific applications of 
every method of solution or type of equation that we consider. 
This decision helps to keep this book to a reasonable size, and 
allows us to keep the primary emphasis on the development 
of more solution methods for additional types of differential 
equations. 

From a student's point of view, the problems that are 
assigned as homework and that appear on examinations define 
the course. We believe that the most outstanding feature of 
this book is the number, and above all the variety and range, 
of the problems that it contains. Many problems are entirely 
straightforward, but many others are more challenging, and 
some are fairly open-ended and can even serve as the basis 
for independent student projects. The observant reader will 
notice that there are fewer problems in this edition than in 
previous editions; many of these problems remain available 
to instructors via the WileyPlus course. The remaining 1600 
problems are still far more problems than any instructor can 
use in any given course, and this provides instructors with a 
multitude of choices in tailoring their course to meet their own 
goals and the needs of their students. The answers to almost all 
of these problems can be found in the pages at the back of the 
book; full solutions are in either the Student's Solution Manual 
or the Instructor's Solution Manual. 

While we make numerous references to the use of 
technology, we do so without limiting instructor freedom to 
use as much, or as little, technology as they desire. Appropriate 
technologies include advanced graphing calculators (TI 
Nspire), a spreadsheet (Excel), web-based resources (applets), 
computer algebra systems, (Maple, Mathematica, Sage), 
scientific computation systems (MATLAB), or traditional 
programming (FORTRAN, Javascript, Python). Problems 
marked with a W are ones we believe are best approached with 
a graphical tool; those marked with a UJ are best solved with the 
use of a numerical tool. Instructors should consider setting their 
own policies, consistent with their interests and intents about 
student use of technology when completing assigned problems. 

Many problems in this book are best solved through 
a combination of analytic, graphic, and numeric methods. 
Pencil-and-paper methods are used to develop a model that 
is best solved (or analyzed) using a symbolic or graphic 
tool. The quantitative results and graphs, frequently produced 
using computer-based resources, serve to illustrate and to 
clarify conclusions that might not be readily apparent from 
a complicated explicit solution formula. Conversely, the 
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implementation of an efficient numerical method to obtain 
an approximate solution typically requires a good deal of 
preliminary analysis—to determine qualitative features of the 
solution as a guide to computation, to investigate limiting 
or special cases, or to discover ranges of the variables or 
parameters that require an appropriate combination of both 
analytic and numeric computation. Good judgment may well 
be required to determine the best choice of solution methods 
in each particular case. Within this context we point out that 
problems that request a “sketch” are generally intended to 
be completed without the use of any technology (except your 
writing device). 

We believe that it is important for students to understand 
that (except perhaps in courses on differential equations) the 
goal of solving a differential equation is seldom simply to 
obtain the solution. Rather, we seek the solution in order to 
obtain insight into the behavior of the process that the equation 
purports to model. In other words, the solution is not an end 
in itself. Thus, we have included in the text a great many 
problems, as well as some examples, that call for conclusions 
to be drawn about the solution. Sometimes this takes the form 
of finding the value of the independent variable at which the 
solution has a certain property, or determining the long-term 
behavior of the solution. Other problems ask for the effect of 
variations in a parameter, or for the determination of all values 
of a parameter at which the solution experiences a substantial 
change. Such problems are typical of those that arise in the 
applications of differential equations, and, depending on the 
goals of the course, an instructor has the option of assigning as 
few or as many of these problems as desired. 

Readers familiar with the preceding edition will observe 
that the general structure of the book is unchanged. The 
minor revisions that we have made in this edition are in 
many cases the result of suggestions from users of earlier 
editions. The goals are to improve the clarity and readability of 
our presentation of basic material about differential equations 
and their applications. More specifically, the most important 
revisions include the following: 


1. Chapter 1 has been rewritten. Instead of a separate section 
on the History of Differential Equations, this material 
appears in three installments in the remaining three 
section. 


2. Additional words of explanation and/or more explicit 
details in the steps in a derivation have been added 
throughout each chapter. These are too numerous and 
widespread to mention individually, but collectively they 
should help to make the book more readable for many 
students. 


3. There are about forty new or revised problems scattered 
throughout the book. The total number of problems has 
been reduced by about 400 problems, which are still 
available through WileyPlus, leaving about 1600 problems 
in print. 

4. There are new examples in Sections 2.1, 3.8, and 7.5. 

5. The majority (is this correct?) of the figures have been 
redrawn, mainly by the use full color to allow for easier 
identification of critical properties of the solution. In 


addition, numerous captions have been expanded to clarify 
the purpose of the figure without requiring a search of the 
surrounding text. 


6. There are several new references, and some others have 
been updated. 


The authors have found differential equations to be a 
never-ending source of interesting, and sometimes surprising, 
results and phenomena. We hope that users of this book, both 
students and instructors, will share our enthusiasm for the 
subject. 


William E. Boyce and Douglas B. Meade 
Watervliet, New York and Columbia, SC 
29 August 2016 


Supplemental Resources for 
Instructors and Students 


An Instructor's Solutions Manual, ISBN 978-1-119-16976-5, 
includes solutions for all problems not contained in the Student 
Solutions Manual. 

A Student Solutions Manual, ISBN 978-1-119-16975-8, 
includes solutions for selected problems in the text. 

A Book Companion Site, www.wiley.com/college/boyce, 
provides a wealth of resources for students and instructors, 
including 


PowerPoint slides of important definitions, examples, and 
theorems from the book, as well as graphics for presentation 
in lectures or for study and note taking. 


Chapter Review Sheets, which enable students to test their 
knowledge of key concepts. For further review, diagnostic 
feedback is provided that refers to pertinent sections in the 
text. 


Mathematica, Maple, and Mat as data files for selected 
problems in the text providing opportunities for further 
exploration of important concepts. 


Projects that deal with extended problems normally not 
included among traditional topics in differential equations, 
many involving applications from a variety of disciplines. 
These vary in length and complexity, and they can be 
assigned as individual homework or as group assignments. 


A series of supplemental guidebooks, also published by John 
Wiley & Sons, can be used with Boyce/DiPrima/Meade in 
order to incorporate computing technologies into the course. 
These books emphasize numerical methods and graphical 
analysis, showing how these methods enable us to interpret 
solutions of ordinary differential equations (ODEs) in the real 
world. Separate guidebooks cover each of the three major 
mathematical software formats, but the ODE subject matter is 
the same in each. 


* Hunt, Lipsman, Osborn, and Rosenberg, Differential 
Equations with Matas , 3rd ed., 2012, ISBN 978-1-118- 
37680-5 


* Hunt, Lardy, Lipsman, Osborn, and Rosenberg, Differential 
Equations with Maple, 3rd ed., 2008, ISBN 978-0-471- 
77317-7 


e Hunt, Outing, Lipsman, Osborn, and Rosenberg, 
Differential Equations with Mathematica, 3rd ed., 2009, 
ISBN 978-0-471-77316-0 


WileyPLUS WileyPLUS is an innovative, research-based 
online environment for effective teaching and learning. 

WileyPLUS builds students’ confidence because it takes 
the guesswork out of studying by providing students with a 
clear roadmap: what to do, how to do it, if they did it right. 
Students will take more initiative so you'll have greater impact 
on their achievement in the classroom and beyond. 
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CHAPTER L 


Introduction 


In this first chapter we provide a foundation for your study of differential equations in several 
different ways. First, we use two problems to illustrate some of the basic ideas that we 
will return to, and elaborate upon, frequently throughout the remainder of the book. Later, 
to provide organizational structure for the book, we indicate several ways of classifying 
differential equations. 

The study of differential equations has attracted the attention of many of the world’s 
greatest mathematicians during the past three centuries. On the other hand, it is important 
to recognize that differential equations remains a dynamic field of inquiry today, with many 
interesting open questions. We outline some of the major trends in the historical development 
of the subject and mention a few of the outstanding mathematicians who have contributed to 
it. Additional biographical information about some of these contributors will be highlighted 
at appropriate times in later chapters. 


11 Some Basic Mathematical Models; 
Direction Fields 


Before embarking on a serious study of differential equations (for example, by reading this 
book or major portions of it), you should have some idea of the possible benefits to be gained by 
doing so. For some students the intrinsic interest of the subject itself is enough motivation, but 
for most it is the likelihood of important applications to other fields that makes the undertaking 
worthwhile. 

Many of the principles, or laws, underlying the behavior of the natural world 
are statements or relations involving rates at which things happen. When expressed in 
mathematical terms, the relations are equations and the rates are derivatives. Equations 
containing derivatives are differential equations. Therefore, to understand and to investigate 
problems involving the motion of fluids, the flow of current in electric circuits, the dissipation 
of heat in solid objects, the propagation and detection of seismic waves, or the increase 
or decrease of populations, among many others, it is necessary to know something about 
differential equations. 

A differential equation that describes some physical process is often called a 
mathematical model of the process, and many such models are discussed throughout this 
book. In this section we begin with two models leading to equations that are easy to solve. It 
is noteworthy that even the simplest differential equations provide useful models of important 
physical processes. 


EXAMPLE 1 | A Falling Object 


Suppose that an object is falling in the atmosphere near sea level. Formulate a differential equation 
that describes the motion. 


2 


CHAPTER 1 Introduction 


Solution: 


We begin by introducing letters to represent various quantities that may be of interest in this problem. 
The motion takes place during a certain time interval, so let us use f to denote time. Also, let 
us use v to represent the velocity of the falling object. The velocity will presumably change with 
time, so we think of v as a function of t; in other words, t is the independent variable and v is the 
dependent variable. The choice of units of measurement is somewhat arbitrary, and there is nothing 
in the statement of the problem to suggest appropriate units, so we are free to make any choice that 
seems reasonable. To be specific, let us measure time f in seconds and velocity v in meters/second. 
Further, we will assume that v is positive in the downward direction—that is, when the object is 
falling. 

The physical law that governs the motion of objects is Newton's second law, which states that 
the mass of the object times its acceleration is equal to the net force on the object. In mathematical 
terms this law is expressed by the equation 


F — ma, (1) 


where m is the mass of the object, a is its acceleration, and F is the net force exerted on the object. To 
keep our units consistent, we will measure m in kilograms, a in meters/second?, and F in newtons. 
Of course, a is related to v by a = dv/dt, so we can rewrite equation (1) in the form 


pog 2 
=m ar (2) 

Next, consider the forces that act on the object as it falls. Gravity exerts a force equal to 
the weight of the object, or mg, where g is the acceleration due to gravity. In the units we 
have chosen, g has been determined experimentally to be approximately equal to 9.8 m/s? near 
the earth's surface. 

There is also a force due to air resistance, or drag, that is more difficult to model. This is not 
the place for an extended discussion of the drag force; suffice it to say that it is often assumed that 
the drag is proportional to the velocity, and we will make that assumption here. Thus the drag force 
has the magnitude yv, where y is a constant called the drag coefficient. The numerical value of the 
drag coefficient varies widely from one object to another; smooth streamlined objects have much 
smaller drag coefficients than rough blunt ones. The physical units for ^; are mass/time, or kg/s for 
this problem; if these units seem peculiar, remember that yv must have the units of force, namely, 
kg-m/s?. 

In writing an expression for the net force F, we need to remember that gravity always acts in 
the downward (positive) direction, whereas, for a falling object, drag acts in the upward (negative) 
direction, as shown in Figure 1.1.1. Thus 


F = mg — yv (3) 
and equation (2) then becomes 
dv = (4) 
m "m mg — yv. 


Differential equation (4) is a mathematical model for the velocity v of an object falling in the 
atmosphere near sea level. Note that the model contains the three constants m, g, and y. The constants 
m and y depend very much on the particular object that is falling, and they are usually different for 
different objects. It is common to refer to them as parameters, since they may take on a range of 
values during the course of an experiment. On the other hand, g is a physical constant, whose value 
is the same for all objects. 


yu 
Qm 


mg 


| FIGURE 1.1.1 | Free-body diagram of the forces on a falling object. 


1.1 Some Basic Mathematical Models; Direction Fields 


To solve equation (4), we need to find a function v = v(t) that satisfies the equation. It 
is not hard to do this, and we will show you how in the next section. For the present, however, 
let us see what we can learn about solutions without actually finding any of them. Our task 
is simplified slightly if we assign numerical values to m and y, but the procedure is the same 
regardless of which values we choose. So, let us suppose that m = 10 kg and y = 2 kg/s. 
Then equation (4) can be rewritten as 


d 
gg- t, (5) 
dt 5 


EXAMPLE 2 | A Falling Object (continued) 


Investigate the behavior of solutions of equation (5) without solving the differential equation. 


Solution: 


First let us consider what information can be obtained directly from the differential equation itself. 
Suppose that the velocity v has a certain given value. Then, by evaluating the right-hand side of 
differential equation (5), we can find the corresponding value of dv / dt. For instance, if v = 40, then 
dv/dt = 1.8. This means that the slope of a solution v = v(t) has the value 1.8 at any point where 
y — 40. We can display this information graphically in the tv-plane by drawing short line segments 
with slope 1.8 at several points on the line v — 40. (See Figure 1.1.2(a)). Similarly, when v — 50, 
then dv/dt — —0.2, and when v — 60, then dv/dt — —2.2, so we draw line segments with slope 
—0.2 at several points on the line v = 50 (see Figure 1.1.2(b)) and line segments with slope —2.2 at 
several points on the line v — 60 (see Figure 1.1.2(c)). Proceeding in the same way with other values 
of v we create what is called a direction field, or a slope field. The direction field for differential 
equation (5) is shown in Figure 1.1.3. 

Remember that a solution of equation (5) is a function v = v(t) whose graph is a curve in 
the tv-plane. The importance of Figure 1.1.3 is that each line segment is a tangent line to one 
of these solution curves. Thus, even though we have not found any solutions, and no graphs of 
solutions appear in the figure, we can nonetheless draw some qualitative conclusions about the 
behavior of solutions. For instance, if v is less than a certain critical value, then all the line segments 
have positive slopes, and the speed of the falling object increases as it falls. On the other hand, if v 
is greater than the critical value, then the line segments have negative slopes, and the falling object 
slows down as it falls. What is this critical value of v that separates objects whose speed is increasing 
from those whose speed is decreasing? Referring again to equation (5), we ask what value of v will 
cause dv / dt to be zero. The answer is v — (5)(9.8) — 49 m/s. 

In fact, the constant function v(t) = 49 is a solution of equation (5). To verify this statement, 
substitute v(t) = 49 into equation (5) and observe that each side of the equation is zero. Because 
it does not change with time, the solution v(t) = 49 is called an equilibrium solution. It is 
the solution that corresponds to a perfect balance between gravity and drag. In Figure 1.1.3 we show 
the equilibrium solution v(t) = 49 superimposed on the direction field. From this figure we can 
draw another conclusion, namely, that all other solutions seem to be converging to the equilibrium 
solution as t increases. Thus, in this context, the equilibrium solution is often called the terminal 
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Assembling a direction field for equation (5): dv /dt = 9.8—v/5. (a) when v = 40, 
dv/dt = 1.8, (b) when v = 50, dv/dt = —0.2, and (c) when v = 60, dv/dt = —2.2. 
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| FIGURE 1.1.3 | Direction field and equilibrium solution for equation (5): 


dv/dt —9.8—v/5. 


The approach illustrated in Example 2 can be applied equally well to the more general 
differential equation (4), where the parameters m and y are unspecified positive numbers. The 
results are essentially identical to those of Example 2. The equilibrium solution of equation (4) 
is the constant solution v(t) = mg/^. Solutions below the equilibrium solution increase with 
time, and those above it decrease with time. As a result, we conclude that all solutions approach 
the equilibrium solution as t becomes large. 


Direction Fields. Direction fields are valuable tools in studying the solutions of differential 
equations of the form 


— = f(t, y), (6) 


where f is a given function of the two variables t and y, sometimes referred to as the rate 
function. A direction field for equations of the form (6) can be constructed by evaluating f 
at each point of a rectangular grid. At each point of the grid, a short line segment is drawn 
whose slope is the value of f at that point. Thus each line segment is tangent to the graph 
of the solution passing through that point. A direction field drawn on a fairly fine grid gives 
a good picture of the overall behavior of solutions of a differential equation. Usually a grid 
consisting of a few hundred points is sufficient. The construction of a direction field is often 
a useful first step in the investigation of a differential equation. 

Two observations are worth particular mention. First, in constructing a direction field, we 
do not have to solve equation (6); we just have to evaluate the given function f(t, y) many 
times. Thus direction fields can be readily constructed even for equations that may be quite 
difficult to solve. Second, repeated evaluation of a given function and drawing a direction field 
are tasks for which a computer or other computational or graphical aid are well suited. All the 
direction fields shown in this book, such as the one in Figures 1.1.2 and 1.1.3, were computer 
generated. 


Field Mice and Owls. Now let us look at another, quite different example. Consider a 
population of field mice that inhabit a certain rural area. In the absence of predators we assume 
that the mouse population increases at a rate proportional to the current population. This 
assumption is not a well-established physical law (as Newton’s law of motion is in Example 1), 
but it is a common initial hypothesis! in a study of population growth. If we denote time by t 
and the mouse population at time f by p(t), then the assumption about population growth can 
be expressed by the equation 


uo EPs (7) 


! A better model of population growth is discussed in Section 2.5. 
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where the proportionality factor r is called the rate constant or growth rate. To be specific, 
suppose that time is measured in months and that the rate constant r has the value 0.5/month. 
Then the two terms in equation (7) have the units of mice/month. 

Now let us add to the problem by supposing that several owls live in the same 
neighborhood and that they kill 15 field mice per day. To incorporate this information into 
the model, we must add another term to the differential equation (7), so that it becomes 


dp p 

— = — — 450. 8 

dt 2 99 
Observe that the predation term is —450 rather than —15 because time is measured in months, 


so the monthly predation rate is needed. 


EXAMPLE 3 


Investigate the solutions of differential equation (8) graphically. 


Solution: 


A direction field for equation (8) is shown in Figure 1.1.4. For sufficiently large values of p it can 
be seen from the figure, or directly from equation (8) itself, that dp / dt is positive, so that solutions 
increase. On the other hand, if p is small, then dp/dt is negative and solutions decrease. Again, 
the critical value of p that separates solutions that increase from those that decrease is the value of 
p for which dp/dt is zero. By setting dp/dt equal to zero in equation (8) and then solving for p, 
we find the equilibrium solution p(t) = 900, for which the growth term and the predation term in 
equation (8) are exactly balanced. The equilibrium solution is also shown in Figure 1.1.4. 
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| FIGURE 1.1.4 | Direction field (red) and equilibrium solution (blue) for 
equation (8): dp/dt = p/2 — 450. 


Comparing Examples 2 and 3, we note that in both cases the equilibrium solution separates 
increasing from decreasing solutions. In Example 2 other solutions converge to, or are attracted 
by, the equilibrium solution, so that after the object falls long enough, an observer will see 
it moving at very nearly the equilibrium velocity. On the other hand, in Example 3 other 
solutions diverge from, or are repelled by, the equilibrium solution. Solutions behave very 
differently depending on whether they start above or below the equilibrium solution. As 
time passes, an observer might see populations either much larger or much smaller than the 
equilibrium population, but the equilibrium solution itself will not, in practice, be observed. 
In both problems, however, the equilibrium solution is very important in understanding how 
solutions of the given differential equation behave. 

A more general version of equation (8) is 


dp — 


=rp—k, 9 
ap P (9) 
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where the growth rate r and the predation rate k are positive constants that are otherwise 
unspecified. Solutions of this more general equation are very similar to those of equation (8). 
The equilibrium solution of equation (9) is p(t) = k/r. Solutions above the equilibrium 
solution increase, while those below it decrease. 

You should keep in mind that both of the models discussed in this section have their 
limitations. The model (5) of the falling object is valid only as long as the object is falling 
freely, without encountering any obstacles. If the velocity is large enough, the assumption 
that the frictional resistance is linearly proportional to the velocity has to be replaced with 
a nonlinear approximation (see Problem 21). The population model (8) eventually predicts 
negative numbers of mice (if p < 900) or enormously large numbers (if p > 900). Both of 
these predictions are unrealistic, so this model becomes unacceptable after a fairly short time 
interval. 


Constructing Mathematical Models. In applying differential equations to any of the 
numerous fields in which they are useful, it is necessary first to formulate the appropriate 
differential equation that describes, or models, the problem being investigated. In this section 
we have looked at two examples of this modeling process, one drawn from physics and 
the other from ecology. In constructing future mathematical models yourself, you should 
recognize that each problem is different, and that successful modeling cannot be reduced to the 
observance of a set of prescribed rules. Indeed, constructing a satisfactory model is sometimes 
the most difficult part of the problem. Nevertheless, it may be helpful to list some steps that 
are often part of the process: 


1. Identify the independent and dependent variables and assign letters to represent them. 
Often the independent variable is time. 


2. Choose the units of measurement for each variable. In a sense the choice of units is 
arbitrary, but some choices may be much more convenient than others. For example, we 
chose to measure time in seconds for the falling-object problem and in months for the 
population problem. 


3. Articulate the basic principle that underlies or governs the problem you are investigating. 
This may be a widely recognized physical law, such as Newton’s law of motion, or it 
may be a more speculative assumption that may be based on your own experience or 
observations. In any case, this step is likely not to be a purely mathematical one, but will 
require you to be familiar with the field in which the problem originates. 


4. Express the principle or law in step 3 in terms of the variables you chose in step 1. 
This may be easier said than done. It may require the introduction of physical constants 
or parameters (such as the drag coefficient in Example 1) and the determination of 
appropriate values for them. Or it may involve the use of auxiliary or intermediate 
variables that must then be related to the primary variables. 


5. Ifthe units agree, then your equation at least is dimensionally consistent, although it may 
have other shortcomings that this test does not reveal. 


6. In the problems considered here, the result of step 4 is a single differential equation, 
which constitutes the desired mathematical model. Keep in mind, though, that in more 
complex problems the resulting mathematical model may be much more complicated, 
perhaps involving a system of several differential equations, for example. 


Historical Background, Part I: Newton, Leibniz, and the Bernoullis. Without knowing 
something about differential equations and methods of solving them, it is difficult to appreciate 
the history of this important branch of mathematics. Further, the development of differential 
equations is intimately interwoven with the general development of mathematics and cannot 
be separated from it. Nevertheless, to provide some historical perspective, we indicate here 
some of the major trends in the history of the subject and identify the most prominent early 
contributors. The rest of the historical background in this section focuses on the earliest 
contributors from the seventeenth century. The story continues at the end of Section 1.2 with 
an overview of the contributions of Euler and other eighteenth-century (and early-nineteenth- 
century) mathematicians. More recent advances, including the use of computers and other 
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technologies, are summarized at the end of Section 1.3. Additional historical information is 
contained in footnotes scattered throughout the book and in the references listed at the end of 
the chapter. 

The subject of differential equations originated in the study of calculus by Isaac Newton 
(1643-1727) and Gottfried Wilhelm Leibniz (1646-1716) in the seventeenth century. Newton 
grew up in the English countryside, was educated at Trinity College, Cambridge, and became 
Lucasian Professor of Mathematics there in 1669. His epochal discoveries of calculus and of 
the fundamental laws of mechanics date to 1665. They were circulated privately among his 
friends, but Newton was extremely sensitive to criticism and did not begin to publish his results 
until 1687 with the appearance of his most famous book Philosophiae Naturalis Principia 
Mathematica. Although Newton did relatively little work in differential equations as such, his 
development of the calculus and elucidation of the basic principles of mechanics provided a 
basis for their applications in the eighteenth century, most notably by Euler (see Historical 
Background, Part II in Section 1.2). Newton identified three forms of first-order differential 
equations: dy/dx = f(x), dy/dx = f(y), and dy/dx = f(x, y). For the latter equation 
he developed a method of solution using infinite series when f(x, y) is a polynomial in x and 
y. Newton’s active research in mathematics ended in the early 1690s, except for the solution 
of occasional “challenge problems” and the revision and publication of results obtained much 
earlier. He was appointed Warden of the British Mint in 1696 and resigned his professorship a 
few years later. He was knighted in 1705 and, upon his death in 1727, became the first scientist 
buried in Westminster Abbey. 

Leibniz was born in Leipzig, Germany, and completed his doctorate in philosophy at the 
age of 20 at the University of Altdorf. Throughout his life he engaged in scholarly work in 
several different fields. He was mainly self-taught in mathematics, since his interest in this 
subject developed when he was in his twenties. Leibniz arrived at the fundamental results of 
calculus independently, although a little later than Newton, but was the first to publish them, 
in 1684. Leibniz was very conscious of the power of good mathematical notation and was 
responsible for the notation dy / dx for the derivative and for the integral sign. He discovered 
the method of separation of variables (Section 2.2) in 1691, the reduction of homogeneous 
equations to separable ones (Section 2.2, Problem 30) in 1691, and the procedure for solving 
first-order linear equations (Section 2.1) in 1694. He spent his life as ambassador and adviser 
to several German royal families, which permitted him to travel widely and to carry on an 
extensive correspondence with other mathematicians, especially the Bernoulli brothers. In the 
course of this correspondence many problems in differential equations were solved during the 
latter part of the seventeenth century. 

The Bernoulli brothers, Jakob (1654-1705) and Johann (1667-1748), of Basel, 
Switzerland did much to develop methods of solving differential equations and to extend 
the range of their applications. Jakob became professor of mathematics at Basel in 1687, 
and Johann was appointed to the same position upon his brother’s death in 1705. Both 
men were quarrelsome, jealous, and frequently embroiled in disputes, especially with each 
other. Nevertheless, both also made significant contributions to several areas of mathematics. 
With the aid of calculus, they solved a number of problems in mechanics by formulating 
them as differential equations. For example, Jakob Bernoulli solved the differential equation 
yz (a° / (b? y — a?) ) ue (see Problem 9 in Section 2.2) in 1690 and, in the same paper, first 
used the term “integral” in the modern sense. In 1694 Johann Bernoulli was able to solve the 
equation dy/dx = y/(ax) (see Problem 10 in Section 2.2). One problem that both brothers 
solved, and that led to much friction between them, was the brachistochrone problem (see 
Problem 24 in Section 2.3). The brachistochrone problem was also solved by Leibniz, Newton, 
and the Marquis de l'Hópital. It is said, perhaps apocryphally, that Newton learned of the 
problem late in the afternoon of a tiring day at the Mint and solved it that evening after dinner. 
He published the solution anonymously, but upon seeing it, Johann Bernoulli exclaimed, “Ah, 
I know the lion by his paw.” 

Daniel Bernoulli (1700-1782), son of Johann, migrated to St. Petersburg, Russia, as a 
young man to join the newly established St. Petersburg Academy, but returned to Basel in 
1733 as professor of botany and, later, of physics. His interests were primarily in partial 
differential equations and their applications. For instance, it is his name that is associated with 
the Bernoulli equation in fluid mechanics. He was also the first to encounter the functions that 
a century later became known as Bessel functions (Section 5.7). 
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Problems 


In each of Problems | through 4, draw a direction field for the 
given differential equation. Based on the direction field, determine the 
behavior of y as t — oo. If this behavior depends on the initial value 
of y att = 0, describe the dependency. 


O 1 
O 2. y=2y-3 
O 3 
O 4. y =1+2y 


In each of Problems 5 and 6, write down a differential equation of the 
form dy/dt = ay + b whose solutions have the required behavior as 
t — oo 


. y =3-2y 


oy —-—1-2y 


5. All solutions approach y — 2/3. 
6. All other solutions diverge from y = 2. 


In each of Problems 7 through 10, draw a direction field for the 
given differential equation. Based on the direction field, determine the 
behavior of y as t — oo. If this behavior depends on the initial value 
of y at f = 0, describe this dependency. Note that in these problems 


the equations are not of the form y’ = ay + b, and the behavior of 
their solutions is somewhat more complicated than for the equations 
in the text. 

@ 7. y'=y(4-y) 

@ 8 y =-y5-y) 


O 9. y=y 

Q 10. y =y -3° 

Consider the following list of differential equations, some of which 
produced the direction fields shown in Figures 1.1.5 through 1.1.10. 


In each of Problems 11 through 16, identify the differential equation 
that corresponds to the given direction field. 


a. y =2y-—1 
b. yy =2+y 
c. y =y-2 


d. y'= y(y +3) 
e. y =y(y-3) 


f. y =1+2y 
g y--2-y 
h. y = y(3—y) 
i y =1—2y 
j y =2-y 


11. The direction field of Figure 1.1.5. 
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| FIGURE 1.1.5 | Problem 11. 


12. The direction field of Figure 1.1.6. 
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| FIGURE 1.1.6 | Problem 12. 


13. The direction field of Figure 1.1.7. 
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Problem 13. 


14. The direction field of Figure 1.1.8. 
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15. The direction field of Figure 1.1.9. 
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| FIGURE 1.1.9 | Problem 15. 


16. The direction field of Figure 1.1.10. 
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17. A pond initially contains 1,000,000 gal of water and an unknown 
amount of an undesirable chemical. Water containing 0.01 grams of 
this chemical per gallon flows into the pond at a rate of 300 gal/h. The 
mixture flows out at the same rate, so the amount of water in the pond 
remains constant. Assume that the chemical is uniformly distributed 
throughout the pond. 


a. Write a differential equation for the amount of chemical in 
the pond at any time. 
b. How much of the chemical will be in the pond after a very 
long time? Does this limiting amount depend on the amount that 
was present initially? 
c. Write a differential equation for the concentration of the 
chemical in the pond at time t. Hint: The concentration is 
c=a/v =a(t)/10°. 
18. A spherical raindrop evaporates at a rate proportional to its 
surface area. Write a differential equation for the volume of the 
raindrop as a function of time. 


19. Newton’s law of cooling states that the temperature of an 
object changes at a rate proportional to the difference between the 
temperature of the object itself and the temperature of its surroundings 
(the ambient air temperature in most cases). Suppose that the ambient 
temperature is 70°F and that the rate constant is 0.05 (min)~!. Write a 
differential equation for the temperature of the object at any time. Note 
that the differential equation is the same whether the temperature of 
the object is above or below the ambient temperature. 
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20. A certain drug is being administered intravenously to a hospital 
patient. Fluid containing 5 mg/cm? of the drug enters the patient’s 
bloodstream at a rate of 100 cm?/h. The drug is absorbed by body 
tissues or otherwise leaves the bloodstream at a rate proportional to 
the amount present, with a rate constant of 0.4/h. 
a. Assuming that the drug is always uniformly distributed 
throughout the bloodstream, write a differential equation for the 
amount of the drug that is present in the bloodstream at any time. 
b. How much of the drug is present in the bloodstream after a 
long time? 
@ 21. For small, slowly falling objects, the assumption made in 
the text that the drag force is proportional to the velocity is a good one. 
For larger, more rapidly falling objects, it is more accurate to assume 
that the drag force is proportional to the square of the velocity.” 
a. Write a differential equation for the velocity of a falling 
object of mass m if the magnitude of the drag force is 
proportional to the square of the velocity and its direction is 
opposite to that of the velocity. 
b. Determine the limiting velocity after a long time. 
C. If m = 10 kg, find the drag coefficient so that the limiting 
velocity is 49 m/s. 
@ d. Using the data in part c, draw a direction field and compare 
it with Figure 1.1.3. 


In each of Problems 22 through 25, draw a direction field for the 
given differential equation. Based on the direction field, determine the 
behavior of y as t — oo. If this behavior depends on the initial value 
of y at t = 0, describe this dependency. Note that the right-hand sides 
of these equations depend on : as well as y; therefore, their solutions 
can exhibit more complicated behavior than those in the text. 


O 22. y--24:-y 
O 23. 
O 24. 


y eu wey 
y'23sint -14- y 


?See Lyle N. Long and Howard Weiss, “The Velocity Dependence of 
Aerodynamic Drag: A Primer for Mathematicians," American Mathematical 
Monthly 106 (1999), 2, pp. 127-135. 


12 Solutions of Some Differential Equations 


In the preceding section we derived the differential equations 
dv 
m mp 
and 
dp 


=rp—k. 
dt um 


(1) 


(2) 


Equation (1) models a falling object, and equation (2) models a population of field mice preyed 


on by owls. Both of these equations are of the general form 


(3) 


where a and b are given constants. We were able to draw some important qualitative 
conclusions about the behavior of solutions of equations (1) and (2) by considering the 
associated direction fields. To answer questions of a quantitative nature, however, we need 
to find the solutions themselves, and we now investigate how to do that. 
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Field Mice and Owls (continued) 


EXAMPLE 1 


Consider the equation 


(4) 


which describes the interaction of certain populations of field mice and owls (see equation (8) of 


Section 1.1). Find solutions of this equation. 


Solution 


we need to find functions p(t) that, when substituted into the equation, reduce 


identity. Here is one way to proceed. First, rewrite equation (4) in the form 


, 


To solve equation (4) 
itto an obvious 


(5) 


, 


p — 900 
2 


dp _ 
dt -— 


or, if p # 900, 


(6) 


1 
Y 


dp/dt 
p — 900 


By the chain rule the left-hand side of equation (6) is the derivative of In |p — 900| with respect to f, 


so we have 


d 
dt 


(7) 


2 


In|p — 900| 


Then, by integrating both sides of equation (7), we obtain 


(8) 


4c 
2 , 


In|p — 900| 


where C is an arbitrary constant of integration. Therefore, by taking the exponential of both sides of 


equation (8), we find that 


(9) 


|p — 900| = e /?*€ = eel? 


or 


(10) 


p — 900 = +e°e'/?, 


and finally 


(41) 


where c = +e is also an arbitrary (nonzero) constant. Note that the constant function p = 900 is 
also a solution of equation (5) and that it is contained in the expression (11) if we allow c to take the 


900 + ce!/?, 
value zero. Graphs of equation (11) for several values of c are shown in Figure 1.2.1. 
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Graphs of p = 900 + ce 


curve is a solution of dp/dt = 0.5p — 450. 


1.2 Solutions of Some Differential Equations 


Note that they have the character inferred from the direction field in Figure 1.1.4. For instance, 
solutions lying on either side of the equilibrium solution p = 900 tend to diverge from that 
solution. 


In Example 1 we found infinitely many solutions of the differential equation (4), 
corresponding to the infinitely many values that the arbitrary constant c in equation (11) might 
have. This is typical of what happens when you solve a differential equation. The solution 
process involves an integration, which brings with it an arbitrary constant, whose possible 
values generate an infinite family of solutions. 

Frequently, we want to focus our attention on a single member of the infinite family of 
solutions by specifying the value of the arbitrary constant. Most often, we do this indirectly by 
specifying instead a point that must lie on the graph of the solution. For example, to determine 
the constant c in equation (11), we could require that the population have a given value at a 
certain time, such as the value 850 at time ? = 0. In other words, the graph of the solution 
must pass through the point (0, 850). Symbolically, we can express this condition as 


p(0) = 850. (12) 
Then, substituting t = 0 and p = 850 into equation (11), we obtain 
850 = 900 + c. 
Hence c — —50, and by inserting this value into equation (11), we obtain the desired solution, 
namely, 
p = 900 — 50e"”?. (13) 


The additional condition (12) that we used to determine c is an example of an initial 
condition. The differential equation (4) together with the initial condition (12) forms an initial 
value problem. 

Now consider the more general problem consisting of the differential equation (3) 

dy 
"rm ay —b 
and the initial condition 


y(0) = yo, (14) 


where yo is an arbitrary initial value. We can solve this problem by the same method as in 
Example 1. If a 4 0 and y 4 b/a, then we can rewrite equation (3) as 


dy /dt 
»/ z =a. (15) 
ya 
By integrating both sides, we find that 
b 
In x0 - 2|-ar «c. (16) 
a 


where C is an arbitrary constant. Then, taking the exponential of both sides of equation (16) 
and solving for y, we obtain 


b at 
y(t) = = ce", (17) 


where c = ce is also an arbitrary constant. Observe that c = 0 corresponds to the equilibrium 
solution y(t) = b/a. Finally, the initial condition (14) requires that c = yo — (b/a), so the 
solution of the initial value problem (3), (14) is 


b b\ u 
y(t) =- + | yo-— Je”. (18) 
a a 


For a + 0 the expression (17) contains all possible solutions of equation (3) and is called 
the general solution.* The geometric representation of the general solution (17) is an infinite 
family of curves called integral curves. Each integral curve is associated with a particular 


If a = 0, then the solution of equation (3) is not given by equation (17). We leave it to you to find the general solution 
in this case. 
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value of c and is the graph of the solution corresponding to that value of c. Satisfying an initial 
condition amounts to identifying the integral curve that passes through the given initial point. 

To relate the solution (18) to equation (2), which models the field mouse population, we 
need only replace a by the growth rate r and replace b by the predation rate k; we assume that 
r > Oandk > 0. Then the solution (18) becomes 


k k rt 
P(t) =—+ | po— — Je", (19) 
r r 


where po is the initial population of field mice. The solution (19) confirms the conclusions 
reached on the basis of the direction field and Example 1. If pp = k/r, then from equation (19) 
it follows that p(t) = k/r for all t; this is the constant, or equilibrium, solution. If po 4 
k/ r, then the behavior of the solution depends on the sign of the coefficient po — k/r of the 
exponential term in equation (19). If pp > k/r, then p grows exponentially with time t; if 
po < k/r, then p decreases and becomes zero (at a finite time), corresponding to extinction 
of the field mouse population. Negative values of p, while possible for the expression (19), 
make no sense in the context of this particular problem. 

To put the falling-object equation (1) in the form (3), we must identify a with —^ / m and 
b with —g. Observe that assuming y > 0 and m > 0 implies that a < 0 and b < 0. Making 
these substitutions in the solution (18), we obtain 


v(t) = ME E: (» — 2E Jen, (20) 
E Y 


where vọ is the initial velocity. Again, this solution confirms the conclusions reached in 
Section 1.1 on the basis of a direction field. There is an equilibrium, or constant, solution 
v(t) = mg/^, and all other solutions tend to approach this equilibrium solution. The speed 
of convergence to the equilibrium solution is determined by the exponent —^ / m. Thus, for 
a given mass m, the velocity approaches the equilibrium value more rapidly as the drag 
coefficient y increases. 


EXAMPLE2 | A Falling Object (continued) 


Suppose that, as in Example 2 of Section 1.1, we consider a falling object of mass m = 10 kg and 
drag coefficient y = 2 kg/s. Then the equation of motion (1) becomes 


i egg oe QD 
dt /— 5 


Suppose this object is dropped from a height of 300 m. Find its velocity at any time t. How long will 
it take to fall to the ground, and how fast will it be moving at the time of impact? 
Solution: 


The first step is to state an appropriate initial condition for equation (21). The word "dropped" in the 
statement of the problem suggests that the object starts from rest, that is, its initial velocity is zero, 
so we will use the initial condition 


v(0) = 0. (22) 


The solution of equation (21) can be found by substituting the values of the coefficients into the 
solution (20), but we will proceed instead to solve equation (21) directly. First, rewrite the equation as 


dv/dt 1 
-—-. (23) 
y — 49 5 
By integrating both sides, we obtain 
t 
In|v(t) — 49| == H (24) 
and then the general solution of equation (21) is 
v(t) = 49 +ce"/, (25) 


where the constant c is arbitrary. To determine the particular value of c that corresponds to the initial 
condition (22), we substitute £ = 0 and v = 0 into equation (25), with the result that c = —49. Then 
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the solution of the initial value problem (21), (22) is 
v(t) = 49(1— e7/5). (26) 


Equation (26) gives the velocity of the falling object at any positive time after being dropped—until 
it hits the ground, of course. 

Graphs of the solution (25) for several values of c are shown in Figure 1.2.2, with the solution 
(26) shown by the green curve. It is evident that, regardless of the initial velocity of the object, all 
solutions tend to approach the equilibrium solution v(t) = 49. This confirms the conclusions we 
reached in Section 1.1 on the basis of the direction fields in Figures 1.1.2 and 1.1.3. 
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Graphs of the solution (25), v = 49 + ce /5. for several values 


of c. The green curve corresponds to the initial condition v(0) = 0. The point 
(10.51, 43.01) shows the velocity when the object hits the ground. 


To find the velocity of the object when it hits the ground, we need to know the time at which 
impact occurs. In other words, we need to determine how long it takes the object to fall 300 m. To 
do this, we note that the distance x the object has fallen is related to its velocity v by the differential 
equation v — dx/dt, or 


= = 49(1— e7'/5). (27) 


Consequently, by integrating both sides of equation (27) with respect to t, we have 
x = 49t + 245e™"/5 + k, (28) 


where k is an arbitrary constant of integration. The object starts to fall when t = 0, so we know that 
x = 0 when t = 0. From equation (28) it follows that k = —245, so the distance the object has fallen 
at time ¢ is given by 


x = 49t + 245e!/5 — 245. (29) 


Let T be the time at which the object hits the ground; then x = 300 when t = T. By substituting 
these values in equation (29), we obtain the equation 


49T --245e 1/5 — 245 = 300. (30) 
The value of T satisfying equation (30) can be approximated by a numerical process^ using 
a calculator or other computational tool, with the result that T = 10.51s. At this time, the 
corresponding velocity vr is found from equation (26) to be vr = 43.01m/s. The point 


(10.51, 43.01) is also shown in Figure 1.2.2. 


^A computer algebra system provides this capability; many calculators also have built-in routines for solving such 
equations. 
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Further Remarks on Mathematical Modeling. Up to this point we have related our 
discussion of differential equations to mathematical models of a falling object and of a 
hypothetical relation between field mice and owls. The derivation of these models may have 
been plausible, and possibly even convincing, but you should remember that the ultimate test 
of any mathematical model is whether its predictions agree with observations or experimental 
results. We have no actual observations or experimental results to use for comparison purposes 
here, but there are several sources of possible discrepancies. 

In the case of the falling object, the underlying physical principle (Newton’s laws of 
motion) is well established and widely applicable. However, the assumption that the drag 
force is proportional to the velocity is less certain. Even if this assumption is correct, the 
determination of the drag coefficient ^; by direct measurement presents difficulties. Indeed, 
sometimes one finds the drag coefficient indirectly —for example, by measuring the time of 
fall from a given height and then calculating the value of y that predicts this observed time. 

The model of the field mouse population is subject to various uncertainties. The 
determination of the growth rate r and the predation rate k depends on observations of actual 
populations, which may be subject to considerable variation. The assumption that r and k are 
constants may also be questionable. For example, a constant predation rate becomes harder 
to sustain as the field mouse population becomes smaller. Further, the model predicts that a 
population above the equilibrium value will grow exponentially larger and larger. This seems 
at variance with the behavior of actual populations; see the further discussion of population 
dynamics in Section 2.5. 

If the differences between actual observations and a mathematical model’s predictions are 
too great, then you need to consider refining the model, making more careful observations, 
or perhaps both. There is almost always a tradeoff between accuracy and simplicity. Both are 
desirable, buta gain in one usually involves aloss in the other. However, even if a mathematical 
model is incomplete or somewhat inaccurate, it may nevertheless be useful in explaining 
qualitative features of the problem under investigation. It may also give satisfactory results 
under some circumstances but not others. Thus you should always use good judgment and 
common sense in constructing mathematical models and in using their predictions. 
Historical Background, Part Il: Euler, Lagrange, and Laplace. The greatest mathe- 
matician of the eighteenth century, Leonhard Euler (1707-1783), grew up near Basel, 
Switzerland and was a student of Johann Bernoulli. He followed his friend Daniel Bernoulli to 
St. Petersburg in 1727. For the remainder of his life he was associated with the St. Petersburg 
Academy (1727-1741 and 1766-1783) and the Berlin Academy (1741-1766). Losing sight 
in his right eye in 1738, and in his left eye in 1766, did not stop Euler from being one of the 
most prolific mathematicians of all time. In addition to publishing more than 500 books and 
papers during his life, an additional 400 have appeared posthumously. 

Of particular interest here is Euler’s formulation of problems in mechanics in 
mathematical language and his development of methods of solving these mathematical 
problems. Lagrange said of Euler’s work in mechanics, “The first great work in which analysis 
is applied to the science of movement.” Among other things, Euler identified the condition 
for exactness of first-order differential equations (Section 2.6) in 1734-1735, developed the 
theory of integrating factors (Section 2.6) in the same paper, and gave the general solution of 
homogeneous linear differential equations with constant coefficients (Sections 3.1, 3.3, 3.4, 
and 4.2) in 1743. He extended the latter results to nonhomogeneous differential equations in 
1750-1751. Beginning about 1750, Euler made frequent use of power series (Chapter 5) in 
solving differential equations. He also proposed a numerical procedure (Sections 2.7 and 8.1) 
in 1768-1769, made important contributions in partial differential equations, and gave the first 
systematic treatment of the calculus of variations. 

Joseph-Louis Lagrange (1736-1813) became professor of mathematics in his native 
Turin, Italy, at the age of 19. He succeeded Euler in the chair of mathematics at the Berlin 
Academy in 1766 and moved on to the Paris Academy in 1787. He is most famous for his 
monumental work Mécanique analytique, published in 1788, an elegant and comprehensive 
treatise of Newtonian mechanics. With respect to elementary differential equations, Lagrange 
showed in 1762-1765 that the general solution of a homogeneous nth order linear differential 
equation is a linear combination of n independent solutions (Sections 3.2 and 4.1). Later, in 
1774—1775, he offered a complete development of the method of variation of parameters 
(Sections 3.6 and 4.4). Lagrange is also known for fundamental work in partial differential 
equations and the calculus of variations. 
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Pierre-Simon de Laplace (1749-1827) lived in Normandy, France, as a boy but arrived in 
Paris in 1768 and quickly made his mark in scientific circles, winning election to the Académie 
des Sciences in 1773. He was preeminent in the field of celestial mechanics; his greatest work, 
Traité de mécanique céleste, was published in five volumes between 1799 and 1825. Laplace’s 
equation is fundamental in many branches of mathematical physics, and Laplace studied it 
extensively in connection with gravitational attraction. The Laplace transform (Chapter 6) 
is also named for him, although its usefulness in solving differential equations was not 
recognized until much later. 

By the end of the eighteenth century many elementary methods of solving ordinary 
differential equations had been discovered. In the nineteenth century interest turned more 
toward the investigation of theoretical questions of existence and uniqueness and to the 
development of less elementary methods such as those based on power series expansions (see 
Chapter 5). These methods find their natural setting in the complex plane. Consequently, they 
benefitted from, and to some extent stimulated, the more or less simultaneous development of 
the theory of complex analytic functions. Partial differential equations also began to be studied 
intensively, as their crucial role in mathematical physics became clear. In this connection a 
number of functions, arising as solutions of certain ordinary differential equations, occurred 
repeatedly and were studied exhaustively. Known collectively as higher transcendental 
functions, many of them are associated with the names of mathematicians, including Bessel 
(Section 5.7), Legendre (Section 5.3), Hermite (Section 5.2), Chebyshev (Section 5.3), Hankel, 


and many others. 


Problems 


@ 1. Solve each of the following initial value problems and plot 
the solutions for several values of yọ. Then describe in a few words 
how the solutions resemble, and differ from, each other. 


a. dy/dt=—y+5, y(0) = yo 
b. dy/dt 2 —2y +5, y(0) = yo 
c. dy/dt 2 —2y +10, y(0) = yo 


Q 2. Follow the instructions for Problem 1 for the following 
initial-value problems: 
a. dy/dt=y—5, y(0) = yo 
O b. dy/dt =2y—5, y(0) = yo 
C. dy/dt=2y—10, y(0) = yo 
3. Consider the differential equation 
dy/dt = —ay +b, 
where both a and b are positive numbers. 
a. Find the general solution of the differential equation. 
Q b. Sketch the solution for several different initial conditions. 
c. Describe how the solutions change under each of the 
following conditions: 


i. a increases. 
ii. b increases. 


iii. Both a and b increase, but the ratio b/a remains the same. 


4. Consider the differential equation dy /dt = ay — b. 
a. Find the equilibrium solution ye. 
b. Let Y(t) = y — ye; thus Y(t) is the deviation from the 
equilibrium solution. Find the differential equation satisfied by 
Y(t). 
5. Undetermined Coefficients. Here is an alternative way to solve 
the equation 
dy 
2^ ay — b. (31) 
a. Solve the simpler equation 
dy 
PE 


ay. (32) 


Call the solution y(t). 
b. Observe that the only difference between equations (31) and 
(32) is the constant —b in equation (31). Therefore, it may seem 
reasonable to assume that the solutions of these two equations 
also differ only by a constant. Test this assumption by trying 
to find a constant k such that y = y,(t) + k is a solution of 
equation (31). 
C. Compare your solution from part b with the solution given in 
the text in equation (17). 
Note: This method can also be used in some cases in which the 
constant b is replaced by a function g(t). It depends on whether you 
can guess the general form that the solution is likely to take. This 
method is described in detail in Section 3.5 in connection with second- 
order equations. 


6. Use the method of Problem 5 to solve the equation 


dy 
— =—ay+b. 
d A 


7. The field mouse population in Example 1 satisfies the 


differential equation 
S o es 

dt 2 
a. Find the time at which the population becomes extinct if 
p(C0) = 850. 
b. Find the time of extinction if p(0) = po, where 0 < po < 
900. 
Q c. Find the initial population po if the population is to 
become extinct in 1 year. 


8. The falling object in Example 2 satisfies the initial value 
problem 
dv 
dt 
a. Find the time that must elapse for the object to reach 98% of 
its limiting velocity. 
b. How far does the object fall in the time found in part a? 


9.8—~, v(0) =0 
—9.8—-, v(0) =0. 
5 
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9. Consider the falling object of mass 10 kg in Example 2, but 
assume now that the drag force is proportional to the square of the 
velocity. 

a. If the limiting velocity is 49 m/s (the same as in Example 2), 
show that the equation of motion can be written as 
e e 1 (49 -y?). 
dt | 245 
Also see Problem 21 of Section 1.1. 
b. If v(0) — O, find an expression for v(t) at any time. 
Q c. Plot your solution from part b and the solution (26) from 
Example 2 on the same axes. 
d. Based on your plots in part c, compare the effect of a 
quadratic drag force with that of a linear drag force. 
e. Find the distance x(t) that the object falls in time f. 
Q f. Find the time T it takes the object to fall 300 m. 


10. A radioactive material, such as the isotope thorium-234, 
disintegrates at a rate proportional to the amount currently present. 
If Q(t) is the amount present at time f, then d Q/dt = —r Q, where 
r > Ois the decay rate. 
a. If 100 mg of thorium-234 decays to 82.04 mg in 1 week, 
determine the decay rate r. 
b. Find an expression for the amount of thorium-234 present at 
any time f. 
c. Find the time required for the thorium-234 to decay to one- 
half its original amount. 


11. The half-life of a radioactive material is the time required for an 
amount of this material to decay to one-half its original value. Show 
that for any radioactive material that decays according to the equation 
Q' = —r Q, the half-life 7 and the decay rate r satisfy the equation 
rz 2n. 


12. According to Newton's law of cooling (see Problem 19 of 
Section 1.1), the temperature u(t) of an object satisfies the differential 
equation 
du 
ae —k(u — T), 
where T is the constant ambient temperature and k is a positive 
constant. Suppose that the initial temperature of the object is 
u(0) = ug. 
a. Find the temperature of the object at any time. 
b. Let 7 be the time at which the initial temperature difference 
ug — T has been reduced by half. Find the relation between k 
and 7. 


13. Consider an electric circuit containing a capacitor, resistor, and 


battery; see Figure 1.2.3. The charge Q(t) on the capacitor satisfies 
the equation? 


where R is the resistance, C is the capacitance, and V is the constant 
voltage supplied by the battery. 
O a. If Q(0) = 0, find Q(t) at any time f, and sketch the graph 
of Q versus f. 
b. Find the limiting value Qz that Q(t) approaches after a long 
time. 
O c. Suppose that Q(rj)) = Qz and that at time t = t the 
battery is removed and the circuit is closed again. Find Q(t) for 
t > tı and sketch its graph. 


C 


| FIGURE 1.2.3 | The electric circuit of Problem 13. 


@ 14. A pond containing 1,000,000 gal of water is initially free 
of a certain undesirable chemical (see Problem 17 of Section 1.1). 
Water containing 0.01 g/gal of the chemical flows into the pond at 
a rate of 300 gal/h, and water also flows out of the pond at the same 
rate. Assume that the chemical is uniformly distributed throughout the 


pond. 
a. Let Q(t) be the amount of the chemical in the pond at time 


t. Write down an initial value problem for Q(t). 

b. Solve the problem in part a for Q(t). How much chemical is 
in the pond after 1 year? 

c. At the end of 1 year the source of the chemical in the pond 
is removed; thereafter pure water flows into the pond, and the 
mixture flows out at the same rate as before. Write down the 
initial value problem that describes this new situation. 

d. Solvethe initial value problem in part c. How much chemical 
remains in the pond after 1 additional year (2 years from the 
beginning of the problem)? 

e. How long does it take for Q(t) to be reduced to 10 g? 

Q f. Plot Q(t) versus t for 3 years. 


>This equation results from Kirchhoff’s laws, which are discussed in Section 
3.7. 
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The main purposes of this book are to discuss some of the properties of solutions of differential 
equations and to present some of the methods that have proved effective in finding solutions 
or, in some cases, in approximating them. To provide a framework for our presentation, 
we describe here several useful ways of classifying differential equations. Mastery of this 
vocabulary is essential to selecting appropriate solution methods and to describing properties 
of solutions of differential equations that you encounter later in this book—and in the real 


world. 


Ordinary and Partial Differential Equations. 


One important classification is based on 


whether the unknown function depends on a single independent variable or on several 


1.3 Classification of Differential Equations 


independent variables. In the first case, only ordinary derivatives appear in the differential 

equation, and it is said to be an ordinary differential equation. In the second case, the 

derivatives are partial derivatives, and the equation is called a partial differential equation. 

All the differential equations discussed in the preceding two sections are ordinary 
differential equations. Another example of an ordinary differential equation is 

d’ Q(t) dQ(r) 1 

L R t) = E(t), 1 

dp tk ga, te Q0 - EO (1) 

for the charge Q(t) ona capacitor in a circuit with capacitance C, resistance R, and inductance 

L; this equation is derived in Section 3.7. Typical examples of partial differential equations 

are the heat conduction equation 


92 UO t) = Ou(x, t) 
Ox? Ot 


(2) 


and the wave equation 


19 6 t) = O?u(x, t) | 
Ox? Or? 
Here, a? and a? are certain physical constants. Note that in both equations (2) and (3) the 
dependent variable u depends on the two independent variables x and t. The heat conduction 
equation describes the conduction of heat in a solid body, and the wave equation arises in a 
variety of problems involving wave motion in solids or fluids. 


(3) 


Systems of Differential Equations. Another classification of differential equations de- 
pends on the number of unknown functions that are involved. If there is a single function 
to be determined, then one differential equation is sufficient. However, if there are two or 
more unknown functions, then a system of differential equations is required. For example, the 
Lotka-Volterra, or predator-prey, equations are important in ecological modeling. They have 
the form 


dx 

—— = ax — ax 

dt y 

d 

Y = ey + yxy, (4) 
dt 


where x(t) and y(t) are the respective populations of the prey and predator species. The 
positive constants a, a,c, and y are based on empirical observations and depend on the 
particular species being studied. Systems of equations are discussed in Chapters 7 and 9; 
in particular, the Lotka-Volterra equations are examined in Section 9.5. In some areas of 
application it is not unusual to encounter very large systems containing hundreds, or even 
many thousands, of differential equations. 


Order. The order of a differential equation is the order of the highest derivative that appears 
in the equation. The equations in the preceding sections are all first-order equations, whereas 
equation (1) is a second-order equation. Equations (2) and (3) are also second-order partial 
differential equations. More generally, the equation 


F(t,w(t),w((t), ... i (0) =0 (5) 


is an ordinary differential equation of the n'" order. Equation (5) expresses a relation between 
the independent variable ż and the values of the function u and its first n derivatives 


u', u", ... , wu. It is convenient and customary in differential equations to write y for u(t), 
with y’, y", ... , y? standing for u'(1), u(t), ... , u(t). Thus equation (5) is written as 
EIE T a sug. (6 


For example, 
y" TE 2e! y" Ẹ yy! = tt (7) 


is a third-order differential equation for y = u(t). Occasionally, other letters will be used 
instead of ż and y for the independent and dependent variables; the meaning should be clear 
from the context. 
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We assume that it is always possible to solve a given ordinary differential equation for 
the highest derivative, obtaining 


ye ac Yy, y”, a. ge), (8) 


This is mainly to avoid the ambiguity that may arise because a single equation of the form (6) 
may correspond to several equations of the form (8). For example, the equation 


(y)? +ty' +4y =0 (9) 
leads to the two equations 


—t+./t? — 16 —t — J/t? — 16 
y= SE or ys E, (10) 


Linear and Nonlinear Equations. A crucial classification of differential equations is 
whether they are linear or nonlinear. The ordinary differential equation 


Fai agi) S0 


is said to be linear if F is a linear function of the variables y, y’, ... , y"; a similar definition 
applies to partial differential equations. Thus the general linear ordinary differential equation 
of order n is 


aot) Y? c a (0 y" 7P +- H an(t)y = g(t). (11) 


Most of the equations you have seen thus far in this book are linear; examples are 
the equations in Sections 1.1 and 1.2 describing the falling object and the field mouse 
population. Similarly, in this section, equation (1) is a linear ordinary differential equation 
and equations (2) and (3) are linear partial differential equations. An equation that is not of the 
form (11) is a nonlinear equation. Equation (7) is nonlinear because of the term yy’. Similarly, 
each equation in the system (4) is nonlinear because of the terms that involve the product of 
the two unknown functions xy. 

A simple physical problem that leads to a nonlinear differential equation is the oscillating 
pendulum. The angle 6 = 60(t) that an oscillating pendulum of length L makes with the 
vertical direction (see Figure 1.3.1) satisfies the equation 


amy sin 0 = 0, (12) 


whose derivation is outlined in Problems 22 through 24. The presence of the term involving 
sin 0 makes equation (12) nonlinear. 


| FIGURE 1.3.1 | An oscillating pendulum. 


The mathematical theory and methods for solving linear equations are highly developed. 
In contrast, for nonlinear equations the theory is more complicated, and methods of solution 
are less satisfactory. In view of this, it is fortunate that many significant problems lead to linear 
ordinary differential equations or can be approximated by linear equations. For example, for 
the pendulum, if the angle 0 is small, then sin 0 = 0 and equation (12) can be approximated 
by the linear equation 


— +20 =0. (13) 


This process of approximating a nonlinear equation by a linear one is called linearization; it 
is an extremely valuable way to deal with nonlinear equations. Nevertheless, there are many 
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physical phenomena that simply cannot be represented adequately by linear equations. To 
study these phenomena, it is essential to deal with nonlinear equations. 

In an elementary text it is natural to emphasize the simpler and more straightforward 
parts of the subject. Therefore, the greater part of this book is devoted to linear equations and 
various methods for solving them. However, Chapters 8 and 9, as well as parts of Chapter 2, are 
concerned with nonlinear equations. Whenever it is appropriate, we point out why nonlinear 
equations are, in general, more difficult and why many of the techniques that are useful in 
solving linear equations cannot be applied to nonlinear equations. 


Solutions. A solution of the n™ order ordinary differential equation (8) on the interval 
a < t < (isa function ¢ such that 6’, 6”, ... , 9? exist and satisfy 


eq) = f (ret, 40, ... 79 (0) (14) 


for every t ina < t < 5f. Unless stated otherwise, we assume that the function f of 
equation (8) is a real-valued function, and we are interested in obtaining real-valued solutions 
y= (1). 

Recall that in Section 1.2 we found solutions of certain equations by a process of direct 
integration. For instance, we found that the equation 


d 
T = A — 450 (15) 
has the solution 
p(t) = 900 + ce'?, (16) 


where c is an arbitrary constant. 

It is often not so easy to find solutions of differential equations. However, if you find 
a function that you think may be a solution of a given equation, it is usually relatively easy 
to determine whether the function is actually a solution: just substitute the function into the 
equation. 

For example, in this way it is easy to show that the function y;(t) = cost is a solution of 


y’+y=0 (17) 


for all t. To confirm this, observe that yj (t+) = —sint and y} (t) = —cos t; then it follows that 
yi(t) + y,(t) = 0. In the same way you can easily show that y2(t) = sint is also a solution 
of equation (17). 

Of course, this does not constitute a satisfactory way to solve most differential equations, 
because there are far too many possible functions for you to have a good chance of finding the 
correct one by a random choice. Nevertheless, you should realize that you can verify whether 
any proposed solution is correct by substituting it into the differential equation. This can be a 
very useful check; it is one that you should make a habit of considering. 


Some Important Questions. Although for the differential equations (15) and (17) we are 
able to verify that certain simple functions are solutions, in general we do not have such 
solutions readily available. Thus a fundamental question is the following: Does an equation of 
the form (8) always have a solution? The answer is “No.” Merely writing down an equation 
of the form (8) does not necessarily mean that there is a function y = $(t) that satisfies it. 
So, how can we tell whether some particular equation has a solution? This is the question of 
existence of a solution, and it is answered by theorems stating that under certain restrictions 
on the function f in equation (8), the equation always has solutions. This is not a purely 
theoretical concern for at least two reasons. If a problem has no solution, we would prefer to 
know that fact before investing time and effort in a vain attempt to solve the problem. Further, 
if a sensible physical problem is modeled mathematically as a differential equation, then the 
equation should have a solution. If it does not, then presumably there is something wrong with 
the formulation. In this sense an engineer or scientist has some check on the validity of the 
mathematical model. 

If we assume that a given differential equation has at least one solution, then we may 
need to consider how many solutions it has, and what additional conditions must be specified 
to single out a particular solution. This is the question of uniqueness. In general, solutions 


19 


20 


CHAPTER 1 Introduction 


of differential equations contain one or more arbitrary constants of integration, as does the 
solution (16) of equation (15). Equation (16) represents an infinity of functions corresponding 
to the infinity of possible choices of the constant c. As we saw in Section 1.2, if p is specified 
at some time f, this condition will determine a specific value for c; even so, we have not yet 
ruled out the possibility that there may be other solutions of equation (15) that also have the 
prescribed value of p at the prescribed time t. As in the question of existence of solutions, 
the issue of uniqueness has practical as well as theoretical implications. If we are fortunate 
enough to find a solution of a given problem, and if we know that the problem has a unique 
solution, then we can be sure that we have completely solved the problem. If there may be 
other solutions, then perhaps we should continue to search for them. 

A third important question is: Given a differential equation of the form (8), can we actually 
determine a solution, and if so, how? Note that if we find a solution of the given equation, we 
have at the same time answered the question of the existence of a solution. However, without 
knowledge of existence theory we might, for example, use a computer to find a numerical 
approximation to a "solution" that does not exist. On the other hand, even though we may 
know that a solution exists, it may be that the solution is not expressible in terms of the usual 
elementary functions — polynomial, trigonometric, exponential, logarithmic, and hyperbolic 
functions. Unfortunately, this is the situation for most differential equations. Thus, we discuss 
both elementary methods that can be used to obtain exact solutions of certain relatively simple 
problems, and also methods of a more general nature that can be used to find approximations 
to solutions of more difficult problems. 


Technology Use in Differential Equations. Technology provides many extremely 
valuable tools for the study of differential equations. For many years computers have been 
used to execute numerical algorithms, such as those described in Chapter 8, to construct 
numerical approximations to solutions of differential equations. These algorithms have been 
refined to an extremely high level of generality and efficiency. A few lines of computer code, 
written in a high-level programming language and executed (often within a few seconds) 
on a relatively inexpensive computer, suffice to approximate to a high degree of accuracy 
the solutions of a wide range of differential equations. More sophisticated routines are also 
readily available. These routines combine the ability to handle very large and complicated 
systems with numerous diagnostic features that alert the user to possible problems as they are 
encountered. 

The usual output from a numerical algorithm is a table of numbers, listing selected values 
of the independent variable and the corresponding values of the dependent variable. With 
appropriate software it is easy to display the solution of a differential equation graphically, 
whether the solution has been obtained numerically or as the result of an analytical procedure 
of some kind. Such a graphical display is often much more illuminating and helpful in 
understanding and interpreting the solution of a differential equation than a table of numbers or 
a complicated analytical formula. There are on the market several well-crafted and relatively 
inexpensive special-purpose software packages for the graphical investigation of differential 
equations. The increased power and sophistication of modern smartphones, tablets, and other 
mobile devices have brought powerful computational and graphical capability within the reach 
of individual students. You should consider, in the light of your own circumstances, how best 
to take advantage of the available computing resources. You will surely find it enlightening to 
do so. 

Another aspect of computer use that is very relevant to the study of differential equations is 
the availability of extremely powerful and general software packages that can perform a wide 
variety of mathematical operations. Among these are Maple, Mathematica, and MATLAB, 
each of which can be used on various kinds of personal computers or workstations. All 
three of these packages can execute extensive numerical computations and have versatile 
graphical facilities. Maple and Mathematica also have very extensive analytical capabilities. 
For example, they can perform the analytical steps involved in solving many differential 
equations, often in response to a single command. Anyone who expects to deal with differential 
equations in more than a superficial way should become familiar with at least one of these 
products and explore the ways in which it can be used. 

For you, the student, these computing resources have an effect on how you should study 
differential equations. To become confident in using differential equations, it is essential to 
understand how the solution methods work, and this understanding is achieved, in part, by 
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working out a sufficient number of examples in detail. However, eventually you should plan 
to utilize appropriate computational tools to complete as many as possible of the routine 
(often repetitive) details, while you focus on the proper formulation of the problem and on 
the interpretation of the solution. Our viewpoint is that you should always try to use the 
best methods and tools available for each task. In particular, you should strive to combine 
numerical, graphical, and analytical methods so as to attain maximum understanding of the 
behavior of the solution and of the underlying process that the problem models. You should 
also remember that some tasks can best be done with pencil and paper, while others require 
the use of some sort of computational technology. Good judgment is often needed in selecting 
an effective combination. 


Historical Background, Part Ill: Recent and Ongoing Advances. The numerous 
differential equations that resisted solution by analytical means led to the investigation of 
methods of numerical approximation (see Chapter 8). By 1900 fairly effective numerical 
integration methods had been devised, but their implementation was severely restricted by the 
need to execute the computations by hand or with very primitive computing equipment. Since 
World War II the development of increasingly powerful and versatile computers has vastly 
enlarged the range of problems that can be investigated effectively by numerical methods. 
Extremely refined and robust numerical integrators were developed during the same period and 
now are readily available, even on smartphones and other mobile devices. These technological 
advances have brought the ability to solve a great many significant problems within the reach 
of individual students. 

Another characteristic of modern differential equations is the creation of geometric or 
topological methods, especially for nonlinear equations. The goal is to understand at least the 
qualitative behavior of solutions from a geometrical, as well as from an analytical, point of 
view. If more detailed information is needed, it can usually be obtained by using numerical 
approximations. An introduction to geometric methods appears in Chapter 9. We conclude 
this brief historical review with two examples that illustrate how computational and real-world 
experiences have motivated important analytical and theoretical discoveries. 

In 1834 John Scott Russell (1808-1882), a Scottish civil engineer, was conducting 
experiments to determine the most efficient design for canal boats when he noticed that 
“when the boat suddenly stopped” the water being pushed by the boat “accumulated round 
the prow of the vessel in a state of violent agitation, then suddenly leaving it [the boat] 
behind, [the water] rolled forward with great velocity, assuming the form of a large solitary 
elevation, a rounded, smooth and well-defined heap of water.”® Many mathematicians did 
not believe that the solitary traveling waves reported by Russell existed. These objections 
were silenced when the doctoral dissertation of Dutch mathematician Gustav de Vries 
(1866-1934) included a nonlinear partial differential equation model for water waves in a 
shallow canal. Today these equations are known as the Korteweg-de Vries (KdV) equations. 
Diederik Johannes Korteweg (1848-1941) was de Vries's thesis advisor. Unknown to either 
Korteweg or de Vries, their Korteweg-de Vries model appeared as a footnote ten years 
earlier in French mathematician Joseph Valentin Boussinesq's (1842-1929) 680-page treatise 
Essai sur la théorie des eaux courantes. The work of Boussinesq and of Korteweg and 
deVries remained largely unnoticed until two Americans, physicist Norman J. Zabusky 
(1929—) and mathematician Martin David Kruskal (1925—2006), used computer simulations 
to discover, in 1965, that all solutions of the KdV equations eventually consist of a finite 
set of localized traveling waves. Today, nearly 200 years after Russell's observations and 50 
years after the computational experiments of Zabusky and Kruskal, the study of “solitons” 
remains an active area of differential equations research. Other early contributors to nonlinear 
wave propagation include David Hilbert (German, 1862-1943), Richard Courant (German- 
American, 1888-1972), and John von Neumann (Hungarian-American, 1903-1957); we will 
encounter some of these ideas again in Chapter 9. 

Computational results were also an essential element in the discovery of "chaos theory." 
In 1961, Edward Lorenz (1917—2008), an American mathematician and meteorologist at the 
Massachusetts Institute of Technology, was developing weather prediction models when he 
Observed different results upon restarting a simulation in the middle of the time period using 


9*Report on Waves,” in Proceedings of the Fourteenth Meeting of the British Association for the Advancement of 
Science, 1845, pp. 311—390, plus plates 47-57, http://www.macs.hw.ac.uk/--chris/Scott-Russell/SR44.pdf. 
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CHAPTER 1 Introduction 


Problems 


In each of Problems | through 4, determine the order of the given 
differential equation; also state whether the equation is linear or 


nonlinear. 


1. 


d'y | dy 
2 

t i 
dt? * dt 


2 
(ee) 4.49. y gt 


dt? dt 
d'y dy | d?y 


+2y =sint 


previously computed results. (Lorenz restarted the computation with three-digit approximate 
solutions, not the six-digit approximations that were stored in the computer.) In 1976 the 
Australian mathematician Sir Robert M. May (1938-) introduced and analyzed the logistic 
map, showing that there are special values of the problem's parameter where the solutions 
undergo drastic changes. The common trait that small changes in the problem produce large 
changes in the solution is one of the defining characteristics of chaos. May's logistic map is 
discussed in more detail in Section 2.9. Other classical examples of what we now recognize as 
"chaos" include the work by French mathematician Henri Poincaré (1854-1912) on planetary 
motion and the studies of turbulent fluid flow by Soviet mathematician Andrey Nikolaevich 
Kolmogorov (1903-1987), American mathematician Mitchell Feigenbaum (1944—), and 
many others. In addition to these and other classical examples of chaos, new examples continue 
to be found. 

Solitons and chaos are just two of many examples where computers, and especially 
computer graphics, have given a new impetus to the study of systems of nonlinear differential 
equations. Other unexpected phenomena (Section 9.8), such as strange attractors (David 
Ruelle, Belgium, 1935—) and fractals (Benoit Mandelbrot, Poland, 1924—2010), have been 
discovered, are being intensively studied, and are leading to important new insights in a 
variety of applications. Although it is an old subject about which much is known, the study 
of differential equations in the twenty-first century remains a fertile source of fascinating and 
important unsolved problems. 


In each of Problems 16 through 18, determine the order of the given 
partial differential equation; also state whether the equation is linear 
or nonlinear. Partial derivatives are denoted by subscripts. 

16. uj +uyy + uz =0 

17. uc 2Uxxyy + Uyyyy = 0 

18. u, +uus = 1+ uxx 

In each of Problems 19 through 21, verify that each given function is 
a solution of the given partial differential equation. 


+-+ 
dtt — dt? dt? 
2 


d^y 
4. —— +sin(t + y) —sint 
dt? ( ») 


In each of Problems 5 through 10, verify that each given function is a 


solution of the differential equation. 
5. y"-y=0; y(t) =e", y(t) = cosht 


y” +2y' —3y = 0; 3t 


6 xi) =e", y(t) =e 

7. y=3t+r? 

8. y" 4 4y"43y=t;, y(t) = 1/3; y(t) =e +1/3 
9 


Py" E 5ty Ay 20, t>0; y(t) =t, w(t) 20?Dnt 


ty’ y=’; 


t 
10. y'-2ty =1; y-e f Paste 
0 


In each of Problems 11 through 13, determine the values of r for which 
the given differential equation has solutions of the form y = e”. 


11. y'-2y20 

12. y’+y'-6y =0 

13. y" —3y"” +2y'’=0 

In each of Problems 14 and 15, determine the values of r for which the 
given differential equation has solutions of the form y = ¢’ fort > 0. 


14. ty’ +4ty’+2y 20 
15. ty” —4ty’+4y 20 


+y=!1 19. uxx +uyy =0; u(x, y) = cosx cosh y, 
u(x, y) = ln(x? + y?) 
20. o?u, =u; u(x,t) = e e sin x, 


212, . 
u(x, t) = e-* sinAx, A a real constant 


21. auxx =un; u(x,t) = sin(Ax) sin(Aat), 
u(x,t) = sin(x — at), A areal constant 


22. Follow the steps indicated here to derive the equation of motion 
of a pendulum, equation (12) in the text. Assume that the rod is rigid 
and weightless, that the mass is a point mass, and that there is no 
friction or drag anywhere in the system. 
a. Assume that the mass is in an arbitrary displaced position, 
indicated by the angle 0. Draw a free-body diagram showing the 
forces acting on the mass. 
b. Apply Newton’s law of motion in the direction tangential to 
the circular arc on which the mass moves. Then the tensile force 
in the rod does not enter the equation. Observe that you need to 
find the component of the gravitational force in the tangential 
direction. Observe also that the linear acceleration, as opposed to 
the angular acceleration, is Ld?0 /dt?, where L is the length of 
the rod. 
c. Simplify the result from part b to obtain equation (12) in the 
text. 


23. Another way to derive the pendulum equation (12) is based on 
the principle of conservation of energy. 
a. Show that the kinetic energy T of the pendulum in motion is 


1 _,/do\? 
T= —mL* | —]. 
2 dt 


b. Show that the potential energy V of the pendulum, relative to 
its rest position, is 


V = mgL(1 — cos0). 


C. By the principle of conservation of energy, the total energy 
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E = T + V is constant. Calculate d E / dt, set it equal to zero, 
and show that the resulting equation reduces to equation (12). 


24. A third derivation of the pendulum equation depends on the 
principle of angular momentum: The rate of change of angular 
momentum about any point is equal to the net external moment about 
the same point. 
a. Show that the angular momentum M, or moment of 
momentum, about the point of support is given by M — 
mL?d0 | dt. 
b. Set 4M /dt equal to the moment of the gravitational force, 
and show that the resulting equation reduces to equation (12). 
Note that positive moments are counterclockwise. 


A useful historical appendix on the early development of 
differential equations appears in 


Ince, E. L., Ordinary Differential Equations (London: Longmans, 
Green, 1927; New York: Dover, 1956). 

Encyclopedic sources of information about the lives and 

achievements of mathematicians of the past are 

Gillespie, C. C., ed., Dictionary of Scientific Biography (15 vols.) 
(New York: Scribner's, 1971). 

Koertge, N., ed., New Dictionary of Scientific Biography (8 vols.) 
(New York: Scribner's, 2007). 
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(New York: Scribner's, 2007 [e-book]). 
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created by O’Connor, J. J., and Robertson, E. F., Department 
of Mathematics and Statistics, University of St. Andrews, 
Scotland. 
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This chapter deals with differential equations of first order 

dy 

di = f(t, y), (1) 
where f is a given function of two variables. Any differentiable function y = (ft) that 
satisfies this equation for all t in some interval is called a solution, and our objective is 
to determine whether such functions exist and, if so, to develop methods for finding them. 
Unfortunately, for an arbitrary function f, there is no general method for solving the equation 
in terms of elementary functions. Instead, we will describe several methods, each of which is 
applicable to a certain subclass of first-order equations. 

The most important of these are linear equations (Section 2.1), separable equations 
(Section 2.2), and exact equations (Section 2.6). Other sections of this chapter describe 
some of the important applications of first-order differential equations, introduce the idea of 
approximating a solution by numerical computation, and discuss some theoretical questions 
related to the existence and uniqueness of solutions. The final section includes an example of 
chaotic solutions in the context of first-order difference equations, which have some important 
points of similarity with differential equations and are simpler to investigate. 


21 Linear Differential Equations; Method 
of Integrating Factors 


If the function f in equation (1) depends linearly on the dependent variable y, then equation (1) 
is a first-order linear differential equation. In Sections 1.1 and 1.2 we discussed a restricted 
type of first-order linear differential equation in which the coefficients are constants. A typical 
example is 

dy b 2 

de ay + D, (2) 
where a and b are given constants. Recall that an equation of this form describes the motion 
of an object falling in the atmosphere. 

Now we want to consider the most general first-order linear differential equation, which 
is obtained by replacing the coefficients a and b in equation (2) by arbitrary functions of t. We 
will usually write the general first-order linear differential equation in the standard form 

dy 

dt 
where p and g are given functions of the independent variable t. Sometimes it is more 
convenient to write the equation in the form 


+ p(t)y = g(t), (3) 


d 
Pa) + Q(Dy = Gt), (4) 


where P, Q, and G are given. Of course, as long as P(t) Æ 0, you can convert equation (4) 
to equation (3) by dividing both sides of equation (4) by P(t). 

In some cases it is possible to solve a first-order linear differential equation immediately 
by integrating the equation, as in the next example. 
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EXAMPLE 1 


Solve the differential equation 
2, dy 
(4 P) qr + 2ty = 4t. (5) 


Solution: 


The left-hand side of equation (5) is a linear combination of dy/dt and y, a combination that also 
appears in the rule from calculus for differentiating a product. In fact, 


dy d 
(fautes +2ty = ze EP 


it follows that equation (5) can be rewritten as 


d 2 

zi bd )y) — 4t. (6) 
Thus, even though y is unknown, we can integrate both sides of equation (6) with respect to t, thereby 
obtaining 

(4 0)y 228 +c, (7) 
where c is an arbitrary constant of integration. Solving for y, we find that 

21? " [s (8) 
TUR dap 


This is the general solution of equation (5). 


Unfortunately, most first-order linear differential equations cannot be solved as illustrated 
in Example 1 because their left-hand sides are not the derivative of the product of y and some 
other function. However, Leibniz discovered that if the differential equation is multiplied by 
a certain function u(t), then the equation is converted into one that is immediately integrable 
by using the product rule for derivatives, just as in Example 1. The function u(t) is called an 
integrating factor and our main task in this section is to determine how to find it for a given 
equation. We will show how this method works first for an example and then for the general 
first-order linear differential equation in the standard form (3). 


EXAMPLE 2 


Find the general solution of the differential equation 


Le). (9) 


Draw some representative integral curves; that is, plot solutions corresponding to several values of 
the arbitrary constant c. Also find the particular solution whose graph contains the point (0, 1). 


Solution: 


The first step is to multiply equation (9) by a function u(t), as yet undetermined; thus 
dy 1 1 1/3 
t) — -u(t)y = -u(t . 10 
LC eT ub )y z” )e (10) 


The question now is whether we can choose u(t) so that the left-hand side of equation (10) is the 
derivative of the product u(t) y. For any differentiable function u(t) we have 


du(t) 
dt 


Thus the left-hand side of equation (10) and the right-hand side of equation (11) are identical, 
provided that we choose u(t) to satisfy 
du(t) 1 


ET ze). (12) 


d B dy 
g eO = OS T (11) 
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Our search for an integrating factor will be successful if we can find a solution of equation (12). 
Perhaps you can readily identify a function that satisfies equation (12): What well-known function 
from calculus has a derivative that is equal to one-half times the original function? More 
systematically, rewrite equation (12) as 


1 du(t) 1 
u( dt 2’ 
which is equivalent to 
2. aes: (13) 
dt 2 


Then it follows that 
1 
Inu) = 51 4 C. 


or 

p(t) = ce, (14) 

The function u(t) given by equation (14) is an integrating factor for equation (9). Since we do 

not need the most general integrating factor, we will choose c to be 1 in equation (14) and use 
p(t) = e’. 

Now we return to equation (9), multiply it by the integrating factor e’ /? and obtain 
dy 1 1 
t/20r L2 =e 51/6 15 
eda 7-3 eS 


By the choice we have made of the integrating factor, the left-hand side of equation (15) is the 


derivative of e'/? y, so that equation (15) becomes 


d 1 
wey) = ao (16) 
By integrating both sides of equation (16), we obtain 
3 
ety = se" e, (17) 


where c is an arbitrary constant. Finally, on solving equation (17) for y, we have the general solution 
of equation (9), namely, 


y- Se bce, (18) 


To find the solution passing through the point (0, 1), we set t = 0 and y = 1 in equation (18), 
obtaining 1 = 3/5 + c. Thus c = 2/5, and the desired solution is 
3 3L 2,2 
= re? + et. 19 
yes 5 (19) 
Figure 2.1.1 includes the graphs of equation (18) for several values of c with a direction field 
in the background. The solution satisfying y(0) = 1 is shown by the green curve. 
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| FIGURE 2.1.1 | Direction field and integral curves of y’ + ly = tet/3: the 


green curve passes through the point (0, 1). 
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Let us now extend the method of integrating factors to equations of the form 


dy 

dt 

where a is a given constant and g(t) is a given function. Proceeding as in Example 2, we find 
that the integrating factor u(t) must satisfy 
du 


SE AN 21 
g E (21) 


rather than equation (12). Thus the integrating factor is u(t) = e“. Multiplying equation (20) 
by u(t), we obtain 


Tay = g(t), (20) 


ao +ae“y =e" g(t), 
or 
d at at 
a y) =e" g(t). (22) 


By integrating both sides of equation (22), we find that 
e" y — pets dt +c, (23) 


where c is an arbitrary constant. For many simple functions g(t), we can evaluate the integral 
in equation (23) and express the solution y in terms of elementary functions, as in Example 2. 
However, for more complicated functions g(t), it is necessary to leave the solution in integral 
form. In this case 


t 
y= e [ e^ g(s) ds 4- ce ". (24) 
to 


Note that in equation (24) we have used s to denote the integration variable to distinguish it 
from the independent variable t, and we have chosen some convenient value f, as the lower 
limit of integration. (See Theorem 2.4.1.) The choice of tọ determines the specific value of the 
constant c but does not change the solution. For example, plugging t = tọ into the solution 
formula (24) shows that c = y(19) e^». 


EXAMPLE 3 


Find the general solution of the differential equation 


— -2y-4-t Q5) 


and plot the graphs of several solutions. Discuss the behavior of solutions as t > oo. 


Solution: 
Equation (25) is of the form (20) with a = —2; therefore, the integrating factor is u(t) = e`”. 
Multiplying the differential equation (25) by u(t), we obtain 
pay 2e y = 4e ?! m" te”, 
dt 

or 

a (e^ y) — 4e? — tet, (26) 

dt 


Then, by integrating both sides of this equation, we have 


Ioa. d 
$ 
26 t7 


ety = 2e a e? cc, 
where we have used integration by parts on the last term in equation (26). Thus the general solution 
of equation (25) is 

7 


1 2t 
prece qe tee". (27) 
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Figure 2.1.2 shows the direction field and graphs of the solution (27) for several values of c. The 
behavior of the solution for large values of t is determined by the term ce”. If c 7 0, then the solution 
grows exponentially large in magnitude, with the same sign as c itself. Thus the solutions diverge 
as t becomes large. The boundary between solutions that ultimately grow positively and those that 
ultimately grow negatively occurs when c = 0. If we substitute c = 0 into equation (27) and then set 


t = 0, we find that y = —7/4 is the separation point on the y-axis. Note that for this initial value, 
7 
the solution is y = — à + P it grows positively, but linearly rather than exponentially. 
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| FIGURE 2.1.2 | Direction field and integral curves of y' — 2y — 4 — t. 


Now we return to the general first-order linear differential equation (3) 
dy 
dt 

where p and g are given functions. To determine an appropriate integrating factor, we multiply 
equation (3) by an as yet undetermined function u(t), obtaining 


+ p(t)y = g(t), 


d 
WD + POROY = KDB): (28) 


Following the same line of development as in Example 2, we see that the left-hand side of 
equation (28) is the derivative of the product u(t) y, provided that u(t) satisfies the equation 


O — par). (29) 
If we assume temporarily that z(t) is positive, then we have 
| du(t) 
p(t) dt 


= p(t), 
and consequently 
In |u(t)| = no dt 4 k. 


By choosing the arbitrary constant k to be zero, we obtain the simplest possible function for 
H, namely, 


u(t) =exp f io dt. (30) 
Note that u(t) is positive for all t, as we assumed. Returning to equation (28), we have 

d 

g O = p(t)g(t). (31) 
Hence 


ny = f tos) did (32) 
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where c is an arbitrary constant. Sometimes the integral in equation (32) can be evaluated in 
terms of elementary functions. However, in general this is not possible, so the general solution 


of equation (3) is 
1 t 
y= I u(s)g(s)ds +c |. (33) 
u(t) 1p 


where again tọ is some convenient lower limit of integration. Observe that equation (33) 
involves two integrations, one to obtain u(t) from equation (30) and the other to determine y 
from equation (33). 


EXAMPLE 4 


Solve the initial value problem 


ty! +2y = At?, (34) 
x(1) = 2. (35) 


Solution: 


In order to determine p(t) and g(t) correctly, we must first rewrite equation (34) in the standard 
form (3). Thus we have 


2 
ym (36) 


so p(t) = 2/t and g(t) = 4t. To solve equation (36), we first compute the integrating factor u(t): 


2 
L(t) = e( f ar) = gn. 


On multiplying equation (36) by u(t) = t?, we obtain 
ty’ +2ty = (y)! = 4C, 
and therefore 
ty= fea =t +c, 
where c is an arbitrary constant. It follows that, for t > 0, 
ysüa > (37) 


is the general solution of equation (34). Integral curves of equation (34) for several values of c are 
shown in Figure 2.1.3. 
To satisfy initial condition (35), set t = 1 and y = 2 in equation (37): 2 = 1 +c, so c = 1; thus 


Em T 
y=t Toys t>0 (38) 


is the solution of the initial value problem (24), (25). This solution is shown by the green curve in 
Figure 2.1.3. Note that it becomes unbounded and is asymptotic to the positive y-axis ast — 0 
from the right. This is the effect of the infinite discontinuity in the coefficient p(t) at the origin. It 
is important to note that while the function y = t? + 1/1? fort < O is part of the general solution of 
equation (34), it is not part of the solution of this initial value problem. 

This is the first example in which the solution fails to exist for some values of t. Again, this 
is due to the infinite discontinuity in p(t) at t = 0, which restricts the solution to the interval 
0 « t « oo. 

Looking again at Figure 2.1.3, we see that some solutions (those for which c > 0) are asymptotic 
to the positive y-axis as t — 0 from the right, while other solutions (for which c « 0) are asymptotic 
to the negative y-axis. If we generalize the initial condition (35) to 


y(1) = yo, (39) 
then c = yo — 1 and the solution (38) becomes 


yo — 1 
E) 


y=ť + , t>0 (40) 


Note that when yọ = 1, so c = 0, the solution is y = t”, which remains bounded and differentiable 
even at t = 0. (This is the red curve in Figure 2.1.3.) 
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As in Example 3, this is another instance where there is a critical initial value, namely, yọ = 1, that 
separates solutions that behave in one way from others that behave quite differently. 


y 

3 

2 
(1,2) 

1 


-1 


| FIGURE 2.1.3 | Integral curves of the differential equation ty’ + 2y = 4t?; 
the green curve is the particular solution with y(1) = 2. The red curve is the 


particular solution with y(1) = 1. 


EXAMPLE 5 


Solve the initial value problem 


2y' + ty = 2, (41) 
y(0) = I. (42) 


Solution: 


To convert the differential equation (41) to the standard form (3), we must divide equation (41) by 
2, obtaining 


" t 
y t=. (43) 


Thus p(t) = t/2, and the integrating factor is u(t) = exp(t?/4). Then multiply equation (43) by 
L(t), so that 


t 
et ly! a se ^y = e^. (44) 


The left-hand side of equation (44) is the derivative of el ^y, so by integrating both sides of 
equation (44), we obtain 


ey = ar +c. (45) 


The integral on the right-hand side of equation (45) cannot be evaluated in terms of the usual 
elementary functions, so we leave the integral unevaluated. By choosing the lower limit of integration 
as the initial point t = 0, we can replace equation (45) by 


t 
et /4y = f eI qs + C, (46) 
0 
where c is an arbitrary constant. It then follows that the general solution y of equation (41) is given by 


t 
y= ee] e I4 ds + ce 1^. (47) 
0 


To determine the particular solution that satisfies the initial condition (42), set t = 0 and y = 1 in 
equation (47): 


0 
beu e 9 I^ ds + ce? 
0 


=0+c, 


soc -— Il. 
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The main purpose of this example is to illustrate that sometimes the solution must be left in terms 
of an integral. This is usually at most a slight inconvenience, rather than a serious obstacle. For a given 
value of t, the integral in equation (47) is a definite integral and can be approximated to any desired 
degree of accuracy by using readily available numerical integrators. By repeating this process for 
many values of t and plotting the results, you can obtain a graph of a solution. Alternatively, you can 
use a numerical approximation method, such as those discussed in Chapter 8, that proceed directly 
from the differential equation and need no expression for the solution. Software packages such as 
Maple, Mathematica, MATLAB and Sage readily execute such procedures and produce graphs of 
solutions of differential equations. 

Figure 2.1.4 displays graphs of the solution (47) for several values of c. The particular solution 
satisfying the initial condition y(0) — 1 is shown in black. From the figure it may be plausible to 


conjecture that all solutions approach a limit as t — oo. The limit can also be found analytically (see 
Problem 22). 


Integral curves of 2y’+ty = 2; the green curve is the particular 
solution satisfying the initial condition y(0) — 1. 


Problems 


In each of Problems 1 through 8: 9, y —y 22te?, y(0)=1 
O a. Drawa direction field for the given differential equation. 10. y +2y=te”, y(1) =0 
b. Based on an inspection of the direction field, describe how 2 cosi 
solutions behave for large t. 1l. y +-y= —^ wr) =0, t>0 
c. Find the general solution of the given differential equation, ; d n 
and use it to determine how solutions behave as t — oo. 12. y'+0+Dy=f, ydn2-1 (20 
Lh yoy see e? In each of Problems 13 and 14: 
Q a. Draw a direction field for the given differential equation. 
2. y -2y ze? How do solutions appear to behave as t becomes large? Does the 
3. y'+y=te'+1 behavior depend on the choice of the initial value a? Let ap be 
Q2] the value of a for which the transition from one type of behavior 
& oy P7 3cos(2t), t>0 to another occurs. Estimate the value of ag. 
5. y —2y «3e b. Solve the initial value problem and find the critical value ao 
; Ne exactly. 
vere gU C. Describe the behavior of the solution corresponding to the 
7. y +y = 5sin(2t) initial value a. 
/ = 2 
m taa : : T 13. y - ly = 2cos1, y(0) =a 
In each of Problems 9 through 12, find the solution of the given initial 2 
value problem. 14. 3y/'-2y-e7'?, y(0) =a 
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In each of Problems 15 and 16: 
Q a. Draw a direction field for the given differential equation. 
How do solutions appear to behave as t — 0? Does the behavior 
depend on the choice of the initial value a? Let a be the critical 
value of a, that is, the initial value such that the solutions for 
a < ag and the solutions for a > a have different behaviors as 
t — oo. Estimate the value of ap. 
b. Solve the initial value problem and find the critical value ao 
exactly. 
C. Describe the behavior of the solution corresponding to the 
initial value ag. 


15. ty'c- (t4 Dy =2te™, 
16. (sint)y' 4 (cost) y — e!, 


y(1) =a, t»0 
y(l)=a, 0«t«m 
© 17. Consider the initial value problem 


1 
y + = 2cost, y(0) = -l1. 


Find the coordinates of the first local maximum point of the solution 
fort > 0. 


@ 18. Consider the initial value problem 


4 : 1 ! t, y0) 

y 3" = z’ y = yo. 
Find the value of yọ for which the solution touches, but does not cross, 
the t-axis. 


19. Consider the initial value problem 


1 
y + -y =3 + 2cos(2t), 


7 y(0) =0. 


a. Find the solution of this initial value problem and describe its 
behavior for large t. 

@ b. Determine the value of ¢ for which the solution first 
intersects the line y = 12. 


20. Find the value of yọ for which the solution of the initial value 
problem 


y —y=1+3sint, y(0) = yo 
remains finite as t — oo. 
21. Consider the initial value problem 
y-iy230426, y(0) =y: 


2 


Find the value of yo that separates solutions that grow positively as 
t — oo from those that grow negatively. How does the solution that 
corresponds to this critical value of yo behave as t — oo? 


22. Show that all solutions of 2y' 4- ty — 2 [equation (41) of the 
text] approach a limit as t — oo, and find the limiting value. 
Hint: Consider the general solution, equation (47). Show that the first 


term in the solution (47) is indeterminate with form 0 - oo. Then, use 
l'Hópital's rule to compute the limit as t — oo. 


23. Show that if a and A are positive constants, and b is any real 
number, then every solution of the equation 
yl + ay = be 
has the property that y > 0 ast — oo. 
Hint: Consider the cases a = \ and a Z A separately. 


In each of Problems 24 through 27, construct a first-order linear 
differential equation whose solutions have the required behavior as 
t — oo. Then solve your equation and confirm that the solutions do 
indeed have the specified property. 


24. All solutions have the limit 3 as t > oo. 

25. All solutions are asymptotic to the line y = 3 — t as t > oo. 
26. All solutions are asymptotic to the line y = 2t — 5 as t > oo. 
27. All solutions approach the curve y = 4 — t? as t > oc. 


28. Variation of Parameters. Consider the following method of 
solving the general linear equation of first order: 


y p(t)y = g(t). (48) 


a. If g(t) = 0 for all t, show that the solution is 


y = Aexp (- fro ir), (49) 


where A is a constant. 
b. If g(t) is not everywhere zero, assume that the solution of 
equation (48) is of the form 


y = A(t) ew(- f v0 2! (50) 


where A is now a function of t. By substituting for y in the given 
differential equation, show that A(t) must satisfy the condition 


A'(t) = g(t) exp ( i p(t) a) (51) 


c. Find A(t) from equation (51). Then substitute for A(t) in 
equation (50) and determine y. Verify that the solution obtained 
in this manner agrees with that of equation (33) in the text. This 
technique is known as the method of variation of parameters; 
it is discussed in detail in Section 3.6 in connection with second- 
order linear equations. 


In each of Problems 29 and 30, use the method of Problem 28 to solve 
the given differential equation. 


29. y —2y = Pe? 


1 
30. y’ + zy = cos(2t), t>0 


22 Separable Differential Equations 


In Section 1.2 we used a process of direct integration to solve first-order linear differential 
equations of the form 
dy 


— = b 1 
di ay + b, (1) 


where a and b are constants. We will now show that this process is actually applicable to a 
much larger class of nonlinear differential equations. 

We will use x, rather than ¢, to denote the independent variable in this section for two 
reasons. In the first place, different letters are frequently used for the variables in a differential 
equation, and you should not become too accustomed to using a single pair. In particular, x 
often occurs as the independent variable. Further, we want to reserve f for another purpose 
later in the section. 

The general first-order differential equation is 


dy _ 
a o (2) 


Linear differential equations were considered in the preceding section, but if equation (2) is 
nonlinear, then there is no universally applicable method for solving the equation. Here, we 
consider a subclass of first-order equations that can be solved by direct integration. 

To identify this class of equations, we first rewrite equation (2) in the form 


d 
M(x, y) + N(x, TT =0. 3) 


It is always possible to do this by setting M(x, y) = — f(x, y) and N(x, y) = 1, but there 
may be other ways as well. When M is a function of x only and N is a function of y only, then 
equation (3) becomes 


d 
M(x) +N) Z =0. (4) 
dx 
Such an equation is said to be separable, because if it is written in the differential form 
M(x) dx + N(y) dy =0, (5) 


then, if you wish, terms involving each variable may be placed on opposite sides of the 
equation. The differential form (5) is also more symmetric and tends to suppress the distinction 
between independent and dependent variables. 

A separable equation can be solved by integrating the functions M and N. We illustrate 
the process by an example and then discuss it in general for equation (4). 


EXAMPLE 1 


Show that the equation 


dy _ x? 


= 6 
dx 1-y? (9 
is separable, and then find an equation for its integral curves. 
Solution: 
If we write equation (6) as 
dy 
— cy) -0 (7) 
dx 


then it has the form (4) and is therefore separable. Recall from calculus that if y is a function of x, 
then by the chain rule, 


d _d dy _ py. D 
TIO” m TO 
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For example, if f(y) = y — y?/3, then 


d y 2, dy 
=(1 : 
dx (> 3 ) Vega 
Thus the second term in equation (7) is the derivative with respect to x of y — y? /3, and the first 
term is the derivative of —x? /3. Thus equation (7) can be written as 


d x? d y? 
=(), 
xí a (> 3 


ae 2m 
dii s «41-7 


Therefore, by integrating (and multiplying the result by 3), we obtain 


or 


=x? +3y — y? =c, (8) 


where c is an arbitrary constant. 

Equation (8) is an equation for the integral curves of equation (6). A direction field and 
several integral curves are shown in Figure 2.2.1. Any differentiable function y = (x) that 
satisfies equation (8) is a solution of equation (6). An equation of the integral curve passing through 
a particular point (xo, yo) can be found by substituting xo and yo for x and y, respectively, in 
equation (8) and determining the corresponding value of c. 


yA 
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| FIGURE 2.2.1 | Direction field and integral curves of y' 2 x?/(1 — y?). 


Essentially the same procedure can be followed for any separable equation. Returning to 
equation (4), let Hı and H, be any antiderivatives of M and N, respectively. Thus 


Hix) = M(x), Hy) = N(Y), 9) 


and equation (4) becomes 
dy 
H, H; — ac. 10 
(x) + Hy) m (10) 


If y is regarded as a function of x, then according to the chain rule, 


Hy? e mo) 2 = f np) a) 
2SYUdx dy TUM da dk OA 
Consequently, we can write equation (10) as 
d 
—(Hi(x) + Hy(y)) = 0. (12) 


dx 


By integrating equation (12) with respect to x, we obtain 
A(x) + HXCy) =c, (13) 


where c is an arbitrary constant. Any differentiable function y = @(x) that satisfies 
equation (13) is a solution of equation (4); in other words, equation (13) defines the solution 
implicitly rather than explicitly. In practice, equation (13) is usually obtained from equation (5) 
by integrating the first term with respect to x and the second term with respect to y. The 
justification for this is the argument that we have just given. 

The differential equation (4), together with an initial condition 


y(xo) = yo. (14) 


forms an initial value problem. To solve this initial value problem, we must determine the 
appropriate value for the constant c in equation (13). We do this by setting x — xo and y — yo 
in equation (13) with the result that 


c = Hi(xo) + Hoyo). (15) 


Substituting this value of c in equation (13) and noting that 
x y 
Ho) - Ho) = [ Mods Hio) = rho) = f NGYs. 
Xo Yo 


we obtain 
X y 
J Moas + | N(s)ds = 0. (16) 
Xo Yo 


Equation (16) is an implicit representation of the solution of the differential equation (4) that 
also satisfies the initial condition (14). Bear in mind that to determine an explicit formula for 
the solution, you need to solve equation (16) for y as a function of x. Unfortunately, it is often 
impossible to do this analytically; in such cases you can resort to numerical methods to find 
approximate values of y for given values of x. 


EXAMPLE 2 


Solve the initial value problem 


dy _ 3x? E Ax +2 
dx 2%y-l) ' 


y(0) = —1, (17) 
and determine the interval in which the solution exists. 


Solution: 
The differential equation can be written as 
2(y — Ddy = (3x? + 4x + 2)dx. 
Integrating the left-hand side with respect to y and the right-hand side with respect to x gives 
y? —2y = x? + 2x? 42x +e, (18) 


where c is an arbitrary constant. To determine the solution satisfying the prescribed initial condition, 
we substitute x = 0 and y = —1 in equation (18), obtaining c = 3. Hence the solution of the initial 
value problem is given implicitly by 


y? — 2y = x? 4 2x7 42x 43. (19) 


To obtain the solution explicitly, we must solve equation (19) for y in terms of x. That is a simple 
matter in this case, since equation (19) is quadratic in y, and we obtain 


y= lt Vx34+ 2x24 2x +4. (20) 


Equation (20) gives two solutions of the differential equation, only one of which, however, satisfies 
the given initial condition. This is the solution corresponding to the minus sign in equation (20), so 


we finally obtain 
y= (x) =1— yx? +2x?+2x +4 (21) 
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as the solution of the initial value problem (15). Note that if we choose the plus sign by mistake in 
equation (20), then we obtain the solution of the same differential equation that satisfies the initial 
condition y(0) = 3. Finally, to determine the interval in which the solution (21) is valid, we must find 
the interval in which the quantity under the radical is positive. The only real zero of this expression 
is x = —2, so the desired interval is x > —2. Some integral curves of the differential equation are 
shown in Figure 2.2.2. The green curve passes through the point (0, —1) and thus is the solution of 
the initial value problem (15). Observe that the boundary of the interval of validity of the solution 
(21) is determined by the point (—2, 1) at which the tangent line is vertical. 
s4 


Integral curves of y' = (3x? + 4x + 2) /2( y — 1); the solution 
satisfying y(0) = — 1 is shown in green and is valid for x > — 2. 


EXAMPLE 3 


Solve the separable differential equation 


dy _4x-x° 


dx 4+y 


(22) 


and draw graphs of several integral curves. Also find the solution passing through the point (0, 1) 
and determine its interval of validity. 


Solution: 
Rewriting equation (22) as 
(4+ dy = (4x — x)dx, 
integrating each side, multiplying by 4, and rearranging the terms, we obtain 
yí + 16y + xt — 8x? = 6; Q3) 


where c is an arbitrary constant. Any differentiable function y = o (x) that satisfies equation (23) is 
a solution of the differential equation (22). Graphs of equation (23) for several values of c are shown 
in Figure 2.2.3. 

To find the particular solution passing through (0, 1), we set x = 0 and y = 1 in equation (23) 
with the result that c — 17. Thus the solution in question is given implicitly by 


yt + 16y - x* — 8x? = 17. (24) 


It is shown by the green curve in Figure 2.2.3. The interval of validity of this solution extends on 
either side of the initial point as long as the function remains differentiable. From the figure we see 
that the interval ends when we reach points where the tangent line is vertical. It follows from the 
differential equation (22) that these are points where 4+ y? = 0, or y = (—4) /? = —1.5874. From 
equation (24) the corresponding values of x are x = +3.3488. These points are marked on the graph 
in Figure 2.2.3. 


Integral curves of y’ = (4x — x?) /(44- y?). The solution passing 
through (0, 1) is shown by the green curve. 


Note 1: Sometimes a differential equation of the form (2): 


d 


T= fy) 


has a constant solution y = yo. Such a solution is usually easy to find because if f(x, yo) = 0 
for some value yo and for all x, then the constant function y = yo isa solution of the differential 
equation (2). For example, the equation 
d — 3) cos 
y (y-3cosx (25) 
dx 1 +2y? 
has the constant solution y = 3. Other solutions of this equation can be found by separating 
the variables and integrating. 
Note 2: The investigation of a first-order nonlinear differential equation can sometimes 
be facilitated by regarding both x and y as functions of a third variable t. Then 


dy dy/dt 

= : 26 

dx — dx/dt m 
If the differential equation is 

d F 

or Pos y) » (27) 

dx G(x, y) 


then, by comparing numerators and denominators in equations (26) and (27), we obtain the 
system 


dx dy 
eG yh = 
a e er 


At first sight it may seem unlikely that a problem will be simplified by replacing a single 
equation by a pair of equations, but in fact, the system (28) may well be more amenable to 
investigation than the single equation (27). Chapter 9 is devoted to nonlinear systems of the 
form (28). 

Note 3: In Example 2 it was not difficult to solve explicitly for y as a function of x. 
However, this situation is exceptional, and often it will be better to leave the solution in 
implicit form, as in Examples 1 and 3. Thus, in the problems below and in other sections 
where nonlinear equations appear, the words “solve the following differential equation" mean 
to find the solution explicitly if it is convenient to do so, but otherwise to find an equation 
defining the solution implicitly. 


= F(x, y). (28) 
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Problems 


In each of Problems 1 through 8, solve the given differential equation. 


Ly = E 
y 

2. y y?sinx 20 
3. y’ =cos*(x) cos(2y) 
4. xy 2(1— y)? 
2 m dei 

dx ycte 
g CL 

dx 1+ y? 

d 
7 9.» 

dx x 

d EN 
2 

dx y 


In each of Problems 9 through 16: 

a. Find the solution of the given initial value problem in explicit 

form. 

@ b. Plot the graph of the solution. 

c. Determine (at least approximately) the interval in which the 

solution is defined. 

9. y =(1-2x)y’, 

10. y =(1—2x)/y, 
1l. xdx+yedy =0, 
12. dr/d0 = r?/9, 


y(0) = —1/6 
y(1) = -2 

y(0) =1 
r(1) =2 


13. y =x (+x, y(0) =1 

14. y’=2x/(14+2y), y(2) =0 

15. y'= (3x? —e*)/(2y—5), y(0)=1 

16. sin(2x) dx - cos(GBy) dy 20,  y(m/2) = 1/3 


Some of the results requested in Problems 17 through 22 can be 
obtained either by solving the given equations analytically or by 
plotting numerically generated approximations to the solutions. Try 
to form an opinion about the advantages and disadvantages of each 
approach. 


O 17. 


Solve the initial value problem 


,_ 1+3x? 


=——.,_ y=! 
55-4) »(0) 


y 
and determine the interval in which the solution is valid. 
Hint: To find the interval of definition, look for points where the 
integral curve has a vertical tangent. 


Q 18. Solve the initial value problem 


2 
j 3x 


=, y(1)=0 
3y2—4 y1) 


y 
and determine the interval in which the solution is valid. 
Hint: To find the interval of definition, look for points where the 
integral curve has a vertical tangent. 


© 19. Solve the initial value problem 


y 22y «xy. y0)-1 

and determine where the solution attains its minimum value. 
Q 20. Solve the initial value problem 

_ 24=e 


~ 342y’ 


f 


y y(0) =0 


and determine where the solution attains its maximum value. 


© 21. Consider the initial value problem 


yas 

3 ; 

a. Determine how the behavior of the solution as f increases 
depends on the initial value yo. 


b. Suppose that yo = 0.5. Find the time T at which the solution 
first reaches the value 3.98. 


y(0) = yo. 


Q 22. Consider the initial value problem 


y= DM. y0) = yo > 0. 
a. Determine how the solution behaves as t — oo. 
b. If yo = 2, find the time T at which the solution first reaches 
the value 3.99. 
c. Find the range of initial values for which the solution lies in 
the interval 3.99 < y < 4.01 by the time t = 2. 


23. Solve the equation 


dy _ay+b 
dx cy+d’ 


where a, b, c, and d are constants. 


24. Use separation of variables to solve the differential equation 


a r(a+bQ), 


2 Q(0) — Qo. 


where a, b, r, and Qo are constants. Determine how the solution 
behaves as t — oo 


Homogeneous Equations. If the right-hand side of the equation 
dy/dx = f(x, y) can be expressed as a function of the ratio y/x 
only, then the equation is said to be homogeneous.! Such equations 
can always be transformed into separable equations by a change of 
the dependent variable. Problem 25 illustrates how to solve first-order 
homogeneous equations. 


'The word “homogeneous” has different meanings in different mathematical 
contexts. The homogeneous equations considered here have nothing to do with 
the homogeneous equations that will occur in Chapter 3 and elsewhere. 
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@ 25. Consider the equation through the origin, although the slope changes from one line to 
d =" another. Therefore, the direction field and the integral curves are 
d nce (29) symmetric with respect to the origin. Is this symmetry property 


dx Xy evident from your plot? 


a. Show that equation (29) can be rewritten as The method outlined in Problem 25 can be used for any 
homogeneous equation. That is, the substitution y = xv(x) transforms 
dy  (y/x) —4. dn 
206 1- 07x) ; (30) a homogeneous equation into a separable equation. The latter equation 
can be solved by direct integration, and then replacing v by y/x 
thus equation (29) is homogeneous. gives the solution to the original equation. In each of Problems 26 
b. Introduce a new dependent variable v so that v = y/x,or through 31: 
y = xv(x). Express dy/dx in terms of x, v, and dv / dx. a. Show that the given equation is homogeneous. 


C. Replace y and dy/dx in equation (30) by the expressions 
from part b that involve v and dv/dx. Show that the resulting 
differential equation is 


b. Solve the differential equation. 
Q c. Draw a direction field and some integral curves. Are they 
symmetric with respect to the origin? 


d —4 2 2 
jeg : 26. dy x +xy +y 
dx | l1—v dx x 
2 2 
or gs du 4 e 
dx 2xy 
d ? —4 
i n ' dx 2x— y 
Observe that equation (31) is separable. dy 4x +3y 
. s PME oso 
d. Solve equation (31), obtaining v implicitly in terms of x. dx 2x +y 
e. Find the solution of equation (29) by replacing v by y/x in d 2.8 
the solution in part d. 30. EX uo TE 


f. Draw a direction field and some integral curves for dx 2xy 
2 


equation (29). Recall that the right-hand side of equation (29) 31 dy 3y? =x 
actually depends only on the ratio y/x. This means that integral ` dx 2xy 
curves have the same slope at all points on any given straight line 


23. Modeling with First-Order Differential 
Equations 


Differential equations are of interest to nonmathematicians primarily because of the possibility 
of using them to investigate a wide variety of problems in the physical, biological, and social 
sciences. One reason for this is that mathematical models and their solutions lead to equations 
relating the variables and parameters in the problem. These equations often enable you to make 
predictions about how the natural process will behave in various circumstances. It is often easy 
to vary parameters in the mathematical model over wide ranges, whereas this may be very 
time-consuming or expensive, if not impossible, in an experimental setting. Nevertheless, 
mathematical modeling and experiment or observation are both critically important and 
have somewhat complementary roles in scientific investigations. Mathematical models are 
validated by comparison of their predictions with experimental results. On the other hand, 
mathematical analyses may suggest the most promising directions to explore experimentally, 
and they may indicate fairly precisely what experimental data will be most helpful. 

In Sections 1.1 and 1.2 we formulated and investigated a few simple mathematical models. 
We begin by recapitulating and expanding on some of the conclusions reached in those 
sections. Regardless of the specific field of application, there are three identifiable steps that 
are always present in the process of mathematical modeling. 


Step 1: Construction of the Model. In this step the physical situation is translated into 
mathematical terms, often using the steps listed at the end of Section 1.1. Perhaps most critical 
at this stage is to state clearly the physical principle(s) that are believed to govern the process. 
For example, it has been observed that in some circumstances heat passes from a warmer to 
a cooler body at a rate proportional to the temperature difference, that objects move about 
in accordance with Newton's laws of motion, and that isolated insect populations grow at 
a rate proportional to the current population. Each of these statements involves a rate of 
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change (derivative) and consequently, when expressed mathematically, leads to a differential 
equation. The differential equation is a mathematical model of the process. 

It is important to realize that the mathematical equations are almost always only 
an approximate description of the actual process. For example, bodies moving at speeds 
comparable to the speed of light are not governed by Newton’s laws, insect populations do 
not grow indefinitely as stated because of eventual lack of food or space, and heat transfer is 
affected by factors other than the temperature difference. Thus you should always be aware of 
the limitations of the model so that you will use it only when it is reasonable to believe that 
it is accurate. Alternatively, you can adopt the point of view that the mathematical equations 
exactly describe the operation of a simplified physical model, which has been constructed (or 
conceived of) so as to embody the most important features of the actual process. Sometimes, 
the process of mathematical modeling involves the conceptual replacement of a discrete 
process by a continuous one. For instance, the number of members in an insect population 
changes by discrete amounts; however, if the population is large, it seems reasonable to 
consider it as a continuous variable and even to speak of its derivative. 


Step 2: Analysis of the Model. Once the problem has been formulated mathematically, 
you are often faced with the problem of solving one or more differential equations or, failing 
that, of finding out as much as possible about the properties of the solution. It may happen 
that this mathematical problem is quite difficult, and if so, further approximations may be 
indicated at this stage to make the problem mathematically tractable. For example, a nonlinear 
equation may be approximated by a linear one, or a slowly varying coefficient may be replaced 
by a constant. Naturally, any such approximations must also be examined from the physical 
point of view to make sure that the simplified mathematical problem still reflects the essential 
features of the physical process under investigation. At the same time, an intimate knowledge 
of the physics of the problem may suggest reasonable mathematical approximations that will 
make the mathematical problem more amenable to analysis. This interplay of understanding 
of physical phenomena and knowledge of mathematical techniques and their limitations 
is characteristic of applied mathematics at its best, and it is indispensable in successfully 
constructing useful mathematical models of intricate physical processes. 


Step 3: Comparison with Experiment or Observation. Finally, having obtained the 
solution (or at least some information about it), you must interpret this information in 
the context in which the problem arose. In particular, you should always check that the 
mathematical solution appears physically reasonable. If possible, calculate the values of the 
solution at selected points and compare them with experimentally observed values. Or ask 
whether the behavior of the solution after a long time is consistent with observations. Or 
examine the solutions corresponding to certain special values of parameters in the problem. Of 
course, the fact that the mathematical solution appears to be reasonable does not guarantee that 
it is correct. However, if the predictions of the mathematical model are seriously inconsistent 
with observations of the physical system it purports to describe, this suggests that errors have 
been made in solving the mathematical problem, that the mathematical model itself needs 
refinement, or that observations must be made with greater care. 

The examples in this section are typical of applications in which first-order differential 
equations arise. 


EXAMPLE1 | Mixing 


At time ¢ = 0 a tank contains Qy lb of salt dissolved in 100 gal of water; see Figure 2.3.1. Assume 


1 
that water containing d Ib of salt per gallon is entering the tank at a rate of r gal/min and that the 


well-stirred mixture is draining from the tank at the same rate. Set up the initial value problem that 
describes this flow process. Find the amount of salt Q(t) in the tank at any time, and also find the 
limiting amount Qz that is present after a very long time. If r = 3 and Qo = 2Q,, find the time T 
after which the salt level is within 2% of Qz. Also find the flow rate that is required if the value of 
T is not to exceed 45 min. 


2.3 Modeling with First-Order Differential Equations 


Solution: 


r gal/min, 5 Ib/gal 


r gal/min 


| FIGURE 2.3.1 | The water tank in Example 1. 


We assume that salt is neither created nor destroyed in the tank. Therefore, variations in the 
amount of salt are due solely to the flows in and out of the tank. More precisely, the rate of change 
of salt in the tank, d Q / dt, is equal to the rate at which salt is flowing in minus the rate at which it is 
flowing out. In symbols, 


—— = rate in — rate out. (1) 
dt 


The rate at which salt enters the tank is the concentration 1 Ib/gal times the flow rate r gal/min, or 
r /4 lb/min. To find the rate at which salt leaves the tank, we need to multiply the concentration of 
salt in the tank by the rate of outflow, r gal/min. Since the rates of flow in and out are equal, the 
volume of water in the tank remains constant at 100 gal, and since the mixture is “well-stirred,” the 
concentration throughout the tank is the same, namely, Q(t) / 100 Ib/gal. Therefore, the rate at which 
salt leaves the tank is r Q(t) / 100 Ib/min. Thus the differential equation governing this process is 


dQ FÉ rQ 
us ^d 00 (2) 
t 4 100 
The initial condition is 
Q(0) = Qo. (3) 


Upon thinking about the problem physically, we might anticipate that eventually the mixture 
originally in the tank will be essentially replaced by the mixture flowing in, whose concentration is 


P Ib/gal. Consequently, we might expect that ultimately the amount of salt in the tank would be very 


close to 25 Ib. We can also find the limiting amount Qr, = 25 by setting dQ/dt equal to zero in 
equation (2) and solving the resulting algebraic equation for Q. 

To solve the initial value problem (2), (3) analytically, note that equation (2) is linear. (It is 
also separable, see Problem 24 in Section 2.2.) Rewriting the differential equation (2) in the standard 
form for a linear differential equation, we have 

dQ rQ r 


= EL 4 
dr ^ 100-4 e 


Thus the integrating factor is e/'/ 100 


and the general solution is 
Qf) 525 +cat, (5) 


where c is an arbitrary constant. To satisfy the initial condition (3), we must choose c = Qo — 25. 
Therefore, the solution of the initial value problem (2), (3) is 


Q(t) =25 + (Qo — 25) e™™!/ 100, a 
or 
Q(t) =25(1 — e7"t/100) + Que "!/ 100, T 


From either form of the solution, (6) or (7), you can see that Q(t) — 25 (Ib) as tf — oo, so the 
limiting value Q; is 25, confirming our physical intuition. 
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Further, Q(t) approaches the limit more rapidly as r increases. In interpreting the solution (7), 
note that the second term on the right-hand side is the portion of the original salt that remains at time 
t, while the first term gives the amount of salt in the tank as a consequence of the flow processes. 
Plots of the solution for r = 3 and for several values of Qo are shown in Figure 2.3.2. 


20 40 60 80 


Solutions of the initial value problem (2): 
dQ/dt =r/4—rQ/100, Q(0) = Qo forr = 3 and several values of Qo. 


Now suppose that r = 3 and Qo = 2Q;, = 50; then equation (6) becomes 
Q(t) = 25 + 25e 999, (8) 


Since 2% of 25 is 0.5, we wish to find the time T at which Q(t) has the value 25.5. Substituting 
t = T and Q = 25.5 in equation (8) and solving for T, we obtain 


p = POO x 130.4 (min). (9) 


To determine r so that T = 45, return to equation (6), set t = 45, Qo = 50, Q(t) = 25.5, and 
solve for r. The result is 
100 y 
r = — |n 50 = 8.69 gal/min. (10) 
45 
Since this example is hypothetical, the validity of the model is not in question. If the flow rates 
are as stated, and if the concentration of salt in the tank is uniform, then the differential equation 
(1) is an accurate description of the flow process. Although this particular example has no special 
significance, models of this kind are often used in problems involving a pollutant in a lake, or a drug 
in an organ of the body, for example, rather than a tank of salt water. In such cases the flow rates 
may not be easy to determine or may vary with time. Similarly, the concentration may be far from 
uniform in some cases. Finally, the rates of inflow and outflow may be different, which means that 
the variation of the amount of liquid in the problem must also be taken into account. 


EXAMPLE2 | Compound Interest 


Suppose that a sum of money, Sọ, is deposited in a bank or money fund that pays interest at an 
annual rate r. The value S(t) of the investment at any time t depends on the frequency with which 
interest is compounded as well as on the interest rate. Financial institutions have various policies 
concerning compounding: some compound monthly, some weekly, and some even daily. Assume 
that compounding takes place continuously. Set up an initial value problem that describes the growth 
of the investment. 


2.3 Modeling with First-Order Differential Equations 


Solution: 

The rate of change of the value of the investment is d S/ dt, and this quantity is equal to the rate at 

which interest accrues, which is the interest rate r times the current value of the investment S(t). Thus 
dS 
T 

is the differential equation that governs the process. If we let t denote the time, in years, since the 

original deposit, the corresponding initial condition is 


S(O) = So. (12) 


rs (11) 


Then the solution of the initial value problem (8) gives the balance S(t) in the account at any time 
t. This initial value problem is readily solved, since the differential equation (11) is both linear and 
separable. Consequently, by solving equations (11) and (12), we find that 


S(t) = Soe”. (13) 


Thus a bank account with continuously compounding interest grows exponentially. 


The model in Example 2 is easily extended to situations involving deposits or withdrawals 
in addition to the accrual of interest, dividends, or annual capital gains. If we assume that the 
deposits or withdrawals take place at a constant rate k, then equation (11) is replaced by 


dS 
eee 
or, in standard form, 
dS 
r“ rS =k, (14) 


where k is positive for deposits and negative for withdrawals. 
Equation (14) is linear with the integrating factor e~"’, so its general solution is 


k 
S(t) = ce" — -, 
7 


where c is an arbitrary constant. To satisfy the initial condition (12), we must choose c = 
So + k/ r. Thus the solution of the initial value problem (10), (8) is 


k 
S(t) = Soe + ce —1). (15) 


The first term in expression (15) is the part of S(t) that is due to the return accumulated on 
the initial amount So, and the second term is the part that is due to the deposit or withdrawal 
rate k. 

The advantage of stating the problem in this general way without specific values for So, 
r, or k lies in the generality of the resulting formula (15) for S(t). With this formula we can 
readily compare the results of different investment programs or different rates of return. 

For instance, suppose that one opens an individual retirement account (IRA) at age 25 
and makes annual investments of $2000 thereafter in a continuous manner. Assuming a rate 
of return of 8%, what will be the balance in the IRA at age 65? We have Sọ = 0, r = 0.08, 
and k = $2000, and we wish to determine $(40). From equation (15) we have 


S(40) = 25,000(e?? — 1) = $588,313. (16) 


It is interesting to note that the total amount invested is $80,000, so the remaining amount of 
$508,313 results from the accumulated return on the investment. The balance after 40 years 
is also fairly sensitive to the assumed rate. For instance, $(40) = $508,948 if r = 0.075 and 
S(40) = $681,508 if r = 0.085. 

Let us now examine the assumptions that have gone into the model. First, we have 
assumed that the return is compounded continuously and that additional capital is invested 
continuously. Neither of these is true in an actual financial situation. We have also assumed 
that the return rate r is constant for the entire period involved, whereas in fact it is likely to 
fluctuate considerably. Although we cannot reliably predict future rates, we can use solution 
(15) to determine the approximate effect of different rate projections. It is also possible to 
consider r and k in equation (14) to be functions of t rather than constants; in that case, of 
course, the solution may be much more complicated than equation (15). 

The initial value problem (10), (8) and the solution (15) can also be used to analyze a 
number of other financial situations, including annuities, mortgages, and automobile loans. 
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Let us now compare the results from the model with continuously compounded interest 
(and no other deposits or withdrawals) with the corresponding situation in which compounding 
occurs at finite time intervals. If interest is compounded once a year, then after t years 


S(t) = S(1-- r)'. 


If interest is compounded twice a year, then at the end of 6 months the value of the investment 
is So( 1 + (r/2)), and at the end of 1 year it is So( 1 + r/2)?. Thus, after t years, we have 


2t 
r 
S(t) = s(1 J ;) 


In general, if interest is compounded m times per year, then 
mt 
S(t) = sy(1 + Z) . (17) 
m 
The relation between formulas (13) and (17) is clarified if we recall from calculus that 


lim s(1 ES Ly = Se. 
moo m 

The same model applies equally well to more general investments in which dividends and 
perhaps capital gains can also accumulate, as well as interest. In recognition of this fact, we 
will from now on refer to r as the rate of return. 

Table 2.3.1 shows the effect of changing the frequency of compounding for a return 
rate r of 896. The second and third columns are calculated from equation (17) for quarterly 
and daily compounding, respectively, and the fourth column is calculated from equation (13) 
for continuous compounding. The results show that the frequency of compounding is not 
particularly important in most cases. For example, during a 10-year period the difference 
between quarterly and continuous compounding is $17.50 per $1000 invested, or less than 
$2/year. The difference would be somewhat greater for higher rates of return and less for lower 
rates. From the first row in the table, we see that for the return rate r = 8%, the annual yield 
for quarterly compounding is 8.24% and for daily or continuous compounding it is 8.33%. 


Growth of Capital at a Return Rater = 8% 
for Several Modes of Compounding 


TABLE 2.3.1 


S(t)/S(t)) From Equation (17) S(t)/S(to) 
m=4 m = 365 From Equation (13) 
1 1.0824 1.0833 1.0833 
2 1.1717 1.1735 1.1735 
3 1.4859 1.4918 1.4918 
10 2.2080 2.2253 2.2255 
20 4.8754 4.9522 4.9530 
30 10.7652 11.0203 11.0232 
40 23.7699 24.5239 24.5325 


EXAMPLE 3 | Chemicals in a Pond 


Consider a pond that initially contains 10 million gallons of fresh water. Water containing an 
undesirable chemical flows into the pond at the rate of 5 million gallons per year, and the mixture 
in the pond flows out at the same rate. The concentration y(t) of chemical in the incoming water 
varies periodically with time according to the expression y(t) = 2 + sin(2r) g/gal. Construct a 
mathematical model of this flow process and determine the amount of chemical in the pond at any 
time. Plot the solution and describe in words the effect of the variation in the incoming concentration. 


Solution: 


Since the incoming and outgoing flows of water are the same, the amount of water in the pond remains 
constant at 107 gal. Let us denote time by t, measured in years, and the chemical by Q(t), measured 
in grams. This example is similar to Example 1, and the same inflow/outflow principle applies. Thus 


—— = rate in — rate out, 
dt 
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where “rate in" and “rate out" refer to the rates at which the chemical flows into and out of the pond, 
respectively. The rate at which the chemical flows in is given by 


rate in = (5 x 10°) gal/yr (2 + sin(27)) g/gal. (18) 
The concentration of chemical in the pond is Q(t) / 107 g/gal, so the rate of flow out is 
rate out = (5 x 10°) gal/year (Q(t) / 10") g/gal = Q(t) /2 glyr. (19) 
Thus we obtain the differential equation 
d t 
- = (5 x 106) (2 + sin(2t)) — ao 3 (20) 


where each term has the units of g/yr. 

To make the coefficients more manageable, it is convenient to introduce a new dependent 
variable defined by q(t) = Q(t) / 106, or O(t) = 10° q(t). This means that g(t) is measured 
in millions of grams, or megagrams (metric tons). If we make this substitution in equation (20), then 
each term contains the factor 10°, which can be canceled. If we also transpose the term involving 
q(t) to the left-hand side of the equation, we finally have 


dd (1g = 10+ Ssin(21) (21) 
—4+-q= sin n 
dr ^ 24 

Originally, there is no chemical in the pond, so the initial condition is 


q(0) — 0. Q2) 


Equation (21) is linear, and although the right-hand side is a function of time, the coefficient 
of q is a constant. Thus the integrating factor is e'/?. Multiplying equation (21) by this factor and 
integrating the resulting equation, we obtain the general solution 


40 10 
q(t) = 20 — — cos(2t) + — sin(2t) + ce !/?, (23) 
17 17 
The initial condition (22) requires that c = —300/17, so the solution of the initial value problem 


(17), (18) is 


00 2. 


(t) = 20 i) cos(2t) + 10 sin(2t) 3 (24) 
eee 17 17 17 


A plot of the solution (24) is shown in Figure 2.3.3, along with the line g = 20 (shown in black). 
The exponential term in the solution is important for small t, but it diminishes rapidly as t increases. 
Later, the solution consists of an oscillation, due to the sin(2t) and cos(2t) terms, about the constant 
level q = 20. Note that if the sin(2t) term were not present in equation (21), then q = 20 would be 
the equilibrium solution of that equation. 


Solution of the initial value problem (17), (18): 
dq/dt +q/2 = 10 + 5sin(2t), q(0) = 0. 


Let us now consider the adequacy of the mathematical model itself for this problem. The model 
rests on several assumptions that have not yet been stated explicitly. In the first place, the amount of 
water in the pond is controlled entirely by the rates of flow in and out— none is lost by evaporation 
or by seepage into the ground, and none is gained by rainfall. The same is true of the chemical; it 
flows into and out of the pond, but none is absorbed by fish or other organisms living in the pond. In 
addition, we assume that the concentration of chemical in the pond is uniform throughout the pond. 
Whether the results obtained from the model are accurate depends strongly on the validity of these 
simplifying assumptions. 
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EXAMPLE 4 | Escape Velocity 


A body of constant mass m is projected away from the earth in a direction perpendicular to the earth’s 
surface with an initial velocity vo. Assuming that there is no air resistance, but taking into account 
the variation of the earth's gravitational field with distance, find an expression for the velocity during 
the ensuing motion. Also find the initial velocity that is required to lift the body to a given maximum 
altitude A max above the surface of the earth, and find the least initial velocity for which the body will 
not return to the earth; the latter is the escape velocity. 


A body in the earth's gravitational field is pulled 


towards the center of the earth. 


Solution: 


Let the positive x-axis point away from the center of the earth along the line of motion with 
x = O lying on the earth’s surface; see Figure 2.3.4. The figure is drawn horizontally to remind 
you that gravity is directed toward the center of the earth, which is not necessarily downward from 
a perspective away from the earth's surface. The gravitational force acting on the body (that is, its 
weight) is inversely proportional to the square of the distance from the center of the earth and is given 
by w(x) = —k/(x + R)?, where k is a constant, R is the radius of the earth, and the minus sign 
signifies that w(x) is directed in the negative x direction. We know that on the earth's surface w(0) 
is given by —mg, where g is the acceleration due to gravity at sea level. Therefore, k = mg R? and 


w(x) = —-———_.. (25) 
X 


Since there are no other forces acting on the body, the equation of motion is 


d R? 
QU LL R (26) 
dt (R +x)? 
and the initial condition is 
v(0) = vo. (27) 


Unfortunately, equation (26) involves too many variables since it depends on f, x, and v. To 
remedy this situation, we can eliminate t from equation (26) by thinking of x, rather than f, as the 
independent variable. Then we can express dv / dt in terms of dv / dx by using the chain rule; hence 


dv dv dx dv 


= =y 5 
dt dx dt dx 


and equation (26) is replaced by 


d R? 
ju e (28) 
dx (R +x)? 


Equation (28) is separable but not linear, so by separating the variables and integrating, we obtain 
y? g R? 


—— a 2 
2 mea d 


Since x = 0 when f = 0, the initial condition (27) at t = 0 can be replaced by the condition that 
y = vo when x = 0. Hence c = (6/2) — gR and 


2 2gR? 
vet Wg ARN (30) 


Note that equation (30) gives the velocity as a function of altitude rather than as a function of time. 
The plus sign must be chosen if the body is rising, and the minus sign must be chosen if it is falling 
back to earth. 


2.3 Modeling with First-Order Differential Equations 


To determine the maximum altitude A,,,, that the body reaches, we set v = 0 and x = Ajax in 
equation (30) and then solve for A max, obtaining 
v : R 
Amax = e 2' GI) 
2gR — vo 


Solving equation (31) for vo, we find the initial velocity required to lift the body to the altitude A max» 
namely, 


Amax 


R————. 32 
R + Amax ( ) 


vo = 4/28 


The escape velocity v, is then found by letting Amax — oo. Consequently, 


Ve = V2gR. (33) 


The numerical value of v, is approximately 6.9 mi/s, or 11.1 km/s. 

The preceding calculation of the escape velocity neglects the effect of air resistance, so the 
actual escape velocity (including the effect of air resistance) is somewhat higher. On the other hand, 
the effective escape velocity can be significantly reduced if the body is transported a considerable 
distance above sea level before being launched. Both gravitational and frictional forces are thereby 
reduced; air resistance, in particular, diminishes quite rapidly with increasing altitude. You should 
keepin mind also that it may well be impractical to impart too large an initial velocity instantaneously; 


47 


space vehicles, for instance, receive their initial acceleration during a period of a few minutes. 


Problems 


1. Consider a tank used in certain hydrodynamic experiments. 
After one experiment the tank contains 200 L of a dye solution with 
a concentration of 1 g/L. To prepare for the next experiment, the tank 
is to be rinsed with fresh water flowing in at a rate of 2 L/min, the 
well-stirred solution flowing out at the same rate. Find the time that 
will elapse before the concentration of dye in the tank reaches 196 of 
its original value. 


2. A tank initially contains 120 L of pure water. A mixture 
containing a concentration of ^; g/L of salt enters the tank at a rate 
of 2 L/min, and the well-stirred mixture leaves the tank at the same 
rate. Find an expression in terms of y for the amount of salt in the 
tank at any time f. Also find the limiting amount of salt in the tank as 
t — oo. 


3. A tank contains 100 gal of water and 50 oz of salt. Water 
1 1 
containing a salt concentration of 1 1+ 5 sint | oz/gal flows into 


the tank at a rate of 2 gal/min, and the mixture in the tank flows out at 
the same rate. 
a. Find the amount of salt in the tank at any time. 
Q b. Plot the solution for a time period long enough so that you 
see the ultimate behavior of the graph. 
C. Thelong-time behavior of the solution is an oscillation about 
a certain constant level. What is this level? What is the amplitude 
of the oscillation? 


4. Suppose that a tank containing a certain liquid has an outlet near 
the bottom. Let A(t) be the height of the liquid surface above the outlet 
at time f. Torricelli's^ principle states that the outflow velocity v at the 
outlet is equal to the velocity of a particle falling freely (with no drag) 
from the height h. 


? Evangelista Torricelli (1608-1647), successor to Galileo as court 
mathematician in Florence, published this result in 1644. In addition to this 
work in fluid dynamics, he is also known for constructing the first mercury 
barometer and for making important contributions to geometry. 


a. Show that v — 4/2gh, where g is the acceleration due to 
gravity. 

b. By equating the rate of outflow to the rate of change of liquid 
in the tank, show that A(t) satisfies the equation 


di 
AQ) 77 — Deh, (34) 


where A(/) is the area of the cross section of the tank at height h 
and a is the area of the outlet. The constant o is a contraction 
coefficient that accounts for the observed fact that the cross 
section of the (smooth) outflow stream is smaller than a. The 
value of a for water is about 0.6. 

C. Consider a water tank in the form of a right circular cylinder 
that is 3m high above the outlet. The radius of the tank is 1m, 
and the radius of the circular outlet is 0.1 m. If the tank is initially 
full of water, determine how long it takes to drain the tank down 
to the level of the outlet. 


5. Suppose that a sum Sp is invested at an annual rate of return r 
compounded continuously. 
a. Find the time T required for the original sum to double in 
value as a function of r. 
b. Determine T ifr = 7%. 
C. Find the return rate that must be achieved if the initial 
investment is to double in 8 years. 


6. A young person with no initial capital invests k dollars per 
year at an annual rate of return r. Assume that investments are made 
continuously and that the return is compounded continuously. 

a. Determine the sum S(t) accumulated at any time t. 

b. Ifr = 7.596, determine k so that $1 million will be available 
for retirement in 40 years. 

c. If k — $2000/year, determine the return rate r that must be 
obtained to have $1 million available in 40 years. 
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7. A certain college graduate borrows $8000 to buy a car. The 
lender charges interest at an annual rate of 10%. Assuming that interest 
is compounded continuously and that the borrower makes payments 
continuously at a constant annual rate k, determine the payment rate 
k that is required to pay off the loan in 3 years. Also determine how 
much interest is paid during the 3-year period. 


@ 8. A recent college graduate borrows $150,000 at an interest 
rate of 6% to purchase a condominium. Anticipating steady salary 
increases, the buyer expects to make payments at a monthly rate of 
800 + 10r, where f is the number of months since the loan was made. 
a. Assuming that this payment schedule can be maintained, 
when will the loan be fully paid? 
b. Assuming the same payment schedule, how large a loan could 
be paid off in exactly 20 years? 


9. An important tool in archeological research is radiocarbon 
dating, developed by the American chemist Willard F. Libby.? This 
is a means of determining the age of certain wood and plant remains, 
and hence of animal or human bones or artifacts found buried at the 
same levels. Radiocarbon dating is based on the fact that some wood 
or plant remains contain residual amounts of carbon-14, a radioactive 
isotope of carbon. This isotope is accumulated during the lifetime 
of the plant and begins to decay at its death. Since the half-life of 
carbon-14 is long (approximately 5730 years), measurable amounts 
of carbon-14 remain after many thousands of years. If even a tiny 
fraction of the original amount of carbon-14 is still present, then by 
appropriate laboratory measurements the proportion of the original 
amount of carbon-14 that remains can be accurately determined. In 
other words, if Q(t) is the amount of carbon-14 at time ¢ and Qo is 
the original amount, then the ratio Q(t) / Qo can be determined, as 
long as this quantity is not too small. Present measurement techniques 
permit the use of this method for time periods of 50,000 years or more. 

a. Assuming that Q satisfies the differential equation 
Q' — —r Q, determine the decay constant r for carbon-14. 

b. Find an expression for Q(t) at any time t, if Q(0) = Qo. 

€. Suppose that certain remains are discovered in which the 
current residual amount of carbon-14 is 20% of the original 
amount. Determine the age of these remains. 


@ 10. Suppose that a certain population has a growth rate that 
varies with time and that this population satisfies the differential 
equation 


dy edt 
— =(0.5 t)-. 
di ( + sint) 5 
a. If y(0) = 1, find (or estimate) the time 7 at which the 


population has doubled. Choose other initial conditions and 
determine whether the doubling time 7 depends on the initial 
population. 

b. Suppose that the growth rate is replaced by its average value 
1/10. Determine the doubling time 7 in this case. 

C. Suppose that the term sin: in the differential equation is 
replaced by sin 27 t; that is, the variation in the growth rate has a 
substantially higher frequency. What effect does this have on the 
doubling time 7 ? 

d. Plot the solutions obtained in parts a, b, and c on a single set 
of axes. 


3Willard F. Libby (1908-1980) was born in rural Colorado and received his 
education at the University of California at Berkeley. He developed the method 
of radiocarbon dating beginning in 1947 while he was at the University of 
Chicago. For this work he was awarded the Nobel Prize in Chemistry in 1960. 


McGraw-Hill Encyclopedia of Science and Technology (8th ed.) (New York: 
McGraw-Hill, 1997), Vol. 5, p. 48. 


Q 11. 


problem 


Suppose that a certain population satisfies the initial value 


dy/dt =r(t)y—k, y(0) = yo. 


where the growth rate r(t) is given by r(t) = (1 + sint)/5, and k 
represents the rate of predation. 
@ a. Suppose that k = 1 / 5. Plot y versus t for several values 
of yo between 1/2 and 1. 
b. Estimate the critical initial population ye below which the 
population will become extinct. 
C. Choose other values of k and find the corresponding ye for 
each one. 
Q d. Use the data you have found in parts b and c to plot y; 
versus k. 


12. Newton's law of cooling states that the temperature of an object 
changes at a rate proportional to the difference between its temperature 
and that of its surroundings. Suppose that the temperature of a cup of 
coffee obeys Newton's law of cooling. If the coffee has a temperature 
of 200°F when freshly poured, and 1 min later has cooled to 190°F in 
a room at 70°F, determine when the coffee reaches a temperature of 
150°F. 


13. Heat transfer from a body to its surroundings by radiation, 
based on the Stefan - Boltzmann? law, is described by the differential 
equation 

du 

dt 
where u(t) is the absolute temperature of the body at time t, T is 
the absolute temperature of the surroundings, and œ is a constant 
depending on the physical parameters of the body. However, if u 


is much larger than T, then solutions of equation (35) are well 
approximated by solutions of the simpler equation 


Ht eut (36) 
dt 
Suppose that a body with initial temperature 2000 K is surrounded by 
a medium with temperature 300 K and that a = 2.0 x 107? K~*/s. 
a. Determine the temperature of the body at any time by solving 
equation (36). 
Q b. Plot the graph of u versus t. 
O c. Find the time 7 at which u(r) = 600—that is, twice 
the ambient temperature. Up to this time the error in using 
equation (36) to approximate the solutions of equation (35) is 
no more than 1%. 


= —a(u* — T^), (35) 


@ 14. Consider an insulated box (a building, perhaps) with 
internal temperature u(t). According to Newton’s law of cooling, u 
satisfies the differential equation 
T = ku - TO) (37) 
— = —k(u-— ; 
dt 
where T(t) is the ambient (external) temperature. Suppose that 
T(t) varies sinusoidally; for example, assume that 
T(t) = To + Tı cos( wt). 


5Jozef Stefan (1835-1893), professor of physics at Vienna, stated the radiation 
law on empirical grounds in 1879. His student Ludwig Boltzmann (1844-1906) 
derived it theoretically from the principles of thermodynamics in 1884. 
Boltzmann is best known for his pioneering work in statistical mechanics. 


a. Solve equation (37) and express u(t) in terms of t, k, To, Tj, 
and w. Observe that part of your solution approaches zero as t 
becomes large; this is called the transient part. The remainder of 
the solution is called the steady state; denote it by S(t). 

Q b. Suppose that t is measured in hours and that w = 7/12, 
corresponding to a period of 24 h for T(t). Further, let 
Ty = 60°F, T, = 15°F, and k = 0.2/h. Draw graphs of S(t) 
and T(t) versus t on the same axes. From your graph estimate 
the amplitude R of the oscillatory part of S(t). Also estimate 
the time lag 7 between corresponding maxima of T(t) and S(t). 
C. Letk, To, Tj, and w now be unspecified. Write the oscillatory 
part of S(t) in the form Rcos(w(t — 7)). Use trigonometric 
identities to find expressions for R and 7. Let T; and w have 
the values given in part b, and plot graphs of R and 7 versus k. 


15. Consider a lake of constant volume V containing at time t 
an amount Q(t) of pollutant, evenly distributed throughout the lake 
with a concentration c(t), where c(t) = Q(t)/V. Assume that 
water containing a concentration k of pollutant enters the lake at a 
rate r, and that water leaves the lake at the same rate. Suppose that 
pollutants are also added directly to the lake at a constant rate P. 
Note that the given assumptions neglect a number of factors that may, 
in some cases, be important—for example, the water added or lost 
by precipitation, absorption, and evaporation; the stratifying effect of 
temperature differences in a deep lake; the tendency of irregularities 
in the coastline to produce sheltered bays; and the fact that pollutants 
are deposited unevenly throughout the lake but (usually) at isolated 
points around its periphery. The results below must be interpreted in 
light of the neglect of such factors as these. 
a. Ifattime t = 0 the concentration of pollutant is co, find an 
expression for the concentration c(t) at any time. What is the 
limiting concentration as t — oo? 
b. If the addition of pollutants to the lake is terminated (k — 0 
and P = 0 fort > 0), determine the time interval T that must 
elapse before the concentration of pollutants is reduced to 5096 
of its original value; to 10% of its original value. 
c. Table 2.3.2 contains data? for several of the Great Lakes. 
Using these data, determine from part b the time T that is needed 
to reduce the contamination of each of these lakes to 1046 of the 
original value. 


TABLE 2.3.2 Volume and Flow Data for the Great 


Lakes 
Lake 10? x V (km?) r (km? /year) 
Superior 12.2 65.2 
Michigan 4.9 158 
Erie 0.46 iS 
Ontario 1.6 209 


@ 16. A ball with mass 0.15 kg is thrown upward with initial 
velocity 20 m/s from the roof of a building 30 m high. Neglect air 
resistance. 
a. Find the maximum height above the ground that the ball 
reaches. 
b. Assuming that the ball misses the building on the way down, 
find the time that it hits the ground. 
Q c. Plot the graphs of velocity and position versus time. 


6 This problem is based on R. H. Rainey, “Natural Displacement of Pollution 
from the Great Lakes," Science 155 (1967), pp. 1242-1243; the information in 
the table was taken from that source. 
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@ 17. Assume that the conditions are as in Problem 16 except 
that there is a force due to air resistance of magnitude |v|/30 directed 
opposite to the velocity, where the velocity v is measured in m/s. 
a. Find the maximum height above the ground that the ball 
reaches. 
b. Find the time that the ball hits the ground. 
Q c. Plot the graphs of velocity and position versus time. 
Compare these graphs with the corresponding ones in Problem 
16. 


@ 18. Assume that the conditions are as in Problem 16 except that 
there is a force due to air resistance of magnitude v?/1325 directed 
opposite to the velocity, where the velocity v is measured in m/s. 
a. Find the maximum height above the ground that the ball 
reaches. 
b. Find the time that the ball hits the ground. 
Q c. Plot the graphs of velocity and position versus time. 
Compare these graphs with the corresponding ones in Problems 
16 and 17. 


19. A body of constant mass m is projected vertically upward with 
an initial velocity vp in a medium offering a resistance k|v|, where k 
is a constant. Neglect changes in the gravitational force. 


a. Find the maximum height x,, attained by the body and the 
time fm at which this maximum height is reached. 


b. Show that if kvo/ mg < 1, then tm and xm can be expressed 
as 
v 1 kv bf 
in = M : + : ee. 
g 2mg 3\mg 
Vo 2 kvo 1 kvo ? 
Xm — 1 =F 
2g 3mg 2\mg 


C. Show that the quantity kvo/ mg is dimensionless. 


20. A body of mass m is projected vertically upward with an 
initial velocity vg in a medium offering a resistance k|v|, where k 
is a constant. Assume that the gravitational attraction of the earth is 
constant. 


a. Find the velocity v(t) of the body at any time. 


b. Use the result of part a to calculate the limit of v(t) as 
k — O0-—that is, as the resistance approaches zero. Does this 
result agree with the velocity of a mass m projected upward with 
an initial velocity vo in a vacuum? 


C. Use the result of part a to calculate the limit of v(t) as 
m — 0—that is, as the mass approaches zero. 


21. A body falling in a relatively dense fluid, oil for example, is 
acted on by three forces (see Figure 2.3.5): a resistive force R, a 
buoyant force B, and its weight w due to gravity. The buoyant force 
is equal to the weight of the fluid displaced by the object. For a slowly 
moving spherical body of radius a, the resistive force is given by 
Stokes's law, R = 6r j1a|v|, where v is the velocity of the body, and 
u is the coefficient of viscosity of the surrounding fluid." 


7Sir George Gabriel Stokes (1819-1903) was born in Ireland but spent most of 
his life at Cambridge University, first as a student and later as a professor. 
Stokes was one of the foremost applied mathematicians of the nineteenth 
century, best known for his work in fluid dynamics and the wave theory of 
light. The basic equations of fluid mechanics (the Navier- Stokes equations) 
are named partly in his honor, and one of the fundamental theorems of vector 
calculus bears his name. He was also one of the pioneers in the use of divergent 
(asymptotic) series. 
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a. Find the limiting velocity of a solid sphere of radius a and 
density p falling freely in a medium of density p’ and coefficient 
of viscosity p. 

b. In 1910R. A. Millikan? studied the motion of tiny droplets of 
oil falling in an electric field. A field of strength E exerts a force 
Ee on a droplet with charge e. Assume that E has been adjusted 
so the droplet is held stationary (v — 0) and that w and B are 
as given above. Find an expression for e. Millikan repeated this 
experiment many times, and from the data that he gathered he 
was able to deduce the charge on an electron. 


RAAB 


l 


A body falling in a dense fluid (see 


Problem 21). 


22. Let v(t) and w(t) be the horizontal and vertical components, 
respectively, of the velocity of a batted (or thrown) baseball. In the 
absence of air resistance, v and w satisfy the equations 

dv dw 


= 0, = B 
di di E 


a. Show that 


v=ucosA, w=-—gt+usinA, 


where u is the initial speed of the ball and A is its initial angle of 
elevation. 

b. Let x(t) and y(t) be the horizontal and vertical coordinates, 
respectively, of the ball at time t. If x(0) = 0 and y(0) = h, find 
x(t) and y(t) at any time t. 

O c. Let g = 32 fUs?, u = 125 ft/s, and h = 3 ft. Plot the 
trajectory of the ball for several values of the angle A; that is, 
plot x(t) and y(t) parametrically. 

d. Suppose the outfield wall is at a distance L and has height H. 
Find a relation between u and A that must be satisfied if the ball 
is to clear the wall. 

e. Suppose that L = 350 ft and H = 10 ft. Using the relation 
in part (d), find (or estimate from a plot) the range of values of A 
that correspond to an initial velocity of u = 110 ft/s. 

f. For L = 350 and H = 10, find the minimum initial velocity 
u and the corresponding optimal angle A for which the ball will 
clear the wall. 


@ 23. A more realistic model (than that in Problem 22) of a 
baseball in flight includes the effect of air resistance. In this case the 
equations of motion are 


dt "e odi 


8Robert A. Millikan (1868-1953) was educated at Oberlin College and 
Columbia University. Later he was a professor at the University of Chicago 
and California Institute of Technology. His determination of the charge on an 
electron was published in 1910. For this work, and for other studies of the 
photoelectric effect, he was awarded the Nobel Prize for Physics in 1923. 


a. Determine v(t) and w(t) in terms of initial speed u and initial 
angle of elevation A. 

b. Find x(t) and y(t) if x(0) = 0 and y(0) = h. 

Q c. Plot the trajectory ofthe ballforr — 1/5,u — 125, h — 3, 
and for several values of A. How do the trajectories differ from 
those in Problem 22 with r — 0? 

d. Assuming that r — 1/5 and / — 3, find the minimum initial 
velocity u and the optimal angle A for which the ball will clear a 
wall that is 350 ft distant and 10 ft high. Compare this result with 
that in Problem 22f. 


24. Brachistochrone Problem. One of the famous problems in 
the history of mathematics is the brachistochrone? problem: to find 
the curve along which a particle will slide without friction in the 
minimum time from one given point P to another Q, the second 
point being lower than the first but not directly beneath it (see Figure 
2.3.6). This problem was posed by Johann Bernoulli in 1696 as a 
challenge problem to the mathematicians of his day. Correct solutions 
were found by Johann Bernoulli and his brother Jakob Bernoulli and 
by Isaac Newton, Gottfried Leibniz, and the Marquis de L'Hópital. 
The brachistochrone problem is important in the development of 
mathematics as one of the forerunners of the calculus of variations. 

In solving this problem, it is convenient to take the origin as the 
upper point P and to orient the axes as shown in Figure 2.3.6. The 
lower point Q has coordinates (xo, yo). It is then possible to show 
that the curve of minimum time is given by a function y = $(x) that 
satisfies the differential equation 


(L9 yy =k, (38) 
where k? is a certain positive constant to be determined later. 


a. Solve equation (38) for y'. Why is it necessary to choose the 
positive square root? 
b. Introduce the new variable ¢ by the relation 


y 2 sin’ t. (39) 
Show that the equation found in part a then takes the form 
2k? sin? t dt = dx. (40) 
C. Letting 0 = 2t, show that the solution of equation (40) for 
which x = 0 when y = 0 is given by 
x 2K(0 —sin0)/2, y—K(1—cos0)/2. (41) 
Equations (41) are parametric equations of the solution of 
equation (38) that passes through (0, 0). The graph of equations 
(41) is called a cycloid. 
d. If we make a proper choice of the constant &, then the cycloid 
also passes through the point (xo, yo) and is the solution of the 


brachistochrone problem. Find k if xy = 1 and yọ = 2. 
P x 


Q(xo, yg) 


YY 


The brachistochrone 
(see Problem 24). 


?The word “brachistochrone” comes from the Greek words brachistos, 
meaning shortest, and chronos, meaning time. 
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24 Differences Between Linear and 
Nonlinear Differential Equations 


Up to now, we have been primarily concerned with showing that first-order differential 
equations can be used to investigate many different kinds of problems in the natural sciences, 
and with presenting methods of solving such equations if they are either linear or separable. 
Now it is time to turn our attention to some more general questions about differential equations 
and to explore in more detail some important ways in which nonlinear equations differ from 
linear ones. 


Existence and Uniqueness of Solutions. So far, we have discussed a number of initial 
value problems, each of which had a solution and apparently only one solution. That raises the 
question of whether this is true of all initial value problems for first-order equations. In other 
words, does every initial value problem have exactly one solution? This may be an important 
question even for nonmathematicians. If you encounter an initial value problem in the course 
of investigating some physical problem, you might want to know that it has a solution before 
spending very much time and effort in trying to find it. Further, if you are successful in 
finding one solution, you might be interested in knowing whether you should continue a 
search for other possible solutions or whether you can be sure that there are no other solutions. 
For linear equations, the answers to these questions are given by the following fundamental 
theorem. 


Theorem 2.4.1 | Existence and Uniqueness Theorem for 


First-Order Linear Equations 


If the functions p and g are continuous on an open interval J:a < t < f) containing the point f = fo, 
then there exists a unique function y = ¢ (t) that satisfies the differential equation 


y'- p(t)y = g(t) (1) 
for each t in Z, and that also satisfies the initial condition 
y(to) = yo. (2) 


where yo is an arbitrary prescribed initial value. 


Observe that Theorem 2.4.1 states that the given initial value problem has a solution and 
also that the problem has only one solution. In other words, the theorem asserts both the 
existence and the uniqueness of the solution of the initial value problem (1). In addition, it 
states that the solution exists throughout any interval J containing the initial point tọ in which 
the coefficients p and g are continuous. That is, the solution can be discontinuous or fail to 
exist only at points where at least one of p and g is discontinuous. Such points can often be 
identified at a glance. 

The proof of this theorem is partly contained in the discussion in Section 2.1 leading to 
the formula (see equation (32) in Section 2.1) 


noy = f wen dt +c, (3) 


where [equation (30) in Section 2.1] 
L(t) = exp f p(t) dt. (4) 


The derivation in Section 2.1 shows that if equation (1) has a solution, then it must be given by 
equation (3). By looking slightly more closely at that derivation, we can also conclude that the 
differential equation (1) must indeed have a solution. Since p is continuous fora < t < f, 
it follows that on the interval a < t < f, p is defined, is a differentiable function, and is 
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nonzero. Upon multiplying equation (1) by u(t), we obtain 
(H(t) y)’ = ug). (5) 


Since both j| and g are continuous, the function jg is integrable, and equation (3) follows from 
equation (5). Further, the integral of pg is differentiable, so y as given by equation (3) exists 
and is differentiable throughout the interval a < t < 0. By substituting the expression for y 
from equation (3) into either equation (1) or equation (5), you can verify that this expression 
satisfies the differential equation throughout the interval a < t < 5. Finally, the initial 
condition (2) determines the constant c uniquely, so there is only one solution of the initial 
value problem; this completes the proof. 

Equation (4) determines the integrating factor u(t) only up to a multiplicative factor that 
depends on the lower limit of integration. If we choose this lower limit to be fp, then 


t 
u(t) = exp | p(s)ds, (6) 
to 


and it follows that u (tọ) = 1. Using the integrating factor given by equation (6), and choosing 
the lower limit of integration in equation (3) also to be tọ, we obtain the general solution of 
equation (1) in the form 


1 t 
y=—~ f u(s)g(s)ds t c |. (7) 
n) to 
To satisfy the initial condition (2), we must choose c = yo. Thus the solution of the initial 
value problem (1) is 
1 t 
jee—— J us) gCs) ds + yo |, (8) 
n) to 


where u(t) is given by equation (6). 
Turning now to nonlinear differential equations, we must replace Theorem 2.4.1 by a 
more general theorem, such as the one that follows. 


Theorem 2.4.2 | Existence and Uniqueness Theorem for 


First-Order Nonlinear Equations 


Let the functions f and 0f/0y be continuous in some rectangle a < t < B, y < y < 6 containing 
the point (£o, yo). Then, in some interval tọ — h < t < tọ + h contained ina < t < D,there isa 
unique solution y = ¢ (t) of the initial value problem 


y = f(t, y), y(to) = yo- (9) 


Observe that the hypotheses in Theorem 2.4.2 reduce to those in Theorem 2.4.1 if the 
differential equation is linear. In this case 


—p(t), 


Of (t, 
f(t, y) = —p(t)y + g(t) and VAM = 


o 
so the continuity of f and a is equivalent to the continuity of p and g. 


The proof of Theorem 2.4.1 was comparatively simple because it could be based on the 
expression (3) that gives the solution of an arbitrary linear equation. There is no corresponding 
expression for the solution of the differential equation (9), so the proof of Theorem 2.4.2 is 
much more difficult. It is discussed to some extent in Section 2.8 and in greater depth in more 
advanced books on differential equations. 

We note that the conditions stated in Theorem 2.4.2 are sufficient to guarantee the 
existence of a unique solution of the initial value problem (6) in some interval (to — h, to + h), 
but they are not necessary. That is, the conclusion remains true under slightly weaker 
hypotheses about the function f. In fact, the existence of a solution (but not its uniqueness) 
can be established on the basis of the continuity of f alone. 

An important geometrical consequence of the uniqueness parts of Theorems 2.4.1 and 
2.4.2 is that the graphs of two solutions cannot intersect each other. Otherwise, there would 
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be two solutions that satisfy the initial condition corresponding to the point of intersection, in 
contradiction to Theorem 2.4.1 or 2.4.2. 
We now consider some examples. 


EXAMPLE 1 


Use Theorem 2.4.1 to find an interval in which the initial value problem 
ty’ +2y = 4C, (10) 
x1)22 (41) 


has a unique solution. Then do the same when the initial condition is changed to y( —1) — 2. 


Solution: 


Rewriting equation (10) in the standard form (1), we have 
y+ Q/t)y = 4t, 


so p(t) = 2/t and g(t) = 4t. Thus, for this equation, g is continuous for all t, while p is continuous 
only fort < Oor for t > 0. The interval t > 0 contains the initial point; consequently, Theorem 
2.4.1 guarantees that the problem (7), (8) has a unique solution on the interval 0 < t < oo. In 
Example 4 of Section 2.1 we found the solution of this initial value problem to be 


-—— 
yeu 1599 (12) 


Now suppose that the initial condition (11) is changed to y(—1) — 2. Then Theorem 2.4.1 
asserts the existence of a unique solution for t < 0. As you can readily verify, the solution is again 
given by equation (12), but now on the interval t < 0. 


EXAMPLE 2 


Apply Theorem 2.4.2 to the initial value problem 


dy _ 3x? +4x+2 
dx  Xy-1) 


» y0) =-1. (13) 
Repeat this analysis when the initial condition is changed to y(0) = 1. 


Solution: 


Note that Theorem 2.4.1 is not applicable to this problem since the differential equation is nonlinear. 
To apply Theorem 2.4.2, observe that 


K NS ho Of = 3x? +4x +2 
dd T 3y 77 3G =? 


2(y = 1) 

Thus each of these functions is continuous everywhere except on the line y = 1. Consequently, a 
rectangle can be drawn about the initial point (0, —1) in which both f and Of/0y are continuous. 
Therefore, Theorem 2.4.2 guarantees that the initial value problem has a unique solution in some 
interval about x = 0. However, even though the rectangle can be stretched infinitely far in both the 
positive and the negative x directions, this does not necessarily mean that the solution exists for all x. 
Indeed, the initial value problem (9) was solved in Example 2 of Section 2.2, and the solution exists 
only for x > —2. 

Now suppose we change the initial condition to y(0) — 1. The initial point now lies on the 
line y = 1, so no rectangle can be drawn about it within which f and Of/Oy are continuous. 
Consequently, Theorem 2.4.2 says nothing about possible solutions of this modified problem. 
However, if we separate the variables and integrate, as in Section 2.2, we find that 


y? —2y 2 x? + 2x? 42x +e. 


Further, if x = 0 and y = 1, then c = — 1. Finally, by solving for y, we obtain 


y-lctwyxi)-c2x2-2x. (14) 


54 CHAPTER2 First-Order Differential Equations 


Equation (14) provides two functions that satisfy the given differential equation for x > 0 and also 
satisfy the initial condition y(0) — 1. The fact that there are two solutions to this initial value problem 
reinforces the conclusion that Theorem 2.4.2 does not apply to this initial value problem. 


EXAMPLE 3 


Consider the initial value problem 


y =y", y(0) =0 (15) 
for t > 0. Apply Theorem 2.4.2 to this initial value problem and then solve the problem. 


Solution: 


The function f(t, y) = y!/? is continuous everywhere, but a = : y ?/5 does not exist when 
y = 0, and hence it is not continuous there. Thus Theorem 2.4.2 does not apply to this problem, and 
no conclusion can be drawn from it. However, by the remark following Theorem 2.4.2, the continuity 
of f does ensure the existence of solutions, though not their uniqueness. 

To understand the situation more clearly, we must actually solve the problem, which is easy to 
do since the differential equation is separable. Thus we have 


y My — dt, 
so 

3 9/3 

= =t+ 

3^ : 
and 


2 3/2 


The initial condition is satisfied if c = 0, so 


2 3/2 
y=HoH= G) > t>0 (16) 
satisfies both of equations (15). On the other hand, the function 
3/2 
2 
y = Q(t) --($) > £20 (17) 
is also a solution of the initial value problem. Moreover, the function 
y-cv()-20, rz0 (18) 


is yet another solution. Indeed, for an arbitrary positive fo, the functions 


0, if0 « t «€ to, 


" 32 | (19) 
&(3« - 0) , dft >t 


yHX() = 


are continuous, are differentiable (in particular atf = tọ), and are solutions of the initial value problem 
(11). Hence this problem has an infinite family of solutions; see Figure 2.4.1, where a few of these 
solutions are shown. 


y 


Several solutions of the initial value problem 
y'= yl? y(0) — 0. 
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As already noted, the nonuniqueness of the solutions of the problem (11) does not contradict the 
existence and uniqueness theorem, since Theorem 2.4.2 is not applicable if the initial point lies on 
the t-axis. If (19, yo) is any point not on the t-axis, however, then the theorem guarantees that there 
is a unique solution of the differential equation y’ = y 


1/3 passing through (to, yo). 


Interval of Existence. According to Theorem 2.4.1, the solution of a linear equation (1) 
y t p(t)y = g(t), 


subject to the initial condition y(fọ) = yo, exists throughout any interval about t = tọ in 
which the functions p and g are continuous. Thus vertical asymptotes or other discontinuities 
in the solution can occur only at points of discontinuity of p or g. For instance, the solutions 
in Example 1 (with one exception) are asymptotic to the y-axis, corresponding to the 
discontinuity at / = 0 in the coefficient p(t) = 2/t, but none of the solutions has any other 
point where it fails to exist and to be differentiable. The one exceptional solution shows that 
solutions may sometimes remain continuous even at points of discontinuity of the coefficients. 

On the other hand, for a nonlinear initial value problem satisfying the hypotheses of 
Theorem 2.4.2, the interval in which a solution exists may be difficult to determine. The 
solution y = $ (t) is certain to exist as long as the point (t, 6 (1)) remains within a region in 
which the hypotheses of Theorem 2.4.2 are satisfied. This is what determines the value of h 
in that theorem. However, since ¢ (1) is usually not known, it may be impossible to locate the 
point (t, @(t)) with respect to this region. In any case, the interval in which a solution exists 
may have no simple relationship to the function f in the differential equation y' = f(t, y). 
This is illustrated by the following example. 


EXAMPLE 4 


Solve the initial value problem 


y ey yet (20) 


and determine the interval in which the solution exists. 


Solution: 
ede 2, 4 OF 
Theorem 2.4.2 guarantees that this problem has a unique solution since f(t, y) = y~ and a 2y 
y 
are continuous everywhere. To find the solution, we separate the variables and integrate with the 
result that 


y dy 2 dt (21) 
and 
= y! ed 
Then, solving for y, we have 
= (22) 
m tlc. 
To satisfy the initial condition, we must choose c — —1, so 
= (23) 
eed 


is the solution of the given initial value problem. Clearly, the solution becomes unbounded as t — 1; 
therefore, the solution exists only in the interval —oo < f < 1. There is no indication from the 
differential equation itself, however, that the point ? = 1 is in any way remarkable. Moreover, if the 
initial condition is replaced by 


y(0) = yo. (24) 
then the constant c in equation (22) must be chosen to be c = —1/ yo (yo Æ 0), and it follows that 
M 
y (25) 


E 1 — yot 
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is the solution of the initial value problem with the initial condition (24). Observe that the solution 
(25) becomes unbounded ast — 1/ yo, so the interval of existence of the solutionis —oo < t < 1/y 
if yo > 0, and is 1/ yo < t < oo if yo < 0. This example illustrates another feature of initial value 
problems for nonlinear equations: the singularities of the solution may depend in an essential way 
on the initial conditions as well as on the differential equation. 


General Solution. Another way in which linear and nonlinear equations differ concerns 
the concept of a general solution. For a first-order linear differential equation it is possible to 
obtain a solution containing one arbitrary constant, from which all possible solutions follow 
by specifying values for this constant. For nonlinear equations this may not be the case; even 
though a solution containing an arbitrary constant may be found, there may be other solutions 
that cannot be obtained by giving values to this constant. For instance, for the differential 
equation y’ = y? in Example 4, the expression in equation (22) contains an arbitrary constant 
but does not include all solutions of the differential equation. To show this, observe that the 
function y = O for all t is certainly a solution of the differential equation, but it cannot be 
obtained from equation (22) by assigning a value to c. In this example we might anticipate 
that something of this sort might happen, because to rewrite the original differential equation 
in the form (21), we must require that y is not zero. However, the existence of “additional” 
solutions is not uucommon for nonlinear equations; a less obvious example is given in Problem 
18. Thus we will use the term "general solution" only when discussing linear equations. 


Implicit Solutions. Recall again that for an initial value problem for a first-order linear 
differential equation, equation (8) provides an explicit formula for the solution y = ¢ (t). As 
long as the necessary antiderivatives can be found, the value of the solution at any point can be 
determined merely by substituting the appropriate value of t into the equation. The situation 
for nonlinear equations is much less satisfactory. Usually, the best that we can hope for is to 
find an equation 


F(t, y) 20 (26) 


involving f and y that is satisfied by the solution y = ¢ (t). Even this can be done only for 
differential equations of certain particular types, of which separable equations are the most 
important. The equation (26) is called an integral, or first integral, of the differential equation, 
and (as we have already noted) its graph is an integral curve, or perhaps a family of integral 
curves. Equation (26), assuming it can be found, defines the solution implicitly; that is, for each 
value of t we must solve equation (26) to find the corresponding value of y. If equation (26) 
is simple enough, it may be possible to solve it for y by analytical means and thereby obtain 
an explicit formula for the solution. However, more frequently this will not be possible, and 
you will have to resort to a numerical calculation to determine (approximately) the value of 
y for a given value of t. Once several pairs of values of t and y have been calculated, it is 
often helpful to plot them and then to sketch the integral curve that passes through them. You 
should take advantage of the wide range of computational and graphical utilities available to 
carry out these calculations and to create the graph of one or more integral curves. 

Examples 2, 3, and 4 involve nonlinear problems in which it is easy to solve for an explicit 
formula for the solution y = ¢ (t). On the other hand, Examples 1 and 3 in Section 2.2 are 
cases in which it is better to leave the solution in implicit form and to use numerical means to 
evaluate it for particular values of the independent variable. The latter situation is more typical; 
unless the implicit relation is quadratic in y or has some other particularly simple form, it is 
unlikely that it can be solved exactly by analytical methods. Indeed, more often than not, it 
is impossible even to find an implicit expression for the solution of a first-order nonlinear 
equation. 


Graphical or Numerical Construction of Integral Curves. Because of the difficulty in 
obtaining exact analytical solutions of nonlinear differential equations, methods that yield 
approximate solutions or other qualitative information about solutions are of correspondingly 
greater importance. We have already described, in Section 1.1, how the direction field of a 
differential equation can be constructed. The direction field can often show the qualitative 
form of solutions and can also be helpful in identifying regions of the ty-plane where solutions 
exhibit interesting features that merit more detailed analytical or numerical investigation. 
Graphical methods for first-order differential equations are discussed further in Section 2.5. 


2.4 Differences Between Linear and Nonlinear Differential Equations 


An introduction to numerical methods for first-order equations is given in Section 2.7, and a 
systematic discussion of numerical methods appears in Chapter 8. However, it is not necessary 
to study the numerical algorithms themselves in order to use effectively one of the many 
software packages that generate and plot numerical approximations to solutions of initial value 
problems. 


Summary. The linear equation y’ + p(t)y = g(t) has several nice properties that can be 
summarized in the following statements: 


1. Assuming that the coefficients are continuous, there is a general solution, containing an 
arbitrary constant, that includes all solutions of the differential equation. A particular 
solution that satisfies a given initial condition can be picked out by choosing the proper 
value for the arbitrary constant. 


2. There is an expression for the solution, namely, equation (7) or equation (8). Moreover, 
although it involves two integrations, the expression is an explicit one for the solution 
y = ó(t) rather than an equation that defines o implicitly. 


3. The possible points of discontinuity, or singularities, of the solution can be identified 
(without solving the problem) merely by finding the points of discontinuity of the 
coefficients. Thus, if the coefficients are continuous for all t, then the solution also exists 
and is differentiable for all t. 


None of these statements are true, in general, of nonlinear equations. Although a nonlinear 
equation may well have a solution involving an arbitrary constant, there may also be other 
solutions. There is no general formula for solutions of nonlinear equations. If you are able 
to integrate a nonlinear equation, you are likely to obtain an equation defining solutions 
implicitly rather than explicitly. Finally, the singularities of solutions of nonlinear equations 
can usually be found only by solving the equation and examining the solution. It is likely that 
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the singularities will depend on the initial condition as well as on the differential equation. 


Problems 


In each of Problems 1 through 4, determine (without solving the 
problem) an interval in which the solution of the given initial value 
problem is certain to exist. 


1. (t—3)y’+(nt)y =2t, y(1 =2 
2. y’+(tant)y =sint, y(7) =0 

3. (4—- £y + 2ty = 32, y(-3) =1 
4. (nt)y’+y=cott, y(2) =3 


In each of Problems 5 through 8, state where in the ty-plane the 
hypotheses of Theorem 2.4.2 are satisfied. 


5. y 2(1—12—y212 


In |ry| 
6. y = ——— 
y 1-124 y? 
"m. y =(P +y? 
14+? 
8. y= t 
ayy? 


In each of Problems 9 through 12, solve the given initial value problem 
and determine how the interval in which the solution exists depends 
on the initial value yo. 


9. y'=—4t/y, y(0 = yo 
10. y'—2ty?, y(0) = yo 
11. y'+y?=0, y(0) =y 


12. y=———.,_ (0) = 
y 


In each of Problems 13 through 16, draw a direction field and plot (or 
sketch) several solutions of the given differential equation. Describe 
how solutions appear to behave as f increases and how their behavior 
depends on the initial value yọ when t = 0. 


Q 13. y=nH3-y) 

Q 14. y =y3-ty) 

@ 15. y’=-y(3-ty) 

O 16. y =t-1-y? 

17. Consider the initial value problem y' = y!/?, y(0) = 0 from 

Example 3 in the text. 
a. Is there a solution that passes through the point (1, 1)? If so, 
find it. 
b. Is there a solution that passes through the point (2, 1)? If so, 
find it. 
c. Consider all possible solutions of the given initial value 
problem. Determine the set of values that these solutions have 


att = 2. 
18. a. Verify that both yı(t) = 1 — t and yo(t) = —17/4 are 
solutions of the initial value problem 
—t t? 4 
c" ee, y(2) 2-1. 


Where are these solutions valid? 
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b. Explain why the existence of two solutions of the 
given problem does not contradict the uniqueness part of 


Theorem 2.4.2. 
c. Show that y = ct + c?, where c is an arbitrary constant, 
satisfies the differential equation in part a for t > —2c. Ifc = —1, 


the initial condition is also satisfied, and the solution y = y,(1) is 
obtained. Show that there is no choice of c that gives the second 
solution y = y(t). 

19. a. Show that 6(t) = e” is a solution of y' — 2y = 0 and that 

y = c@(t) is also a solution of this equation for any value of the 
constant c. 
b. Show that o (f) = 1/t isa solution of y' + y? = O fort > 0, 
but that y = có (t) is nota solution of this equation unless c = 0 
or c = 1. Note that the equation of part b is nonlinear, while that 
of part a is linear. 


20. Show that if y = ¢(t) is a solution of y' + p(t)y = 0, then 
y = có (t) is also a solution for any value of the constant c. 


21. Let y = y,(t) be a solution of 


y'- p(t)y — 0, (27) 
and let y = y2(t) be a solution of 
y+ p(t)y = g(t). (28) 


Show that y = y,(t) + y2(t) is also a solution of equation (28). 


22. a. Show that the solution (7) of the general linear equation (1) 
can be written in the form 


y = cy1(t) + yo(t), (29) 


where c is an arbitrary constant. 
b. Show that y; is a solution of the differential equation 


y ^ p(thy 2 0, (30) 


corresponding to g(t) = 0. 

C. Show that y; is a solution of the full linear equation (1). We 
see later (for example, in Section 3.5) that solutions of higher- 
order linear equations have a pattern similar to equation (29). 


Bernoulli Equations. Sometimes it is possible to solve a nonlinear 
equation by making a change of the dependent variable that converts 
it into a linear equation. The most important such equation has the 
form 


y + p(t)y 2 q(0) y", 


and is called a Bernoulli equation after Jakob Bernoulli. Problems 23 
and 25 deal with equations of this type. 


23. a. Solve Bernoulli's equation when n = 0; when n = 1. 
b. Show that if n 4 0, 1, then the substitution v = y!~” reduces 
Bernoulli's equation to a linear equation. This method of solution 
was formulated by Leibniz in 1696. 


In each of Problems 24 through 25, the given equation is a Bernoulli 
equation. In each case solve it by using the substitution mentioned in 
Problem 23b. 

24. y'—ry —ky?, r > O and k > 0. This equation is important in 
population dynamics and is discussed in detail in Section 2.5. 

25. y —ecy—oy^,c > Oando > 0. This equation occurs in the 
study of the stability of fluid flow. 


Discontinuous Coefficients. Linear differential equations sometimes 
occur in which one or both of the functions p and g have jump 
discontinuities. If tọ is such a point of discontinuity, then it is necessary 
to solve the equation separately for £ < tọ and t > tọ. Afterward, 
the two solutions are matched so that y is continuous at tọ; this 
is accomplished by a proper choice of the arbitrary constants. The 
following two problems illustrate this situation. Note in each case that 
it is impossible also to make y’ continuous at fg. 


26. Solve the initial value problem 


y--2y-g(), y(0) =0, 
where 
T 1], Ostal; 
n= 
à 0, t> 1. 


27. Solve the initial value problem 


y+ pit)y=0, y0) =1, 


where 


25 Autonomous Differential Equations 


and Population Dynamics 


An important class of first-order equations consists of those in which the independent variable 
does not appear explicitly. Such equations are called autonomous and have the form 


dy/dt = f(y). (1) 


We will discuss these equations in the context of the growth or decline of the population 
of a given species, an important issue in fields ranging from medicine to ecology to global 
economics. A number of other applications are mentioned in some of the problems. Recall that 
in Sections 1.1 and 1.2 we considered the special case of equation (1) in which f(y) = ay +b. 

Equation (1) is separable, so the discussion in Section 2.2 is applicable to it, but the main 
purpose of this section is to show how geometric methods can be used to obtain important 
qualitative information directly from the differential equation without solving the equation. Of 
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fundamental importance in this effort are the concepts of stability and instability of solutions 
of differential equations. These ideas were introduced informally in Chapter 1, but without 
using this terminology. They are discussed further here and will be examined in greater depth 
and in a more general setting in Chapter 9. 


Exponential Growth. Let y = ¢(f) be the population of the given species at time t. The 
simplest hypothesis concerning the variation of population is that the rate of change of y is 
proportional!’ to the current value of y; that is, 

dy 


DUET CL 2 
d ry, Q) 


where the constant of proportionality r is called the rate of growth or decline, depending on 
whether r is positive or negative. Here, we assume that the population is growing, sor > 0. 
Solving equation (2) subject to the initial condition!! 


y(0) = yo, (3) 
we obtain 
y = ye". (4) 


Thus the mathematical model consisting of the initial value problem (1), (2) withr > 0 
predicts that the population will grow exponentially for all time, as shown in Figure 2.5.1 for 
several values of yọ. Under ideal conditions, equation (4) has been observed to be reasonably 
accurate for many populations, at least for limited periods of time. However, itis clear that such 
ideal conditions cannot continue indefinitely; eventually, limitations on space, food supply, or 
other resources will reduce the growth rate and bring an end to uninhibited exponential growth. 


ya 


l/r 2/r 3/r Afr t 


Exponential growth: y versus t for 
dy/dt — ry (r > 0). 


Logistic Growth. To take account of the fact that the growth rate actually depends on the 
population, we replace the constant r in equation (2) by a function A( y) and thereby obtain 
the modified equation 


— =h(y)y. (5) 


We now want to choose A( y) so that h(y) = r > 0 when y is small, A( y) decreases 
as y grows larger, and h(y) < 0 when y is sufficiently large. The simplest function that has 
these properties is h(y) = r — ay, where a is also a positive constant. Using this function in 
equation (5), we obtain 

dy 


g D (6) 


!0Tt was apparently the British economist Thomas Malthus (1766-1834) who first observed that many biological 
populations increase at a rate proportional to the population. His first paper on populations appeared in 1798. 


!![n this section, because the unknown function is a population, we assume yo > 0. 
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Equation (6) is known as the Verhulst! equation or the logistic equation. It is often convenient 


to write the logistic equation in the equivalent form 
d 
<= r(1-=)y, (7) 


where K = r/a. In this form, the constant r is called the intrinsic growth rate—that is, the 
growth rate in the absence of any limiting factors. The interpretation of K will become clear 
shortly. 

We will investigate the solutions of equation (7) in some detail later in this section. Before 
doing that, however, we will show how you can easily draw a qualitatively correct sketch of 
the solutions. The same methods also apply to the more general equation (1). 

We first seek solutions of equation (7) of the simplest possible type—that is, constant 
functions. For such a solution dy/dt = 0 for all t, so any constant solution of equation (7) 
must satisfy the algebraic equation 

y 
r (1 K ) y=0. 


Thus the constant solutions are y = ġ (t) = Oand y = $5(t) = K. These solutions are called 
equilibrium solutions of equation (7) because they correspond to no change or variation in 
the value of y as ¢ increases. In the same way, any equilibrium solutions of the more general 
equation (1) can be found by locating the roots of f(y) = 0. The zeros of f(y) are also called 
critical points. 

To visualize other solutions of equation (7) and to sketch their graphs quickly, we start 
by drawing the graph of f(y) versus y. In the case of equation (7), f(y) = r(1— y/K)y, 
so the graph is the parabola shown in Figure 2.5.2. The intercepts are (0, 0) and ( K,0), 
corresponding to the critical points of equation (7), and the vertex of the parabola is 
(K/2,rK/4). Observe that dy/dt > 0 for0 < y < K. Therefore, y is an increasing 
function of f when y is in this interval; this is indicated by the rightward-pointing arrows near 
the y-axis in Figure 2.5.2. Similarly, if y > K, then dy/dt « 0; hence y is decreasing, as 
indicated by the leftward-pointing arrow in Figure 2.5.2. 


(K/2, rK/4) 


| FIGURE 2.5.2 | f(y) versus y fordy/dt = r(1—y/K)y. 


In this context the y-axis is often called the phase line, and it is reproduced in its more 
customary vertical orientation in Figure 2.5.3a. The dots at y = 0 and y = K are the critical 
points, or equilibrium solutions. The arrows again indicate that y is increasing whenever 
0 « y< K and that y is decreasing whenever y > K. 


12Pierre F. Verhulst (1804-1849) was a Belgian mathematician who introduced equation (6) as a model for human 
population growth in 1838. He referred to it as logistic growth, so equation (6) is often called the logistic equation. 
He was unable to test the accuracy of his model because of inadequate census data, and it did not receive much 
attention until many years later. Reasonable agreement with experimental data was demonstrated by R. Pearl (1930) 
for Drosophila melanogaster (fruit fly) populations and by G. F. Gause (1935) for Paramecium and Tribolium (flour 
beetle) populations. 
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Further, from Figure 2.5.2, note that if y is near zero or K, then the slope f(y) is near 
zero, so the solution curves are relatively flat. They become steeper as the value of y leaves 
the neighborhood of zero or K. 

To sketch the graphs of solutions of equation (7) in the ry-plane, we start with the 
equilibrium solutions y = @\(t) = 0 and y = ¢2(t) = K; then we draw other curves that 
are increasing when 0 < y < K, decreasing when y > K, and flatten out as y approaches 
either of the values 0 or K. Thus the graphs of solutions of equation (7) must have the general 
shape shown in Figure 2.5.3b, regardless of the values of r and K. 

Figure 2.5.3b may seem to show that other solutions intersect the equilibrium solution 
y — K,butis this really possible? No, the uniqueness part of Theorem 2.4.2, the fundamental 
existence and uniqueness theorem, states that only one solution can pass through a given point 
in the ty-plane. Thus, although other solutions may be asymptotic to the equilibrium solution 
as t — oo, they cannot intersect it at any finite time. Consequently, a solution that starts in 
the interval 0 < y « K remains in this interval for all time, and similarly for a solution that 
starts in K < y < oo. 


(a) (b) 
| FIGURE 2.5.3 | Logistic growth: dy /dt = r(1— y/ K) y. (a) The phase line. (b) Plots 


of y versus f. 


To carry the investigation one step further, we can determine the concavity of the solution 
curves and the location of inflection points by finding d? y /dt*. From the differential equation 
(1), we obtain (using the chain rule) 


d?y mE d dy m d — fl dy — f! 
"ign 5; 49 Eu = f Vf). (8) 


The graph of y versus f is concave up when y" > 0—that is, when f and f' have the same 
sign. Similarly, it is concave down when y" < 0, which occurs when f and f’ have opposite 
signs. The signs of f and f' can be easily identified from the graph of f(y) versus y. Inflection 
points may occur when f'( y) = 0. 

In the case of equation (7), solutions are concave up for 0 < y < K/2 where f is 
positive and increasing (see Figure 2.5.2), so that both f and f" are positive. Solutions are 
also concave up for y > K where f is negative and decreasing (both f and f’ are negative). 
For K/2 « y « K, solutions are concave down since here f is positive and decreasing, so 
f is positive but f' is negative. There is an inflection point whenever the graph of y versus t 
crosses the line y = K/2. The graphs in Figure 2.5.3b exhibit these properties. 

Finally, observe that K is the upper bound that is approached, but not exceeded, by 
growing populations starting below this value. Thus it is natural to refer to K as the saturation 
level, or the environmental carrying capacity, for the given species. 

A comparison of Figures 2.5.1 and 2.5.35 reveals that solutions of the nonlinear equation 
(7) are strikingly different from those of the linear equation (1), at least for large values 
of t. Regardless of the value of K —that is, no matter how small the nonlinear term in 
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equation (7)—solutions of that equation approach a finite value as t — oo, whereas solutions 
of equation (1) grow (exponentially) without bound as t — oo. Thus even a tiny nonlinear 
term in the differential equation (7) has a decisive effect on the solution for large t. 

In many situations it is sufficient to have the qualitative information about a solution 
y = (t) of equation (7) that is shown in Figure 2.5.35. This information was obtained 
entirely from the graph of f(y) versus y and without solving the differential equation (7). 
However, if we wish to have a more detailed description of logistic growth —for example, if 
we wish to know the value of the population at some particular time—then we must solve 
equation (7) subject to the initial condition (3). Provided that y 4 0 and y Z K, we can write 
equation (7) in the form 


dy u 
(l—y/K)y | 


Using a partial fraction expansion on the left-hand side, we have 


Gee Ja dt 
E y — r dt. 
y l—-y/K 


Then, by integrating both sides, we obtain 


r dt. 


y 
1 =Infl = 2] =r 9 
nbi - m[t - 2] =r +e ©) 


where c is an arbitrary constant of integration to be determined from the initial condition 
y(0) = yo. We have already noted that if 0 < yọ < K, then y remains in this interval for all 
time. Thus in this case we can remove the absolute value bars in equation (9), and by taking 
the exponential of both sides, we find that 


M rt 
—— = Ce", (10) 
1—(y/K) 
where C = e°“. In order to satisfy the initial condition y(0) = yo, we must choose 


C = yo/(1—(y0/K)). Using this value for C in equation (10) and solving for y (see Problem 
10), we obtain 


nes yoK 
yo (CK — yoe ™™ ` 


(41) 


We have derived the solution (11) under the assumption that 0 < yo < K. If yo > K, then 
the details of dealing with equation (9) are only slightly different, and we leave it to you to 
show that equation (11) is also valid in this case. Finally, note that equation (11) also contains 
the equilibrium solutions y = ¢)(t) = O and y = @2(t) = K corresponding to the initial 
conditions yo = 0 and yo = K, respectively. 

All the qualitative conclusions that we reached earlier by geometrical reasoning can be 
confirmed by examining the solution (11). In particular, if yọ = 0, then equation (11) requires 
that y(t) = 0 for all t. If yo > 0, and if we let  — oo in equation (11), then we obtain 


K 
iim y= k. 


1—00 Yo 


Thus, for each yọ > 0, the solution approaches the equilibrium solution y = (t) = K 
asymptotically as £ — oo. Therefore, we say that the constant solution @2(t) = K is an 
asymptotically stable solution of equation (7) or that the point y = K is an asymptotically 
stable equilibrium or critical point. After a long time, the population is close to the saturation 
level K regardless of the initial population size, as long as it is positive. Other solutions 
approach the equilibrium solution more rapidly as r increases. 

On the other hand, the situation for the equilibrium solution y = (t) = O is quite 
different. Even solutions that start very near zero grow as f increases and, as we have seen, 
approach K as t — oo. We say that @;(t) = 0 is an unstable equilibrium solution or that 
y = 0 is an unstable equilibrium or critical point. This means that the only way to guarantee 
that the solution remains near zero is to make sure its initial value is exactly equal to zero. 
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EXAMPLE 1 


The logistic model has been applied to the natural growth of the halibut population in certain areas 
of the Pacific Ocean. !? Let y, measured in kilograms, be the biomass, that is, the total mass, of the 
halibut population, at time t. The parameters in the logistic equation are estimated to have the values 
r = 0.71/year and K = 80.5 x 10° kg. If the initial biomass is yọ = 0.25K, find the biomass 2 
years later. Also find the time 7 for which y(7) =0.75K. 


Solution: 


It is convenient to scale the solution (11) to the carrying capacity K; thus we write equation (11) in 
the form 


». yo/ K 
K — (yo/K) - (1 — yo/ K)e"' ` 


Using the data given in the problem, we find that 


(12) 


x2 _ 0.25 
K | 02540.75e-..2 


= 0.5797. 
Consequently, y(2) = 46.7 x 10° kg. 
To find 7, the time when yo/ K = 0.75 we first solve equation (12) for t, obtaining 


eu (o/K)(1—y/K) 
G/K)(1— yo/ K) ' 


hence 


Ln (yo/ K)(1 — y/ K) i (13) 
r VGQ/K)(1— yo/K) 


Using the given values of r and yo/ K and setting y/ K = 0.75, we find that 


1 (0.25)(0.25) 1 
0.71 5 (0.75)(0.75) 0.71 


In 9 = 3.095 years. 


The graphs of y / K versus t for the given parameter values and for several initial conditions are 
shown in Figure 2.5.4. The green curve corresponds to the initial condition yọ = 0.25K. 


y/K^ 
1.75 
1.50 
1.25 
1.00 
0.75 
0.50 l 
0.25 ; 


y/K versus t for population model of halibut in 


the Pacific Ocean. The green curve satisfies the initial condition 
y(0)/K = 0.25. The solution with y(0) = 0.25 reaches 75% of 
the carrying capacity at time t — 7 3.095 years. 


15A good source of information on the population dynamics and economics involved in making efficient use of 
a renewable resource, with particular emphasis on fisheries, is the book by Clark listed in the references at the 
end of this chapter. The parameter values used here are given on page 53 of this book and were obtained from a 
study by H. S. Mohring. 
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ACritical Threshold. We now turn to a consideration of the equation 


dy y 
à T(z)» a 
where r and T are given positive constants. Observe that (except for replacing the parameter 
K by T) this equation differs from the logistic equation (7) only in the presence of the minus 
sign on the right-hand side. However, as we will see, the solutions of equation (14) behave 
very differently from those of equation (7). 

For equation (14) the graph of f(y) versus y is the parabola shown in Figure 2.5.5. 
The intercepts on the y-axis are the critical points y = 0 and y = T, corresponding to the 
equilibrium solutions y = ¢,(t) = 0 and y = ¢o(t) = T.If 0 < y < T, then dy/dt < 0, 
and y is positive and decreases as t increases. Thus ¢;(t) = O is an asymptotically stable 
equilibrium solution. On the other hand, if y > T, then dy/dt > 0, so that y is positive and 
increasing as f increases; thus ó5(1) = T is an unstable equilibrium solution. 

Furthermore, the concavity of solutions can be determined by looking at the sign of 
y" = f'(y)f(y); see equation (8). Figure 2.5.5 clearly shows that f'(y) is negative for 
0 < y < T/2 and positive for T/2 < y < T, so the graph of y versus t is concave up 
and concave down, respectively, in these intervals. Also, f'( y) and f(y) are both positive for 
y > T, so the graph of y versus t is also concave up there. 


(T/2, -rT/4) 


| FIGURE 2.5.5 | f(y) versus y for dy/dt = —r(1— y/ Ty. 


Figure 2.5.6(a) shows the phase line (the y-axis) for equation (14). The dots at y — 0 and 
y — T are the critical points, or equilibrium solutions, and the arrows indicate where solutions 
are either increasing or decreasing. 

Solution curves of equation (14) can now be sketched quickly, as follows. First draw the 
equilibrium solutions y = ġı(t) = 0 and y = $»(t) = T. Then sketch curves in the strip 
0 < y < T that are decreasing as t increases and change concavity as they cross the line 
y = T/2. Next draw some curves above y = T that increase more and more steeply as t 
and y increase. Make sure that all curves become flatter as y approaches either zero or T. The 
result is Figure 2.5.6(b), which is a qualitatively accurate sketch of solutions of equation (14) 
for any r and T. From this figure it appears that as time increases, y either approaches zero or 
grows without bound, depending on whether the initial value yọ is less than or greater than T. 
Thus T is a threshold level, below which growth does not occur. 

We can confirm the conclusions that we have reached through geometrical reasoning 
by solving the differential equation (14). This can be done by separating the variables and 
integrating, just as we did for equation (7). However, if we note that equation (14) can be 
obtained from equation (7) by replacing K by T and r by —r, then we can make the same 
substitutions in the solution (11) and thereby obtain 


J= YoT 
yo + CT — yo)e" 
which is the solution of equation (14) subject to the initial condition y(0) = yo. 


If 0 < yo < T, then it follows from equation (15) that y — 0 as t — oo. This agrees 
with our qualitative geometric analysis. If yọ > T, then the denominator on the right-hand 


, (15) 
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yA 


(a) (b) 


Growth with a threshold: dy/dt = —r(1— y/T)y;y = T is an 
asymptotically unstable equilibrium, while y = 0 is asymptotically stable. (a) The 
phase line. (b) Plots of y versus t. 


side of equation (15) is zero for a certain finite value of t. We denote this value by ¢* and 
calculate it from 


Yo — Qo — Tye" — 0, 
which gives (see Problem 12) 


m i M. (16) 
r y-T 
Thus, if the initial population yo is above the threshold T, the threshold model predicts that the 
graph of y versus f has a vertical asymptote at t = ¢*; in other words, the population becomes 
unbounded in a finite time, whose value depends on yo, T, and r. The existence and location 
of this asymptote were not apparent from the geometric analysis, so in this case the explicit 
solution yields additional important qualitative, as well as quantitative, information. 

The populations of some species exhibit the threshold phenomenon. If too few are present, 
then the species cannot propagate itself successfully and the population becomes extinct. 
However, if the population is larger than the threshold level, then further growth occurs. 
Of course, the population cannot become unbounded, so eventually equation (14) must be 
modified to take this into account. 

Critical thresholds also occur in other circumstances. For example, in fluid mechanics, 
equations of the form (7) or (14) often govern the evolution of a small disturbance y in 
a laminar (or smooth) fluid flow. For instance, if equation (14) holds and y < T, then 
the disturbance is damped out and the laminar flow persists. However, if y ^ T, then the 
disturbance grows larger and the laminar flow breaks up into a turbulent one. In this case T is 
referred to as the critical amplitude. Experimenters speak of keeping the disturbance level in 
a wind tunnel low enough so that they can study laminar flow over an airfoil, for example. 


Logistic Growth with a Threshold. As we mentioned in the last subsection, the threshold 
model (14) may need to be modified so that unbounded growth does not occur when y is above 
the threshold T. The simplest way to do this is to introduce another factor that will have the 
effect of making dy / dt negative when y is large. Thus we consider 


$ --(-20-D» on 


wherer > OandO< T « K. 

The graph of f(y) versus y is shown in Figure 2.5.7. In this problem there are three 
critical points, y = 0, y = T, and y = K, corresponding to the equilibrium solutions 
y = o\(t) = 0, y = $2(t) = T, and y = ġ3(t) = K, respectively. From Figure 2.5.7 
we observe that dy/dt > O for T < y < K, and consequently y is increasing there. Further, 
dy/dt < 0 for y < T and for y > K, so y is decreasing in these intervals. Consequently, 
the equilibrium solutions y = (t) = 0 and y = ġ3(t) = K are asymptotically stable, and 
the solution y = ġ2(t) = T is unstable. 
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Seis eae) f(y) versus y for dy/dt =—r(1—y/T)(1—y/K)y. 


The phase line for equation (17) is shown in Figure 2.5.8a, and the graphs of some 
solutions are sketched in Figure 2.5.8b. You should make sure that you understand the relation 
between these two figures, as well as the relation between Figures 2.5.7 and 2.5.8a. From 
Figure 2.5.8b we see that if y starts below the threshold 7, then y declines to ultimate 
extinction. On the other hand, if y starts above T, then y eventually approaches the carrying 
capacity K. The inflection points on the graphs of y versus ¢ in Figure 2.5.85 correspond 
to the maximum and minimum points, y; and y2, respectively, on the graph of f(y) versus 
y in Figure 2.5.7. These values can be obtained by differentiating the right-hand side of 
equation (17) with respect to y, setting the result equal to zero, and solving for y. We obtain 


yu22 (K +T E VK? — KT +T?)/3, (18) 


where the plus sign yields y; and the minus sign y2. 


K 


(a) (b) 


Logistic growth with a threshold: dy/dt = —r(1— y/ T)(1— y/K)y; 
y = $\(t) = Oand y = ¢3(t) = K are asymptotically stable equilibria and y = $2(t) = T 
is an asymptotically unstable equilibrium. (a) The phase line. (b) Plots of y versus t. 


A model of this general sort apparently describes the population of the passenger pigeon, ^ 


which was present in the United States in vast numbers until the late nineteenth century. It was 
heavily hunted for food and for sport, and consequently its numbers were drastically reduced 
by the 1880s. Unfortunately, the passenger pigeon could apparently breed successfully only 
when present in a large concentration, corresponding to a relatively high threshold T. Although 
a reasonably large number of individual birds remained alive in the late 1880s, there were not 
enough in any one place to permit successful breeding, and the population rapidly declined to 
extinction. The last passenger pigeon died in 1914. The precipitous decline in the passenger 
pigeon population from huge numbers to extinction in a few decades was one of the early 
factors contributing to a concern for conservation in this country. 


See, for example, Oliver L. Austin, Jr., Birds of the World (New York: Golden Press, 1983), pp. 143-145. 


Problems 


Problems | through 4 involve equations of the formdy/dt = f(y).In 
each problem sketch the graph of f(y) versus y, determine the critical 
(equilibrium) points, and classify each one as asymptotically stable or 
unstable. Draw the phase line, and sketch several graphs of solutions 
in the ty-plane. 


Q 1. dy/dt=ay+by*, a>0, b>0, —oo« yo « oo 
Q 2. ay/at2 (y -D0(y -2, y» z0 
Q 3. dy/dt=e’-1, -< y< œ% 
Q 4. dy/dt=e%-1, -w< yy < œ% 


5. Semistable Equilibrium Solutions. Sometimes a constant 
equilibrium solution has the property that solutions lying on one side 
of the equilibrium solution tend to approach it, whereas solutions lying 
on the other side depart from it (see Figure 2.5.9). In this case the 
equilibrium solution is said to be semistable. 

a. Consider the equation 


dy/dt =k(1—y)’, (19) 


where k is a positive constant. Show that y = 1 is the only critical 
point, with the corresponding equilibrium solution (t) = 1. 
@ b. Sketch f(y) versus y. Show that y is increasing as a 
function of t for y < 1 and also for y > 1. The phase line 
has upward-pointing arrows both below and above y — 1. Thus 
solutions below the equilibrium solution approach it, and those 
above it grow farther away. Therefore, 6 (t) = 1 is semistable. 
€. Solve equation (19) subject to the initial condition y(0) = yo 
and confirm the conclusions reached in part b. 


In both cases the equilibrium solution o (f) = k 
is semistable. (a) dy/dt < 0; (b) dy/dt = 0. 


Problems 6 through 9 involve equations of the form dy/dt = f(y). 
In each problem sketch the graph of f(y) versus y, determine the 
critical (equilibrium) points, and classify each one as asymptotically 
stable, unstable, or semistable (see Problem 5). Draw the phase line, 
and sketch several graphs of solutions in the ty-plane. 


Q 6. dy/dt 2 Xy? - 1), 
dy/dt = y(1— y^), 
dy/dt = yK4 — y^), 
dy/dt = y — y)’, 


—o0 < yo < oo 
@ 7. —00 < yg < 00 
O s —00 < yg < oo 
O ». 


—00 < yo < OO 
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10. Complete the derivation of the explicit formula for the solution 
(11) of the logistic model by solving equation (10) for y. 


11. In Example 1, complete the manipulations needed to arrive at 
equation (13). That is, solve the solution (11) for t. 


12. Complete the derivation of the location of the vertical asymptote 
in the solution (15) when yọ > T. That is, derive formula (16) by 
finding the value of t when the denominator of the right-hand side of 
equation (15) is zero. 


13. Complete the derivation of formula (18) for the locations of the 
inflection points of the solution of the logistic growth model with a 
threshold (17). Hint: Follow the steps outlined on p. 66. 


14. Consider the equation dy/dt = f(y) and suppose that y; is a 
critical point —that is, f(y,) = 0. Show that the constant equilibrium 
solution ¢ (f) = y; is asymptotically stable if f’(y,) < 0 and unstable 
if f(y) > 0. 
15. Suppose that a certain population obeys the logistic equation 
dy/dt =ry(1—(y/K)). 
a. If yo = K/3, find the time 7 at which the initial population 
has doubled. Find the value of 7 corresponding to r = 0.025 per 
year. 
b. If yo/K = a, find the time T at which y(T)/K = 8, 
where 0 < o, B < 1. Observe that T — coasa — O or 
as 9 — 1. Find the value of T for r = 0.025 per year, a = 0.1, 
and 3 = 0.9. 


© 16. Another equation that has been used to model population 
growth is the Gompertz!> equation 


— —ryln| — J, 
dt y 


where r and K are positive constants. 

a. Sketch the graph of f(y) versus y, find the critical points, 
and determine whether each is asymptotically stable or unstable. 
b. For0 < y x K, determine where the graph of y versus f is 
concave up and where it is concave down. 

c. Foreach y in 0 < y < K, show that dy/dt as given by 
the Gompertz equation is never less than dy / dt as given by the 
logistic equation. 


17. a. Solve the Gompertz equation 


— -—ryln| — |. 
dt y 


subject to the initial condition y(0) — yo. 

Hint: You may wish to let u = In( y/ K). 

b. For the data given in Example 1 in the text (r — 0.71 per 
year, K = 80.5 x 10° kg, yg/ K = 0.25), use the Gompertz 
model to find the predicted value of y(2). 

C. Forthe same data as in part b, use the Gompertz model to find 
the time 7 at which y(7) = 0.75K. 


PBenjamin Gompertz (1779-1865) was an English actuary. He developed his 
model for population growth, published in 1825, in the course of constructing 
mortality tables for his insurance company. 
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18. A pond forms as water collects in a conical depression of radius 
a and depth A. Suppose that water flows in at a constant rate k and is 
lost through evaporation at a rate proportional to the surface area. 
a. Show that the volume V(t) of water in the pond at time t 
satisfies the differential equation 


eps om (3a/ n h) PV, 

dt 

where a is the coefficient of evaporation. 

b. Find the equilibrium depth of water in the pond. Is the 
equilibrium asymptotically stable? 

c. Find a condition that must be satisfied if the pond is not to 
overflow. 


Harvesting a Renewable Resource. Suppose that the population y of 
a certain species of fish (for example, tuna or halibut) in a given area 
of the ocean is described by the logistic equation 


3 =7(1-2) 
dt — KJ" 


Although it is desirable to utilize this source of food, it is intuitively 
clear that if too many fish are caught, then the fish population may be 
reduced below a useful level and possibly even driven to extinction. 
Problems 19 and 20 explore some of the questions involved in 
formulating a rational strategy for managing the fishery. !ó 


19. Ata given level of effort, it is reasonable to assume that the rate 
at which fish are caught depends on the population y: the more fish 
there are, the easier it is to catch them. Thus we assume that the rate at 
which fish are caught is given by Ey, where E is a positive constant, 
with units of 1/time, that measures the total effort made to harvest the 
given species of fish. To include this effect, the logistic equation is 
replaced by 


dy =r (1 
dt 
This equation is known as the Schaefer model after the biologist 
M. B. Schaefer, who applied it to fish populations. 
a. Show that if E < r, then there are two equilibrium points, 
y; = 0 and y; = K(1 — E/r) > 0. 
b. Show that y = y; is unstable and y = y; is asymptotically 
stable. 
C. Asustainable yield Y of the fishery is a rate at which fish can 
be caught indefinitely. It is the product of the effort E and the 
asymptotically stable population y2. Find Y as a function of the 
effort E; the graph of this function is known as the yield — effort 
curve. 
d. Determine E so as to maximize Y and thereby find the 
maximum sustainable yield Y,,. 


x » Ey. (20) 


20. In this problem we assume that fish are caught at a constant rate 
h independent of the size of the fish population. Then y satisfies 


râ ap h. (21) 


dt K 


The assumption of a constant catch rate h may be reasonable when y 
is large but becomes less so when y is small. 
a. If ^ < rK/4, show that equation (21) has two equilibrium 
points y, and y» with y, < y»; determine these points. 
b. Show that y, is unstable and y» is asymptotically stable. 
C. From a plot of f(y) versus y, show that if the initial 
population yọ > yı, then y — yo ast — oo, but that if 


16 An excellent treatment of this kind of problem, which goes far beyond what 
is outlined here, may be found in the book by Clark mentioned previously, 
especially in the first two chapters. Numerous additional references are given 
there. 


yo € yı, then y decreases as t increases. Note that y = 0 is not an 
equilibrium point, so if yọ < yı, then extinction will be reached 
in a finite time. 

d. If ^ > rK/4, show that y decreases to zero as ¢ increases, 
regardless of the value of yo. 

e. If ^ = rK/4, show that there is a single equilibrium point 
y — K/2 and that this point is semistable (see Problem 5). Thus 
the maximum sustainable yield is hm = rK/4, corresponding 
to the equilibrium value y = K/2. Observe that hm has the 
same value as Y,, in Problem 19d. The fishery is considered to 
be overexploited if y is reduced to a level below K /2. 


Epidemics. The use of mathematical methods to study the spread of 
contagious diseases goes back at least to some work by Daniel 
Bernoulli in 1760 on smallpox. In more recent years many 
mathematical models have been proposed and studied for many 
different diseases." Problems 21 through 23 deal with a few of the 
simpler models and the conclusions that can be drawn from them. 
Similar models have also been used to describe the spread of rumors 
and of consumer products. 


21. Suppose that a given population can be divided into two parts: 
those who have a given disease and can infect others, and those 
who do not have it but are susceptible. Let x be the proportion of 
susceptible individuals and y the proportion of infectious individuals; 
then x + y = 1. Assume that the disease spreads by contact between 
sick and well members of the population and that the rate of spread 
dy / dt is proportional to the number of such contacts. Further, assume 
that members of both groups move about freely among each other, so 
the number of contacts is proportional to the product of x and y. Since 
x = 1 — y, we obtain the initial value problem 
dy 


— — ay(1— y). 


Ai y(0) = yo. (22) 


where a is a positive proportionality factor, and yo is the initial 
proportion of infectious individuals. 
a. Find the equilibrium points for the differential equation (22) 
and determine whether each is asymptotically stable, semistable, 
or unstable. 
b. Solve the initial value problem 22 and verify that the 
conclusions you reached in part a are correct. Show that y(1) — 
1 as t — oo, which means that ultimately the disease spreads 
through the entire population. 


22. Some diseases (such as typhoid fever) are spread largely by 
carriers, individuals who can transmit the disease but who exhibit no 
overt symptoms. Let x and y denote the proportions of susceptibles 
and carriers, respectively, in the population. Suppose that carriers are 
identified and removed from the population at a rate 3, so 


dy E 
rm —By. (23) 


Suppose also that the disease spreads at a rate proportional to the 
product of x and y; thus 


— = —Q xy. (24) 


a. Determine y at any time ¢ by solving equation (23) subject to 
the initial condition y(0) = yo. 

b. Use the result of part a to find x at any time ¢ by solving 
equation (24) subject to the initial condition x(0) = xo. 

Cc. Find the proportion of the population that escapes the 
epidemic by finding the limiting value of x as t > oo. 


17 A standard source is the book by Bailey listed in the references. The models 
in Problems 21, 22, and 23 are discussed by Bailey in Chapters 5, 10, and 20, 
respectively. 


23. Daniel Bernoulli’s work in 1760 had the goal of appraising the 
effectiveness of a controversial inoculation program against smallpox, 
which at that time was a major threat to public health. His model 
applies equally well to any other disease that, once contracted and 
survived, confers a lifetime immunity. 

Consider the cohort of individuals born in a given year (t = 0), 
and let n(t) be the number of these individuals surviving t years later. 
Let x(t) be the number of members of this cohort who have not had 
smallpox by year t and who are therefore still susceptible. Let G be 
the rate at which susceptibles contract smallpox, and let v be the rate 
at which people who contract smallpox die from the disease. Finally, 
let u(t) be the death rate from all causes other than smallpox. Then 
dx / dt, the rate at which the number of susceptibles declines, is given 
by 

dx 

g 7 tus. (25) 
The first term on the right-hand side of equation (25) is the rate at 
which susceptibles contract smallpox, and the second term is the rate 
at which they die from all other causes. Also 


dn 


2 = —v px — u(t)n, (26) 


where dn/dt is the death rate of the entire cohort, and the two terms 
on the right-hand side are the death rates due to smallpox and to all 
other causes, respectively. 


a. Let z = x/n, and show that z satisfies the initial value 
problem 

dz 

1 —8z(l1—vz) 20) =1. (27) 
Observe that the initial value problem (27) does not depend on 
u(t). 
b. Find z(t) by solving equation (27). 
c€. Bernoulli estimated that v = 8 = 1/8. Using these 


values, determine the proportion of 20-year-olds who have not 
had smallpox. 
Note: On the basis of the model just described and the best mortality 
data available at the time, Bernoulli calculated that if deaths due to 
smallpox could be eliminated (v =0), then approximately 3 years 
could be added to the average life expectancy (in 1760) of 26 years, 7 
months. He therefore supported the inoculation program. 


Bifurcation Points. For an equation of the form 


a 28 

449» (28) 
where a is a real parameter, the critical points (equilibrium solutions) 
usually depend on the value of a. As a steadily increases or decreases, 
it often happens that at a certain value of a, called a bifurcation point, 
critical points come together, or separate, and equilibrium solutions 
may be either lost or gained. Bifurcation points are of great interest in 
many applications, because near them the nature of the solution of the 
underlying differential equation is undergoing an abrupt change. For 
example, in fluid mechanics a smooth (laminar) flow may break up and 
become turbulent. Or an axially loaded column may suddenly buckle 
and exhibit a large lateral displacement. Or, as the amount of one of 
the chemicals in a certain mixture is increased, spiral wave patterns of 
varying color may suddenly emerge in an originally quiescent fluid. 
Problems 24 through 26 describe three types of bifurcations that can 
occur in simple equations of the form (28). 


24. Consider the equation 


dy 2 
^ =a- y?. 29 
Tm a—y (29) 
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a. Find all of the critical points for equation (29). Observe that 
there are no critical points if a < 0, one critical point if a = 0, 
and two critical points if a > 0. 


Ga b. Draw the phase line in each case and determine whether 
each critical point is asymptotically stable, semistable, or 
unstable. 


Q c. In each case sketch several solutions of equation (29) in 
the t y-plane. 


Note: If we plot the location of the critical points as a function of a in 
the ay-plane, we obtain Figure 2.5.10. This is called the bifurcation 
diagram for equation (29). The bifurcation at a = 0 is called a 
saddle — node bifurcation. This name is more natural in the context 
of second-order systems, which are discussed in Chapter 9. 
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| FIGURE 2.5.10 | Bifurcation diagram for y' = a — y?. 


25. Consider the equation 


d 
I say- y -ya- y. (30) 


Ga. Again consider the casesa < 0,a = 0, anda > 0. In each 
case find the critical points, draw the phase line, and determine 
whether each critical point is asymptotically stable, semistable, 
or unstable. 


@ b. In each case sketch several solutions of equation (30) in 
the t y-plane. 
Q c. Draw the bifurcation diagram for equation (30)— that is, 
plot the location of the critical points versus a. 
Note: For equation (30) the bifurcation point at a = 0 is called a 
pitchfork bifurcation. Your diagram may suggest why this name is 
appropriate. 


26. Consider the equation 


=ay—y = y(a — y). (31) 


a. Again consider the cases a < 0, a = 0, and a > 0. In each 

case find the critical points, draw the phase line, and determine 

whether each critical point is asymptotically stable, semistable, 

or unstable. 

b. In each case sketch several solutions of equation (31) in the 

ty-plane. 

c. Draw the bifurcation diagram for equation (31). 
Note: Observe that for equation (31) there are the same number of 
critical points fora < Oanda > 0 but that their stability has changed. 
For a < 0 the equilibrium solution y = 0 is asymptotically stable 
and y = a is unstable, while for a > O the situation is reversed. 
Thus there has been an exchange of stability as a passes through 
the bifurcation point a = 0. This type of bifurcation is called a 
transcritical bifurcation. 
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27. Chemical Reactions. A second-order chemical reaction 
involves the interaction (collision) of one molecule of a substance 
P with one molecule of a substance Q to produce one molecule of 
a new substance X; this is denoted by P + Q — X. Suppose that 
p and q, where p + q, are the initial concentrations of P and Q, 
respectively, and let x(t) be the concentration of X at time t. Then 
p — x(t) and q — x(t) are the concentrations of P and Q at time f, 
and the rate at which the reaction occurs is given by the equation 


T = OP x)(q — x), (32) 


where a is a positive constant. 


a. If x(0) = 0, determine the limiting value of x(t) as t — oo 

without solving the differential equation. Then solve the initial 

value problem and find x(t) for any t. 

b. If the substances P and Q are the same, then p — q and 

equation (32) is replaced by 

& = a(p—x)? (33) 
dt 

If x(0) = 0, determine the limiting value of x(t) as £ — oo 

without solving the differential equation. Then solve the initial 

value problem and determine x(t) for any t. 


26 Exact Differential Equations and 


Integrating Factors 


For first-order differential equations there are a number of integration methods that are 
applicable to various classes of problems. The most important of these are linear equations 
and separable equations, which we have discussed previously. Here, we consider a class of 
equations known as exact differential equations for which there is also a well-defined method 
of solution. Keep in mind, however, that the first-order differential equations that can be solved 
by elementary integration methods are rather special; most first-order equations cannot be 


solved in this way. 


EXAMPLE 1 


Solution: 


Solve the differential equation 


2x + y? + 2xyy’ = 0. (1) 


The equation is neither linear nor separable, so the methods suitable for those types of equations are 
not applicable here. However, observe that the function i^ (x, y) = x? + xy? has the property that 


Therefore, the differential equation can be written as 


[2] ð 
x+y E, aay = T. (2) 
Ow  Owvdy | 
er^ a 9 


Assuming that y is a function of x, wecan use the chain rule to write the left-hand side of equation (3) 
as di» (x, y) / dx. Then equation (3) has the form 


d d 
Tis y) 2 GO? xy =0. (4) 


Integrating equation (4) we obtain 


ix y) Sa ey =c, (5) 


where c is an arbitrary constant. The level curves of ~(x, y) are the integral curves of equation (1). 
Solutions of equation (1) are defined implicitly by equation (5). 


In solving equation (1) the key step was the recognition that there is a function v that 
satisfies equations (2). More generally, let the differential equation 


M(x, y) + N(x, y) y 20 (6) 
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be given. Suppose that we can identify a function 7 (x, y) such that 


0 ð 
Lis, jamei Ae sf o buo (7) 
x Oy 


and such that (x, y) = c defines y = ó(x) implicitly as a differentiable function of x.!? 
When there is a function Y% (x, y) such that Y, = M and v, = N, we can write 
OW | OW dy | 


M(x, y) + N(x, y)y’ = T D 07 


and the differential equation (6) becomes 


d 
qj; 0e $60) 


d 
4, 7 05 $09) =0. (8) 
X 


In this case equation (6) is said to be an exact differential equation because it can be expressed 
exactly as the derivative of a specific function. Solutions of equation (6), or the equivalent 
equation (8), are given implicitly by 


p(x, y) =c, (9) 


where c is an arbitrary constant. 

In Example 1 it was relatively easy to see that the differential equation was exact and, 
in fact, easy to find its solution, at least implicitly, by recognizing the required function «. 
For more complicated equations it may not be possible to do this so easily. How can we tell 
whether a given equation is exact, and if it is, how can we find the function w(x, y)? The 
following theorem answers the first question, and its proof provides a way of answering the 
second. 


Theorem 2.6.1 


Let the functions M, N, M y, and Ny, where subscripts denote partial derivatives, be continuous in 
the rectangular? region R: à, < x < 3,7 < y < 6. Then equation (6) 


M(x, y) + N(x, y)y’ 20 
is an exact differential equation in R if and only if 
My(x, y) = Nx(x, y) (10) 
at each point of R. That is, there exists a function w satisfying equations (7), 
VxGG y) = M(x, y), vyGG y) = N(x, y), 
if and only if M and N satisfy equation (10). 


The proof of this theorem has two parts. First, we show that if there is a function 7) such 
that equations (7) are true, then it follows that equation (10) is satisfied. Computing M, and 
N, from equations (7), we obtain 


M,(x, y) = Vix, y) N(x, y) = Wyx(x, y). (11) 


Since M, and N, are continuous, it follows that ,, and v, are also continuous. This 
guarantees their equality, and equation (10) is valid. 

We now show that if M and N satisfy equation (10), then equation (6) is exact. The proof 
involves the construction of a function v satisfying equations (7) 


Plx, y) = M(x, y), pyx, y) = N(x, y). 


15 While a complete discussion of when Y% (x, y) = c defines y = ¢ (x) implicitly as a differentiable function of x is 
beyond the scope and focus of this course, in general terms this condition is satisfied, locally, at points ( x, y), where 
Ow /Oy(x, y) #0. More details can be found in most books on advanced calculus. 

19Tt is not essential that the region be rectangular, only that it be simply connected. In two dimensions this means that 
the region has no holes in its interior. Thus, for example, rectangular or circular regions are simply connected, but an 
annular region is not. More details can be found in most books on advanced calculus. 
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We begin by integrating the first of equations (7) with respect to x, holding y constant. We 
obtain 


VG, y) = Q(x, y) + h), (12) 


where Q(x, y) is any differentiable function such that Q, = M. For example, we might 
choose 


(Gis T M(s, y)ds, (3) 


where xo is some specified constant with a < xo < 8. The function h in equation (12) is an 
arbitrary differentiable function of y, playing the role of the arbitrary constant (with respect 
to x). Now we must show that it is always possible to choose /i( y) so that the second of 
equations (7) is satisfied —that is, v, = N. By differentiating equation (12) with respect to y 
and setting the result equal to N(x, y), we obtain 


ð 
Wy(x, y) = Ba, y) +h'(y) = N(x, y). 
Then, solving for h'(y), we have 
; dQ 
h(y) = N(x, y) — By y). (14) 
y 


In order for us to determine h( y) from equation (14), the right-hand side of equation (14), 
despite its appearance, must be a function of y only. One way to show that this is true is to 
show that its derivative with respect to x is zero. Thus we differentiate the right-hand side of 
equation (14) with respect to x, obtaining the expression 


ON o ðQ 
, 2 s 15 
a: 9 Ae Dy y) (15) 
By interchanging the order of differentiation in the second term of equation (15), we have 
ON oQ 
Bx D) By Ox (x, y), 
or, since O, = M, 
ON OM 
g y) = on y), 


which is zero on account of equation (10). Hence, despite its apparent form, the right-hand side 
of equation (14) does not, in fact, depend on x. Then we find A( y) by integrating equation (14) 
and, upon substituting this function in equation (12), we obtain the required function w(x, y). 
This completes the proof of Theorem 2.6.1. 

It is possible to obtain an explicit expression for 7 (x, y) in terms of integrals (see Problem 
13), but in solving specific exact equations, it is usually simpler and easier just to repeat the 
procedure used in the preceding proof. That is, after showing that M, = Nx, integrate i», = M 
with respect to x, including an arbitrary function of A( y) instead of an arbitrary constant, and 
then differentiate the result with respect to y and set it equal to N. Finally, use this last equation 
to solve for A( y). The next example illustrates this procedure. 


EXAMPLE 2 


Solve the differential equation 


(y cosx + 2xe?) + (sinx + x?e? — 1) y/ 2 0. (16) 


Solution: 


By calculating M, and N,, we find that 
My(x, y) = cosx + 2xe” = N,(x, y), 
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so the given equation is exact. Thus there is a (x, y) such that 
Vx(x, y) = ycosx + 2xe’, 
Wy(x, y) = sinx 4- x2e? — ]. 
Integrating the first of these equations with respect to x, we obtain 
w(x, y) = ysinx +x7e? +h(y). (17) 
Next, computing 7 from equation (17) and setting Yy = N gives 
Wy(x, y) = sinx + x?e? +h'(y) =sinx + x?e? — 1. 


Thus A'(y) = —1 and A(y) = —y. The constant of integration can be omitted since any solution 
of the preceding differential equation is satisfactory; we do not require the most general one. 
Substituting for A( y) in equation (17) gives 


w(x, y) = ysinx 4 x?e? — y. 


Hence solutions of equation (16) are given implicitly by 


ysinx +x — y 2 c. (18) 


EXAMPLE 3 


Solve the differential equation 
(3xy + y^) +X?  xy)y' = 0. (19) 


Solution: 
We have 
My(x,y) =3x+2y, N(x, y) =2x +y; 


since M, # N,, the given equation is not exact. To see that it cannot be solved by the procedure 
described above, let us seek a function y such that 


Pala, y) =3xyty?, Pyl, y) =a? + xy. (20) 
Integrating the first of equations (20) with respect to x gives 


3 
p(x, y) = 3*» xy! RC), (21) 


where A is an arbitrary function of y only. To try to satisfy the second of equations (20), we compute 
Wy from equation (21) and set it equal to N, obtaining 


3 
5X *2xy ENO) =x + xy 
or 
/ 1 2 
h (y) = B — xy. (22) 


Since the right-hand side of equation (22) depends on x as well as y, itis impossible to solve equation 
(22) for A( y). Thus there is no w(x, y) satisfying both of equations (20). 


Integrating Factors. It is sometimes possible to convert a differential equation that is not 
exact into an exact differential equation by multiplying the equation by a suitable integrating 
factor. Recall that this is the procedure that we used in solving linear differential equations 
in Section 2.1. To investigate the possibility of implementing this idea more generally, let us 
multiply the equation 


M(x, y) + N(x, y)y' 20 Q3) 
by a function u and then try to choose ju so that the resulting equation 
u(x, y)M(x, y) + u(x, y) N(x, y) y' 20 Q4) 


is exact. By Theorem 2.6.1, equation (24) is exact if and only if 


(uM), — (EN). (25) 
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Since M and N are given functions, equation (25) states that the integrating factor j, must 
satisfy the first-order partial differential equation 


Mpy — Nux + (My — Nx) = 0. (26) 


If a function u satisfying equation (26) can be found, then equation (24) will be exact. The 
solution of equation (24) can then be obtained by the method described in the first part of this 
section. The solution found in this way also satisfies equation (23), since the integrating factor 
jt can be canceled out of equation (24). 

A partial differential equation of the form (26) may have more than one solution; if this is 
the case, any such solution may be used as an integrating factor of equation (23). This possible 
nonuniqueness of the integrating factor is illustrated in Example 4. 

Unfortunately, equation (26), which determines the integrating factor jz, is ordinarily at 
least as hard to solve as the original equation (23). Therefore, although in principle integrating 
factors are powerful tools for solving differential equations, in practice they can be found only 
in special cases. The most important situations in which simple integrating factors can be found 
occur when yu is a function of only one of the variables x or y, instead of both. 

Let us determine conditions on M and N so that equation (23) has an integrating factor u 
that depends on x only. If we assume that ji is a function of x only, then the partial derivative 
Hx reduces to the ordinary derivative du /dx and py = 0. Making these substitutions in 
equation (26), we find that 

du My, — N; 
dx N 
If (M, — N,)/N is a function of x only, then there is an integrating factor js that also depends 
only on x; further, u(x) can be found by solving differential equation (27), which is both 
linear and separable. 

A similar procedure can be used to determine a condition under which equation (23) has 

an integrating factor depending only on y; see Problem 17. 


"m (27) 


EXAMPLE 4 


Find an integrating factor for the equation 


(3xy + y?) G7 * xy)y' =0 (19) 
and then solve the equation. 


Solution: 


In Example 3 we showed that this equation is not exact. Let us determine whether it has an integrating 
factor that depends on x only. On computing the quantity (My — N,)/N, we find that 


My(x, y) — Nx(x, y) 3x c2y -QQx ct y). 1 


à 28 
N(x, y) x? t xy x (28) 
Thus there is an integrating factor u that is a function of x only, and it satisfies the differential 
equation 
du p 
ELS 29 
dx x (2) 
Hence (see Problem 7 in Section 2.2) 
(x) =x. (30) 
Multiplying equation (19) by this integrating factor, we obtain 
(3x7y 4- xy?) + G8 4 xy) y! =0. (31) 


Equation (31) is exact, since 
a 2 2 2 E 2 
——(3x"y +xy*) = 3x* + 2xy = —(x° + xy). 
Oy Ox 


Thus there is a function v such that 


Plx, y) =3x y + xy, wy(x, y) =x? xy. (32) 


Integrating the first of equations (32) with respect to x, we obtain 


1 
(x,y) =x3y + ae +h(y). 
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Substituting this expression for Ņ (x, y) in the second of equations (32), we find that 


x3 4 xy +h'(y) =x +x7y, 


so h'(y) = 0 and h(y) isa constant. Thus the solutions of equation (31), and hence of equation (19), 


are given implicitly by 


1 
xy + y Le. 


(33) 


Solutions may also be found in explicit form since equation (33) is quadratic in y. 
You may also verify that a second integrating factor for equation (19) is 


1 


H0 7 SOx +y) 


and that the same solution is obtained, though with much greater difficulty, if this integrating factor 
is used (see Problem 22). 


Problems 


Determine whether each of the equations in Problems 1 through 8 is 
exact. If itis exact, find the solution. 


1. Qx43)4-(2y-2)y'20 

2. (2x +4y) + (2x -2y)y' 20 

3. (332—2xy +2) + (65? - x? +3)y’ Z0 
4 dy | ax+by 

dx bx-cy 

dy ax — by 


dx bx— Cy 
6. (ye? cos(2x) —2e*” sin(2x) +2x) +(xe*” cos(2x) —3) y' «0 
7. (y/x +6x) -(Inx -2,y'20, x>0 
d 
8 — — 4 
(x2 + y2)3/2 (x2 + y2)3/2 dx 
In each of Problems 9 and 10, solve the given initial value problem 
and determine at least approximately where the solution is valid. 


9. (2x-—y)+(2y-x)y'=0, y(1) 23 
10. (932 +y—1) -(4y-x3)y/'20, y(1) =0 


In each of Problems 11 and 12, find the value of b for which the given 
equation is exact, and then solve it using that value of b. 


11. (xy? + bx?y) +(x + y) xy! 2 0 
12. (ye? +x) + bxe*y’ 20 


13. Assume that equation (6) meets the requirements of Theorem 
2.6.1 in a rectangle R and is therefore exact. Show that a possible 
function w(x, y) is 


w(x, y) >| M(s, yo) ass f N(x, t) dt, 
Xp y 


M 


where (xo, yo) is a point in R. 


14. Show that any separable equation 
M(x) + N(y)y' =0 


is also exact. 


In each of Problems 15 and 16, show that the given equation is not 
exact but becomes exact when multiplied by the given integrating 
factor. Then solve the equation. 


15. xy +a + yy’ =0, p(x, y) = 1/ (xy?) 


16. (x+2)siny+(xcosy)y’=0, p(x, y) =xe* 


17. Show that if (N, — M,)/M = Q, where Q is a function of y 
only, then the differential equation 


M+Ny'=0 


has an integrating factor of the form 
u(y) = exp | Q(y)dy. 


In each of Problems 18 through 21, find an integrating factor and solve 
the given equation. 


18. (3x?y + 2xy + y?) +(x? + y*)y’ =0 
19. yW=e*%+y-1 
20. 1c (x/y—siny)y/20 
21. y xy — e?») y' 20 
22. Solve the differential equation 
(3xy + y?) +(x? + xy)y 20 


using the integrating factor u(x, y) = (xy(2x + y))7!. Verify that 
the solution is the same as that obtained in Example 4 with a different 
integrating factor. 
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2.7 Numerical Approximations: 
Euler’s Method 


Recall two important facts about the first-order initial value problem 


I — fq y), y(to) = yo. (1) 
dt 
First, if f and 0f/0y are continuous, then the initial value problem (1) has a unique solution 
y = ó(t) in some interval surrounding the initial point £ = tọ. Second, it is usually not 
possible to find the solution ¢ by symbolic manipulations of the differential equation. Up to 
now we have considered the main exceptions to the latter statement: differential equations that 
are linear, separable, or exact, or that can be transformed into one of these types. Nevertheless, 
it remains true that solutions of the vast majority of first-order initial value problems cannot 
be found by analytical means, such as those considered in the first part of this chapter. 
Therefore, it is important to be able to approach the problem in other ways. As we 
have already seen, one of these ways is to draw a direction field for the differential 
equation (which does not involve solving the equation) and then to visualize the behavior of 
solutions from the direction field. This has the advantage of being a relatively simple process, 
even for complicated differential equations. However, it does not lend itself to quantitative 
computations or comparisons, and this is often a critical shortcoming. 
For example, Figure 2.7.1 shows a direction field for the differential equation 


— =3—-2t —0.5y. (2) 
From the direction field you can visualize the behavior of solutions on the rectangle shown 


in the figure. On this rectangle a solution starting at a point on the y-axis initially increases 
with f, but it soon reaches a maximum value and then begins to decrease as t increases further. 
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A direction field for equation (2): dy/dt = 3 — 2t — 0.5y. 


You may also observe that in Figure 2.7.1 many tangent line segments at successive values 
of t almost touch each other. It takes only a bit of imagination to consider starting at a point on 
the y-axis and linking line segments for successive values of t in the grid, thereby producing 
a piecewise linear graph. Such a graph would apparently be an approximation to a solution of 
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the differential equation. To convert this idea into a useful method for generating approximate 
solutions, we must answer several questions, including the following: 


1. Can we carry out the linking of tangent lines in a systematic and straightforward 
manner? 


2. If so, does the resulting piecewise linear function provide an approximation to an actual 
solution of the differential equation? 


3. If so, can we assess the accuracy of the approximation? That is, can we estimate how far 
the approximation deviates from the solution itself? 


It turns out that the answer to each of these questions is affirmative. The resulting method 
was originated by Euler about 1768 and is referred to as the tangent line method or the Euler 
method. We will deal with the first two questions in this section, but will defer a systematic 
discussion of the third question until Chapter 8. 

To see how the Euler method works, let us consider how the tangent lines might be used 
to approximate the solution y = ¢ (t) of initial value problem (1) near t = tọ. We know that 
the solution passes through the initial point (to, yo), and from the differential equation, we also 
know that its slope at this point is f(1o, yo). Thus we can write down an equation for the line 
tangent to the solution curve at (fo, yo), namely, 


y = yo + f(to, yo) (t — to). (3) 


The tangent line is a good approximation to the actual solution curve on an interval short 
enough so that the slope of the solution does not change appreciably from its value at the 
initial point; see Figure 2.7.2. Thus, if t; is close enough to tọ, we can approximate ¢ (t1) by 
the value y, determined by substituting ¢ = f, into the tangent line approximation at t = tọ; 
thus 


yı = Yo + f (to. yo) (tı — to). (4) 
y 
Tangent line 
Qe dot fto, Yo) (t — to) 
Jy H------- oe » d 


ee eae ol Solution 
ó (t) "d yz w) 


didus yee) A tangent line approximation of 
y' = f(t, y) at (fo, yo). 


To proceed further, we can try to repeat the process. Unfortunately, we do not know the 
value $ (tı) of the solution at tı. The best we can do is to use the approximate value y; instead. 
Thus we construct the line through (tı, y1) with the slope f(t), y1), 


y =y + f(t, y(t — t). (5) 


To approximate the value of ¢ (t) ata nearby point t, we use equation (5) instead of equation 
(3), obtaining 


y» = yy + f(t, y 5 — t1). (6) 


Continuing in this manner, we use the value of y calculated at each step to determine 
the slope of the approximation for the next step. The general expression for the tangent line 
starting at (tn, Yn) is 


y — Yn fg Yn) = th); (7) 


hence the approximate value y,,, at t,4, in terms of tn, tn+1, and y, is 


Yn41 = Yn + ftn; Yn) Cai mad tn), n = 0, 1, 2. et (8) 


TT 
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If we introduce the notation f, = f (ta, Yn), then we can rewrite equation (8) as 
Yntt = Yat fn: Cua — tn), n =0, 1, 2, see (9) 


Finally, if we assume that there is a uniform step size h between the points fg, t1, t2, ... , then 
tn+1 = fn + h for each n, and we obtain Euler’s formula in the form 


Piai = Ya - fahs n 20,12, .... (10) 


To use Euler's method, you repeatedly evaluate equation (9) or equation (10), depending 
on whether or not the step size is constant, using the result of each step to execute the next 
step. In this way you generate a sequence of values y1, y2, ys. ... that approximate the values 
of the solution $ (1) at the points f), fo, t3, ... . If, instead of a sequence of points, you need 
a function to approximate the solution o (f), then you can use the piecewise linear function 
constructed from the collection of tangent line segments. That is, let y be given in [to, t1] by 
equation (7) with n = 0, in [t;, t2] by equation (7) with n = 1, and so on. 


EXAMPLE 1 


Consider the initial value problem 


— =3-2t-0. =i; 11 
d 3—-2t—0.5y, y(0) (11) 


Use Euler’s method with step size h = 0.2 to find approximate values of the solution of initial value 
problem (9) at t = 0.2, 0.4, 0.6, 0.8, and 1. Compare them with the corresponding values of the 
actual solution of the initial value problem. 


Solution: 


Note that the differential equation in the given initial value problem is the same as in equation (2); its 
direction field is shown in Figure 2.7.1. Before applying Euler's method, observe that this differential 
equation is linear, so it can be solved as in Section 2.1, using the integrating factor e' /? The resulting 
solution of the initial value problem (9) is 


y =¢(t) 214 — 4t — 13e, (12) 


We will use this information to assess how the approximate solution obtained by Euler's method 
compares with the exact solution. 

To approximate this solution by Euler's method, note that f(t, y) = 3 — 2t — 0.5y. Using the 
initial values £j = 0 and yọ = 1, we find that 


to = f(to. yo) = f(0,1) 23—-0—0.522.5 
and then, from equation (3), the tangent line approximation near t = 0 is 
y =1+2.5(t —0) = 1 +2.5t. (13) 
Setting t = 0.2 in equation (13), we find the approximate value y; of the solution at t = 0.2, namely, 
yı = 14 (2.5)(0.2) = 1.5. 
At the next step we have 
fi= ft. yp = f(0.2, 1.5) = 3 — 2(0.2) — (0.5)(1.5) = 3 — 0.4 — 0.75 = 1.85. 

Then the tangent line approximation near t = 0.2 is 

y = 1.54 1.85(t — 0.2) = 1.13 + 1.852. (14) 


Evaluating the expression in equation (14) for t = 0.4, we obtain 


yo = 1.13 + 1.85(0.4) = 1.87. 


Repeating this computational procedure three more times, we obtain the results shown in Table 2.7.1. 
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Results of Euler’s Method with h = 0.2 for 
y! =3 — 2t —0.5y, y(0) =1 


TABLE 2.7.1 


Sn = f (ta, Yn) Tangent Line Exact y (tn) 
O0 0.0 1.00000 25 y=1+4+2.5(t — 0) 1.00000 
1 02 1.50000 1.85 y=1.541.85(t — 0.2) 1.43711 
2 04 1.87000 1.265 y = 1.87 + 1.265(t — 0.4) 1.75650 
3 0.6 2.12300 0.7385 y — 2.123 4- 0.7385(t — 0.6) 1.96936 
4 0.8 2.27070 0.26465 y = 2.2707 + 0.26465(t — 0.8) 2.08584 
5 10 2.32363 2.11510 


The second column contains the t-values separated by the step size h = 0.2. The third column 
shows the corresponding y-values computed from Euler’s formula (10). Column four contains the 
slopes f, of the tangent line at the current point, (tn, Yn). In the fifth column are the tangent line 
approximations found from equation (7). The sixth column contains values of the solution (12) of 
the initial value problem (9), correct to five decimal places. The solution (12) and the tangent line 
approximation are also plotted in Figure 2.7.3. 
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Plots of the solution and a tangent line 
approximation with h = 0.2 for the initial value problem (9): 


dy/dt 23 —2t — 0.5y, y(0) =1. 


From Table 2.7.1 and Figure 2.7.3 we see that the approximations given by Euler's method for 
this problem are greater than the corresponding values of the actual solution. This is because the 
graph of the solution is concave down and therefore the tangent line approximations lie above the 
graph. 

The accuracy of the approximations in this example is not good enough to be satisfactory in a 
typical scientific or engineering application. For example, at £ = 1 the error in the approximation 
is 2.32363 — 2.11510 = 0.20853, which is a percentage error of about 9.86% relative to the exact 
solution. One way to achieve more accurate results is to use a smaller step size, with a corresponding 
increase in the number of computational steps. We explore this possibility in the next example. 


Of course, computations such as those in Example 1 and in the other examples in this 
section are usually done on a computer. Some software packages include code for the Euler 
method, while others do not. In any case, it is straightforward to write a computer program 
that will carry out the calculations required to produce results such as those in Table 2.7.1. 
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Basically, what is required is a loop that will evaluate equation (10) repetitively, along 
with suitable instructions for input and output. The output can be a list of numbers, as in 
Table 2.7.1, or a plot, as in Figure 2.7.3. The specific instructions can be written in any high- 
level programming language with which you are familiar. 


EXAMPLE 2 


Consider again the initial value problem (9) 


4Y _3-24-0.5y, 0-1 

d t * y , y A 
Use Euler’s method with various step sizes to calculate approximate values of the solution for 
0 < t < 5. Compare the calculated results with the corresponding values of the exact solution (12) 


y = 14 — 4t — 13e7*/?, 


Solution: 


We used step sizes h = 0.1, 0.05, 0.025, and 0.01, corresponding to 50, 100, 200, and 500 steps, 
respectively, to go from t = 0 tot = 5. The results of these calculations, along with the values of 
the exact solution, are summarized in Table 2.7.2. All computed entries are rounded to four decimal 
places, although more digits were retained in the intermediate calculations. 


Comparison of the Exact Solution with Euler's Method for 
Several Step Sizes h for y’ = 3 — 2t — 09.5y, y(0) =1 


TABLE 2.7.2 


h = 9.05 h = 0.025 h=0.01 Exact 
0.0 1.0000 1.0000 1.0000 1.0000 1.0000 
1.0 2.2164 2.1651 2.1399 2.1250 2.1151 
2.0 1.3397 1.2780 1.2476 1.2295 1.2176 
3.0 —0.7903 —0.8459 —0.8734 —0.8898 —0.9007 
4.0 —3.6707 E3058 237373 —3.7506 —3.7594 
5.0 —7.0003 —7.0337 —7.0504 —7.0604 —7.0671 


What conclusions can we draw from the data in Table 2.7.2? The most important observation 
is that, for a fixed value of t, the computed approximate values become more accurate as the step 
size h decreases. You can see this by reading across a particular row in the table from left to right. 
This is what we would expect, of course, but it is encouraging that the data confirm our expectations. 
For example, for t = 2 the approximate value with h = 0.1 is too large by 0.1221 (about 10%), 
whereas the value with h = 0.01 is too large by only 0.0119 (about 1%). In this case, reducing the 
step size by a factor of 10 (and performing 10 times as many computations) also reduces the error 
by a factor of about 10. Comparing the errors for other pairs of values in the table confirms that this 
relation between step size and error holds for them also: reducing the step size by a given factor also 
reduces the error by approximately the same factor. Does this mean that for the Euler method the 
error is approximately proportional to the step size? Of course, one example does not establish such 
a general result, but it is at least an interesting conjecture.?? 

A second observation from Table 2.7.2 is that, for a fixed step size h, the approximations become 
more accurate as ¢ increases, at least for t > 2. For instance, for h = 0.1 the error for t = 5 is only 
0.0668, which is a little more than one-half of the error at t = 2. We will return to this matter later 
in this section. 

All in all, Euler's method seems to work rather well for this problem. Reasonably good results 
are obtained even for a moderately large step size h = 0.1, and the approximation can be improved 
by decreasing h. 


20A more detailed discussion of the errors in using the Euler method appears in Chapter 8. 
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Let us now look at another example. 


EXAMPLE 3 


Consider the initial value problem 
—=4-1f+2y, y(0)=1. (15) 


The general solution of this differential equation was found in Example 2 of Section 2.1, and the 
solution of the initial value problem (11) is 


(16) 


Use Euler's method with several step sizes to find approximate values of the solution on the interval 
0 < t < 5. Compare the results with the corresponding values of the solution (16). 


Solution: 


Using the same range of step sizes as in Example 2, we obtain the results presented in Table 2.7.3. 


Comparison of the Exact Solution with Euler's Method for 
Several Step Sizes h for y’ = 4—t+2y, y(0) —1 


TABLE 2.7.3 


h — 0.025 h = 0.01 
0.0 1.000000 1.000000 1.000000 1.000000 1.000000 
1.0 15.77728 17.25062 18.10997 18.67278 19.06990 
2.0 104.6784 123.7130 135.5440 143.5835 149.3949 
3.0 652.5349 837.0745 959.2580 1045.395 1109.179 
4.0 4042.122 5633.351 6755.175 7575.577 8197.884 
5.0 25026.95 37897.43 47555.35 54881.32 60573.53 


The data in Table 2.7.3 again confirm our expectation that, for a given value of t, accuracy 
improves as the step size h is reduced. For example, for t = 1 the percentage error diminishes from 
17.3% when h = 0.1 to 2.1% when h = 0.01. However, the error increases fairly rapidly as t 
increases for a fixed ^. Even for h = 0.01, the error at  — 5 is 9.4%, and it is much greater for larger 
step sizes. Of course, the accuracy that is needed depends on the purpose for which the results are 
intended, but the errors in Table 2.7.3 are too large for most scientific or engineering applications. 
To improve the situation, we might either try even smaller step sizes or else restrict the computations 
to a rather short interval away from the initial point. Nevertheless, it is clear that Euler’s method is 
much less effective in this example than in Example 2. 


To understand better what is happening in these examples, let us look again at Euler’s 

method for the general initial value problem (1) 
=f(t,y), yo = Yo. 

whose exact solution we denote by o (t). Recall that a first-order differential equation has an 
infinite family of solutions, indexed by an arbitrary constant c, and that the initial condition 
picks out one member of this infinite family by determining the value of c. Thus in the infinite 
family of solutions, o (f) is the one solution that satisfies the initial condition ¢ (tọ) = yo. 

At the first step Euler's method uses the tangent line approximation to the graph of 
y = ġ(t) passing through the initial point (fo, yo), and this produces the approximate value 
yı at tj. Usually, y; + ó(1;), so at the second step Euler’s method uses the tangent line 
approximation not to y = (ft), but to a nearby solution y = @;(t) that passes through 
the point (tı, y1). So it is at each subsequent step. Euler's method uses a succession of 
tangent line approximations to a sequence of different solutions o (f), $1(t), d2(t), ... ofthe 
differential equation. At each step the tangent line is constructed to the solution passing through 
the point determined by the result of the preceding step, as shown in Figure 2.7.4. The quality 
of the approximation after many steps depends strongly on the behavior of the set of solutions 
that pass through the points (tn, Yn) for n = 1,2,3, ... . 
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The Euler method. 


In Example 2 the general solution of the differential equation is 


y =14 — 4t + ce !? (17) 


and the solution of the initial value problem (9) corresponds to c = —13. The family 
of solutions (17) is a converging family since the term involving the arbitrary constant c 
approaches zero as t — co. It does not matter very much which solutions we are approximating 
by tangent lines in the implementation of Euler's method, since all the solutions are getting 
closer and closer to each other as ¢ increases. 

On the other hand, in Example 3 the general solution of the differential equation is 


—— (18) 
4 2 ' 

and, because the term involving the arbitrary constant c grows without bound as t — oo, this 
is a diverging family. Note that solutions corresponding to two nearby values of c become 
arbitrarily far apart as ¢ increases. In Example 3 we are trying approximate the solution for 
c — 11/4, but in the use of Euler's method we are actually at each step following another 
solution that separates from the desired one faster and faster as f increases. This explains why 
the errors in Example 3 are so much larger than those in Example 2. 

In using a numerical procedure such as the Euler method, you must always keep in mind 
the question of whether the results are accurate enough to be useful. In the preceding examples, 
the accuracy of the numerical results could be determined directly by a comparison with the 
solution obtained analytically. Of course, usually the analytical solution is not available if a 
numerical procedure is to be employed, so what we usually need are bounds for, or at least 
estimates of, the error that do not require a knowledge of the exact solution. You should also 
keep in mind that the best that we can expect, or hope for, from a numerical approximation 
is that it reflects the behavior of the actual solution. Thus a member of a diverging family of 
solutions will always be harder to approximate than a member of a converging family. 

If you wish to read more about numerical approximations to solutions of initial value 
problems, you may go directly to Chapter 8 at this point. There, we present some information 
on the analysis of errors and also discuss several algorithms that are computationally much 
more efficient than the Euler method. 


Note about Variations of Computed Results. Most of the problems of the book are recorded to six digits in most cases, although more 
in this section call for fairly extensive numerical computations. To digits were retained in the intermediate calculations. 
handle these problems you need suitable computing hardware and In each of Problems 1 through 4: 


software. Keep in mind that numerical results may vary somewhat, @ a. Find approximate values of the solution of the given initial 
depending on how your program is constructed and on how your value problem at  — 0.1, 0.2, 0.3, and 0.4 using the Euler 
computer executes arithmetic steps, rounds off, and so forth. Minor method with h — 0.1. 

variations in the last decimal place may be due to such causes and do @ b. Repeat part (a) with h = 0.05. Compare the results with 


not necessarily indicate that something is amiss. Answers in the back those ‘found in: 3: 


Q c. Repeat part a with h = 0.025. Compare the results with 
those found in a and b. 

Q d. Find the solution y = ¢(t) of the given problem and 
evaluate ġ (t) at t = 0.1, 0.2, 0.3, and 0.4. Compare these values 
with the results of a, b, and c. 


1. y'=34t-y, y(0)=1 
2. y'=2y-1, y(0)=1 

3. y'=0.5—t+2y, y(0)=1 
4. y'—3cost—2y, y(0) =0 


In each of Problems 5 through 8, draw a direction field for the given 
differential equation and state whether you think that the solutions are 
converging or diverging. 

@ 5. y =5-3/y 

Q 6. y=y3-1y) 

@ 7. y 2-ty 401? 

[G] 8. yl = t? + y? 

In each of Problems 9 and 10, use Euler’s method to find approximate 
values of the solution of the given initial value problem at t = 0.5, 
1, 1.5, 2, 2.5, and 3: (a) With h = 0.1, (b) With h = 0.05, (c) With 
h = 0.025, (d) With h = 0.01. 


Q 9. /-5-3y» y(0) =2 
O 10. y'=y(3-ty), y(0) =0.5 
11. Consider the initial value problem 
3t? 
!-———, 1) 20. 
Ped yQ) 


Q a. Use Euler’s method with h = 0.1 to obtain approximate 
values of the solution at t = 1.2, 1.4, 1.6, and 1.8. 

QJ b. Repeat part a with h = 0.05. 

C. Compare the results of parts a and b. Note that they are 
reasonably close for t — 1.2, 1.4, and 1.6 but are quite different 
fort — 1.8. Also note (from the differential equation) that 
the line tangent to the solution is parallel to the y-axis when 
y= +2/\/3 = +1.155. Explain how this might cause such 
a difference in the calculated values. 


@ 12. Consider the initial value problem 
y(0) = I. 


Use Euler’s method with h = 0.1, 0.05, 0.025, and 0.01 to explore the 
solution of this problem for 0 < t < 1. What is your best estimate 
of the value of the solution at £ = 0.8? At t = 1? Are your results 
consistent with the direction field in Problem 8? 


y= Py 


13. Consider the initial value problem 
y'=-ty+0.1y*, y(0) =a, 


where o is a given number. 
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Q a. Draw a direction field for the differential equation (or 
reexamine the one from Problem 7). Observe that there is a 
critical value of o in the interval 2 < a < 3 that separates 
converging solutions from diverging ones. Call this critical 
value ao. 

O b. Use Euler’s method with h = 0.01 to estimate ay. Do this 
by restricting a to an interval [a, b], where b — a = 0.01. 


14. Consider the initial value problem 


y-2y-P, y(0) =a, 

where a is a given number. 
Q a. Draw a direction field for the differential equation. Note 
that there is a critical value of o in the interval 0 < a < 1 
that separates converging solutions from diverging ones. Call this 
critical value ao. 
@ b. Use Euler’s method with h = 0.01 to estimate ay. Do this 
by restricting a to an interval [a, b], where b — a = 0.01. 


15. Convergence of Euler's Method. It can be shown that 
under suitable conditions on f, the numerical approximation 
generated by the Euler method for the initial value problem 
y' = f(t, y). y(to) = yo converges to the exact solution as the step 
size h decreases. This is illustrated by the following example. Consider 
the initial value problem 


y=1-t+y, y(t) = yo 


a. Show that the exact solutionis y = ¢ (t) = (yo—to)e +t. 
Q b. Using the Euler formula, show that 


yk = (1 A)y 1) +h — ht, kK=1,2,.... 


c. Noting that y; = (1+/)(yo — to) + tj, show by induction 
that 


Yn = (1 +h)” (yo — to) + tn (19) 


for each positive integer n. 

d. Consider a fixed point £ > to and for a given n choose 

h = (t — tg) / n. Then t, = t for every n. Note also that h > 0 

asn — oo. By substituting for / in equation (19) and letting 

n — œ, show that y, > o (ft) as n > co. 

Hint: lim (1 +a/n)" = e. 

noo 

In each of Problems 16 and 17, use the technique discussed in Problem 
15 to show that the approximation obtained by the Euler method 
converges to the exact solution at any fixed point as  — 0. 


16. y =y, y(0)=1 


17. y 22y-1 y(0)=1 Hint: y, =(1+2h)/2+ 1/2 


28 The Existence and Uniqueness Theorem 


In this section we discuss the proof of Theorem 2.4.2, the fundamental existence and 
uniqueness theorem for first-order initial value problems. Recall that this theorem states that 


under certain conditions on f(t, y), the initial value problem 
y = f(t, y), 


has a unique solution in some interval containing the point tọ. 


y(to) = Yo 


(1) 
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In some cases (for example, if the differential equation is linear), the existence of a 
solution of the initial value problem (1) can be established directly by actually solving the 
problem and exhibiting a formula for the solution. However, in general, this approach is not 
feasible because there is no method of solving the differential equation that applies in all cases. 
Therefore, for the general case, it is necessary to adopt an indirect approach that demonstrates 
the existence of a solution of initial value problem (1) but usually does not provide a practical 
means of finding it. The heart of this method is the construction of a sequence of functions that 
converges to a limit function satisfying the initial value problem, although the members of the 
sequence individually do not. As a rule, it is impossible to compute explicitly more than a few 
members of the sequence; therefore, the limit function can be determined only in rare cases. 
Nevertheless, under the restrictions on f(t, y) stated in Theorem 2.4.2, it is possible to show 
that the sequence in question converges and that the limit function has the desired properties. 
The argument is fairly intricate and depends, in part, on techniques and results that are usually 
encountered for the first time in a course on advanced calculus. Consequently, we do not go 
into all the details of the proof here; we do, however, indicate its main features and point out 
some of the difficulties that must be overcome. 

First of all, we note that it is sufficient to consider the problem in which the initial point 
(to, yo) is the origin; that is, we consider the problem 


y -—f(ty, y(0 =0. Q) 


If some other initial point is given, then we can always make a preliminary change of variables, 
corresponding to a translation of the coordinate axes, that will take the given point (fp, yo) into 
the origin. The existence and uniqueness theorem can now be stated in the following way. 


Theorem 2.8.1 | 
y' =f(ty),y(0) 20 


Existence and Uniqueness of Solutions of 


If f and Of/Oy are continuous in a rectangle R:|t| < a, |y| < b, then there is some interval 
|t| € h < a in which there exists a unique solution y = ¢ (t) of the initial value problem (2). 


For the method of proof discussed here it is necessary to transform initial value problem 
(2) into a more convenient form. If we suppose temporarily that there is a differentiable 
function y = ó(t) that satisfies the initial value problem, then f(t, $(t)) is a continuous 
function of t only. Hence we can integrate y' = f(t, y) from the initial point t = O to an 
arbitrary value of t, obtaining 


(t) =} f(s, 6 (5)) ds, (3) 


where we have made use of the initial condition ¢(0) = 0. We also denote the dummy variable 
of integration by s. 

Since equation (3) contains an integral of the unknown function œ, it is called an integral 
equation. This integral equation is not a formula for the solution of the initial value problem, 
but it does provide another relation satisfied by any solution of equations (2). Conversely, 
suppose that there is a continuous function y = ¢ (t) that satisfies the integral equation (3); 
then this function also satisfies the initial value problem (2). To show this, we first substitute 
zero for t in equation (3), which shows that the initial condition is satisfied. Further, since the 
integrand in equation (3) is continuous, it follows from the fundamental theorem of calculus 
that ¢ is differentiable and that 6’(t) = f(t, $(1)). Therefore, the initial value problem and 
the integral equation are equivalent in the sense that any solution of one is also a solution of 
the other. It is more convenient to show that there is a unique solution of the integral equation 
in a certain interval |t| < ^. The same conclusion also holds for the initial value problem (2). 

One method of showing that the integral equation (3) has a unique solution is known as the 
method of successive approximations or Picard's?! iteration method. In using this method, 


21 Charles-Emile Picard (1856-1914) was appointed professor at the Sorbonne before the age of 30. Except for Henri 
Poincaré, he is perhaps the most distinguished French mathematician of his generation. He is known for important 
theorems in complex variables and algebraic geometry as well as differential equations. A special case of the method 
of successive approximations was first published by Liouville in 1838. However, the method is usually credited to 
Picard, who established it in a general and widely applicable form in a series of papers beginning in 1890. 
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we start by choosing an initial function ¢9, either arbitrarily or to approximate in some way 
the solution of the initial value problem. The simplest choice is 


$o(f) = 0; (4) 


then ġo at least satisfies the initial condition in equations (2), although presumably not the 
differential equation. The next approximation $ is obtained by substituting $o(s) for (s) 
in the right-hand side of equation (3) and calling the result of this operation ¢ (f). Thus 


p(t) = [ 16 oso) as. (5) 
Similarly, 2 is obtained from ó: 
a(t) = [ 16 o2) ds (6 
and, in general, 
Pn4i(t) = [ f(s, On(s)) ds. (7) 
In this manner we generate the sequence of functions (65) = (60, 61,02, ---, Ons -F 


Each member of the sequence satisfies the initial condition, but in general none 
satisfies the differential equation. However, if at some stage, say, for n = k, we find that 
Qf) = Qt), then it follows that x is a solution of the integral equation (3). Hence ¢, 
is also a solution of the initial value problem (2), and the sequence is terminated at this point. 
In general, this does not occur, and it is necessary to consider the entire infinite sequence. 

To establish Theorem 2.8.1, we must answer four principal questions: 


1. Do all members of the sequence {¢ ) exist, or may the process break down at some stage? 


2. Does the sequence converge? 


3. What are the properties of the limit function? In particular, does it satisfy the integral 
equation (3) and hence the initial value problem (2)? 


4. Is this the only solution, or may there be others? 


We first show how these questions can be answered in a specific and relatively simple example 
and then comment on some of the difficulties that may be encountered in the general case. 


EXAMPLE 1 


Solve the initial value problem 


y=2r1+y), y(0) =0 (8) 
by the method of successive approximations. 


Solution: 


Note first that if y = o (t), then the corresponding integral equation is 
t 
ot) =| 2s(1 + 6(5)) ds. (9) 
0 
If the initial approximation is ¢o(t) = 0, it follows that 
t t 
"NO zi 2s(1+u(5))as= | 2sds = t°. (10) 
0 0 


Similarly, 


4 


t t t 
oan = f 2s(1+0u(s))ds= f BU +sds =P + > (11) 
0 0 
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and 
t t st 14 19 
$3(t) 2 2(1+6x60))as =f 2s| 14-52 ds =t + — + . (12) 
0 0 2 2 2-3 
Equations (10), (11), and (12) suggest that 
2 t* t pn T 
@n(t) =t tata t v ta (13) 


for each n > 1, and this result can be established by mathematical induction, as follows. Equation 
(13) is certainly true for n = 1; see equation (10). We must show that if it is true for n = k, then it 
also holds for n — k + 1. We have 


t 
Ó 4a) = 2s(1 + óx(s)) ds 
0 


t 4 2k 
2,5 s 
= 2s| l+s°+—+---+ ds 
0 2! k! 


t 3 255 2g2k+1 
= | 2+2 Tu deep idi (14) 
4 6 2k+2 
Ed ing ; 
21^ 3f (k 4 D! 


and the inductive proof is complete. 

A plot of the first four iterates, o 1(1), 62(t), $3(f), and $4(1), is shown in Figure 2.8.1. As k 
increases, the iterates seem to remain close over a gradually increasing interval, suggesting eventual 
convergence to a limit function. 


-1.5 -1 -0.5 0.5 1 1.5 £ 


Plots of the first four Picard iterates 


y — 61), .... y = 4(t) for Example I: 
dy/dt =2t(1 +y), y(0) =0. 


It follows from equation (13) that (1) is the n™ partial sum of the infinite series 
2k 


kt 
k=1 


; (15) 


hence lim @,(t) exists if and only if the series (15) converges. Applying the ratio test, we see that, 
n—oo 
for each f, 


t2 


E k+] 


pea k! 
| »0as k > oo. (16) 


(k +1)! 174 


Thus the interval of convergence for series (15) is the entire t-axis. This means its sum ¢ (t) is the 

limit of the sequence (6, (1)] for every value of t. Further, since the series (15) is a Taylor series, 

it can be differentiated or integrated term-by-term for all values of t. Therefore, we can verify by 
oo 


direct computation that ¢ (t) = >> t?*/k! is a solution of the integral equation (9). Alternatively, 
k=1 
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by substituting o (f) for y in equations (8), we can verify that this function satisfies the initial value 
problem (6). In this example it is also possible, from the series (15), to identify the solution o (t) 
in terms of elementary functions, namely, o (t) = e' Pes 1. (See Problem 13.) However, this is not 
necessary for the discussion of existence and uniqueness. 

Explicit knowledge of ¢(t) does make it possible to visualize the convergence of the sequence 
of iterates more clearly by plotting the difference e,(t) = @(t) — (t) for various values of k. 
Figure 2.8.2 shows this difference for k = 1,2,3, 4. This figure clearly illustrates the gradually 
increasing interval over which successive iterates provide a good approximation to the solution of 
the initial value problem. 


-1.5 -1 -0.5 0.5 1 15 ¢ 


Plots of y = e(t) = ġ (t) — (t) for Example 1 
fork =1,...,4. 


Finally, to deal with the question of uniqueness, let us suppose that the initial value problem has 
two different solutions o and w. The assumption that @ and w are different means there is at least 
one value of t for which ¢(t) — y(t) 4 0. Also, since ¢ and w both satisfy the integral equation (9), 
we have by subtraction (and the linearity of integration) that 


t 
o) — v) =f 28( G(s) — v(s))ds. 
0 


Taking absolute values of both sides, we have, if t > 0, 


lo(t) = pel = 


] 200 -vonas «f 2s|ó (s) — p(s) |ds. 
0 0 


If we restrict f to lie in the interval 0 < t < A/2, where A is arbitrary, then 2t < A and 


t 
po - «oi « 4 f ló(s) — v(s)lds for 0 x t x A/2. (7) 
0 
It is now convenient to introduce the function U defined by 
t 
U(t) =| lọ (s) — 4% (s) |ds. (18) 
0 
Then it follows at once that 
U(0) — 0, (19) 
U(t) > 0, fort —- 0. (20) 


Further, U is differentiable, and U'(t) = |o (t) — v (1)|. Hence, by equation (17), 
U'(t) — AU(t) <0 for 0 <t x A/2. (21) 


—At 


Multiplying equation (21) by the positive quantity e ^^" gives 


(e UM) <0 for 0t < A/2. Q2) 


87 
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Then, upon integrating equation (22) from zero to ¢ and using equation (19), we obtain 
e "U(t) <0 for 0 <t < A/2. 


Hence U(t) < Ofor0 <t x A/2. However, since A is arbitrary, we conclude that U (t) < 0 for all 
nonnegative f. This result and equation (20) are compatible only if U(t) = 0 for each t > 0. Thus 
U'(t) = 0 and therefore y(t) = (1) for all t > 0. This contradicts the hypothesis that ó and w 
are two different solutions. Consequently, there cannot be two different solutions of the initial value 
problem for t > 0. A slight modification of this argument leads to the same conclusion for t < 0. 


Returning now to the general problem of solving the integral equation (3), let us consider 
briefly each of the questions raised earlier: 


1. Do all members of the sequence {ġ pn} exist? 

In the example, f and Of/Oy were continuous in the whole ty-plane, and each 
member of the sequence could be explicitly calculated. In contrast, in the general case, f 
and Of / O y are assumed to be continuous only in the rectangle R: |t] < a, |y| < b (see 
Figure 2.8.3). Furthermore, the members of the sequence cannot as a rule be explicitly 
determined. The danger is that at some stage, say, for n = k, the graph of y = ó,(t) 
may contain points that lie outside the rectangle R. More precisely, in the computation of 
ox41(t) it would be necessary to evaluate f(t, y) at points where it is not known to be 
continuous or even to exist. Thus the calculation of ,+)(t) might be impossible. 

To avoid this danger, it may be necessary to restrict t to a smaller interval than |t| < a. 
To find such an interval, we make use of the fact that a continuous function on a closed 
bounded region is bounded. Hence f is bounded on R; thus there exists a positive number 
M such that 


If(, y) <M, (t,y)inR. Q3) 
We have mentioned before that 


$54(0) =0 

for each n. Since f(t, $4(t)) is equal to $, 4). the maximum absolute slope of the 
graph of the equation y = $,,,(f) is M. Since this graph contains the point (0, 0), it 
must lie in a bow tie-shaped shaded region as shown in Figure 2.8.4. Hence the point 
(t, Ọr+1(t)) remains in R at least as long as R contains the bow tie-shaped region, which 
is for |t| < b/ M. We hereafter consider only the rectangle D: |t| < h, |y| < b, where h 
is equal either to a or to b/ M, whichever is smaller. With this restriction, all members of 
the sequence {¢,,(t)} exist. Note that whenever b/ M < a, you can try to obtain a larger 
value of h by finding a better (that is, smaller) bound M for | f(t, y)|, if this is possible. 


(-a, —b) (a, -b) 


Region of 
definition for Theorem 2.8.1. 


Bow-tie regions in which successive iterates lie. 
(a) if b/M < a then h = b/ M;(b)ifb/M > athenh =a. 
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2. Does the sequence {@,(t)} converge? 


We can identify @n(t) = $1(1) + (P24) — 610) t ++ + (Palt) — n-1(2)) as 
the n'" partial sum of the series 


$10) Y Pilt) — 6x0). Q4) 


k=1 


The convergence of the sequence {¢,,(t)} is established by showing that the series (24) 
converges. To do this, it is necessary to estimate the magnitude |6;4,(1) — ;(t)| of the 
general term. The argument by which this is done is indicated in Problems 14 through 17 
and will be omitted here. Assuming that the sequence converges, denote the limit function 
by ó, and so 


p(t) = lim ¢,(t). (25) 


3. What are the properties of the limit function ¢ ? 

In the first place, we would like to know that ¢ is continuous. This is not, however, 
a necessary consequence of the convergence of the sequence {¢,(t)}, even though 
each member of the sequence is itself continuous. Sometimes a sequence of continuous 
functions converges to a limit function that is discontinuous. A simple example of this 
phenomenon is given in Problem 11. One way to show that ¢ is continuous is to show 
not only that the sequence {¢,,} converges, but also that it converges in a certain manner, 
known as uniform convergence. We do not take up this matter here, but note only that the 
argument referred to in the discussion of question 2 is sufficient to establish the uniform 
convergence of the sequence {@,,} and, hence, the continuity of the limit function ¢ in the 
interval |t| < h. 

Now let us return to equation (7) 


t 
$541(0) = I f(s, @n(s)) ds. 
0 
Allowing n to approach oo on both sides, we obtain 


p(t) = lim | f(s, bn(s)) ds. (26) 


n—ooJ0 


We would like to interchange the operations of integrating and taking the limit on the 
right-hand side of equation (26) so as to obtain 


E f a Foa Qn 


In general, such an interchange is not permissible (see Problem 12, for example), but once 
again, the fact that the sequence (6,(1)] converges uniformly is sufficient to allow us to 
take the limiting operation inside the integral sign. Next, we wish to take the limit inside 
the function f, which would give 


(t) =f (s lim 2) ds Q8) 
0 n— oo 


and hence 
1 
ét = f fG ocn ds. (29) 
0 
The statement that 


lim f(s, ó4(5)) = rs lim Z2) 


n— oo n—oo 


is equivalent to the statement that f is continuous in its second variable, which is known 
by hypothesis. Hence equation (29) is valid, and the function $ satisfies the integral 
equation (3). Thus y = @(f) is also a solution of the initial value problem (2). 
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4. Are there other solutions of the integral equation (3) besides y = p(t)? 
To show the uniqueness of the solution y = $(t), we can proceed much as in the 
example. First, assume the existence of another solution y = 7(f). It is then possible to 
show (see Problem 18) that the difference (t) — v (t) satisfies the inequality 


ét) - «Ls a f [oy dela: (30) 


for 0 < t < h and a suitable positive number A. From this point the argument is identical 
to that given in the example, and we conclude that there is no solution of the initial value 
problem (2) other than the one generated by the method of successive approximations. 


Problems 


In each of Problems 1 and 2, transform the given initial value problem 
into an equivalent problem with the initial point at the origin. 


1. dy/dt=t?+y?, y()z2 
2. dy/dt=1- y’, y(-1) 23 
In each of Problems 3 through 4, let 99(1) = 0 and define (6,(1)) by 
the method of successive approximations. 
a. Determine 9, (t) for an arbitrary value of n. 
Q b. Plot 9, (t) forn = 1, ... , 4. Observe whether the iterates 
appear to be converging. 
Cc. Express lim ¢,(t) = ġ (t) in terms of elementary 


noo 
functions; that is, solve the given initial value problem. 


O d. Plot |ó(?) — ¢,(t)| for n = 1, ...,4. For each of 
o\(t),...,4(t), estimate the interval in which it is a 
reasonably good approximation to the actual solution. 


O 3. y=%Ay+), x0 =0 
© 4 y'=-y/2+t, x0 =0 
In each of Problems 5 and 6, let g(t) = O and use the method of 
successive approximations to solve the given initial value problem. 
a. Determine $,(f) for an arbitrary value of n. 
Q b. Plot Ọn(t) forn = 1, ... , 4. Observe whether the iterates 
appear to be converging. 
c. Show that the sequence (6, (1) ) converges. 
5. y'=ty+1, y(0)=0 
6. y=fy-t, y(0) =0 
In each of Problems 7 and 8, let g(t) = O and use the method of 
successive approximations to approximate the solution of the given 
initial value problem. 
a. Calculate ;(1), ... , @3(f). 
Q b. Plot Olt), ... ,Q3(t). Observe whether the iterates 
appear to be converging. 
7T. y'= y, yO) =0 
8. y =1-y, y(0) =0 
In each of Problems 9 and 10, let g(t) = 0 and use the method of 
successive approximations to approximate the solution of the given 
initial value problem. 
a. Calculate ¢,(t),...,@4(t), or (if necessary) Taylor 
approximations to these iterates. Keep terms up to order six. 
Q b. Plot the functions you found in part a and observe whether 
they appear to be converging. 


9. y =-—siny+1, y(0)=0 
30? 4t +2 
10. y = — —— —, 0) 20 
j X»-D »(0) 


11. Letó,(x) = x" for0 < x < 1 and show that 
b srs i 


1, xXx. 


lim n(x) em 
noo 
This example shows that a sequence of continuous functions may 
converge to a limit function that is discontinuous. 


12. Consider the sequence 9,(x) = Inre", O<x<l. 
a. Show that lim ¢,(x) =0 for 0 < x < 1; hence 


noo 


1 
n lim ó,(x)dx = 0. 
0 


noo 


1 
b. Show tat f Qnxe™ dx = ] — e”; hence 
0 


1 
im f Ọn(x)dx — 1. 
0 


noo 


Thus, in this example, 


b b 
im f On(x)dx x lim @n(x)dx, 


n— oo a noo 


even though lim ó,(x) exists and is continuous. 
n—oo 


99. 2k 
t 
13. a. Verify that ¢(t) = 1 P" is a solution of the integral 


: k=1 
equation (9). 


b. Verify that ¢ (1) is also a solution of the initial value problem 


(6). 


PO sk 
t ; ; 
C. Use the fact that 5 a e' to evaluate (t) in terms of 
k=0 
elementary functions. 
d. Solve initial value problem (6) as a separable equation. 


e. Solve initial value problem (6) as a first order linear equation. 


In Problems 14 through 17, we indicate how to prove that the sequence 
{@,(t)}, defined by equations (4) through (7), converges. 
90 2k 


PU. ; ; 
r is a solution of the integral 
k=l c 


a. Verify that (ft) = 


equation (9). 
b. Verify that ¢ (1) is also a solution of the initial value problem 


(6). 


O ake 
t i : 
c. Use the fact that 5 m e' to evaluate (f) in terms of 
k=0 
elementary functions. 
d. Solve initial value problem (6) as a separable equation. 


e. Solve initial value problem (6) as a first order linear equation. 


14. If Of/Oy is continuous in the rectangle D, show that there is a 
positive constant K such that 


IFE, y) — fI. y2)| < Klyi — yal, (31) 


where (t, y1) and (f, y2) are any two points in D having the same t 
coordinate. This inequality is known as a Lipschitz?? condition. 
Hint: Hold t fixed and use the mean value theorem on f as a function 
of y only. Choose K to be the maximum value of |Of/ 9 y| in D. 


15. Ifó, j(1) and ¢,,(t) are members of the sequence {¢,,(t) ), use 
the result of Problem 14 to show that 


|f. bn) — F (t, bn-1)| € K [bn — bn. 


16. a. Show that if |t| < h, then 
61€] < Mit], 


where M is chosen so that | f (t, y)| < M for (t, y) in D. 
b. Use the results of Problem 15 and part a of Problem 16 to 
show that 


MK|t? 


ld2o(t) — é61(01 < 2 


c. Show, by mathematical induction, that 


MK""'|t\" MK^-ljgn 
x . 


n! n! 


[PIS xa @n-1(1) | < 


17. Note that 
bnlt) = $1) + (b2(t) — 6100) 4-0 + (bat) — 6410). 


22The German mathematician Rudolf Lipschitz (1832-1903), professor at the 
University of Bonn for many years, worked in several areas of mathematics. 
The inequality (i) can replace the hypothesis that Of/O y is continuous in 
Theorem 2.8.1; this results in a slightly stronger theorem. 
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a. Show that 


lnI € lor. | +1620) — 610) | Hn — 6440). 
b. Use the results of Problem 16 to show that 


M (Kh)? (Kh)" 
eso s K(x 3 Tec m ) 


€. Show that the sum in part b converges as n — oo and, hence, 
the sum in part a also converges as n — oo. Conclude therefore 
that the sequence {@,(t)} converges since it is the sequence of 
partial sums of a convergent infinite series. 


18. Inthis problem we deal with the question of uniqueness of the 
solution of the integral equation (3) 


(t) =) f(s, o(s)) ds. 
0 


a. Suppose that ¢ and v are two solutions of equation (3). Show 
that, for t > 0, 


$) -yt = [ve $(s)) — f Cs, $C5))) ds. 
b. Show that 
lot) — v(G) s nac $(5)) — f(s, YCs))) ds. 
C. Use the result of Problem 14 to show that 
lo(t) -yl «f lọ (s) — v(G)| ds, 


where K is an upper bound for |Of/0y| in D. This is the same 
as equation (30), and the rest of the proof may be constructed as 
indicated in the text. 


29 First-Order Difference Equations 


Although a continuous model leading to a differential equation is reasonable and attractive 
for many problems, there are some cases in which a discrete model may be more natural. 
For instance, the continuous model of compound interest used in Section 2.3 is only an 
approximation to the actual discrete process. Similarly, sometimes population growth may 
be described more accurately by a discrete model than by a continuous model. This is true, for 
example, of species whose generations do not overlap and that propagate at regular intervals, 
such as at particular times of the calendar year. Then the population y„+1 of the species in the 
year n + 1 is some function of n and the population y, in the preceding year; that is, 


Yn+1 = fin, Yn), 


n=0,1,2,... 


(1) 


Equation (1) is called a first-order difference equation. It is first-order because the value 
of y,41 depends on the value of y, but not on earlier values y,.;, y,—2, and so forth. As 
for differential equations, the difference equation (1) is linear if f is a linear function of y,; 
otherwise, it is nonlinear. A solution of the difference equation (1) is a sequence of numbers 


yo: Y1, Y2, 
there may also be an initial condition 


yo = a 


that prescribes the value of the first term of the solution sequence. 


... that satisfy the equation for each n. In addition to the difference equation itself, 


(2) 


We now assume temporarily that the function f in equation (1) depends only on y,, but 


not on n. In this case 


Yn+1 = fy). 


n=0,1,2,.... 


(3) 
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If yo is given, then successive terms of the solution can be found from equation (3). Thus 


yı = f yo), 


and 


y» = f(yD = fCfCGyo). 


The quantity f( f ( yo)) is called the second iterate of the difference equation and is sometimes 
denoted by /?( y). Similarly, the third iterate y; is given by 


ys = f(y) = f(f(fo9)) = POO, 
and so on. In general, the n iterate Yn iS 
Ya = f Yn—1) = f"(yo). 


This procedure is referred to as iterating the difference equation. It is often of primary interest 
to determine the behavior of y, as n — oo. In particular, does y, approach a limit, and if so, 
what is it? 

Solutions for which y, has the same value for all n are called equilibrium solutions. 
They are frequently of special importance, just as in the study of differential equations. If 
equilibrium solutions exist, you can find them by setting y„+ı equal to y, in equation (3) and 
solving the resulting equation 


Yn = fou) (4) 
for yn. 


Linear Equations. Suppose that the population of a certain species in a given region in year 
n + 1, denoted by y,41, is a positive multiple p; of the population y, in year n; that is, 


Yn+1 = Pnyn> n= 0, l; 2, "T (5) 
Note that the reproduction rate p„ may differ from year to year. The difference equation (5) is 
linear and can easily be solved by iteration. We obtain 
yı = Poyo» 
y2 = P1y1 = piPoYo 


and, in general, 


Yn = Pn-1 tt Poyo n—L2,.... (6) 


Thus, if the initial population yo is given, then the population of each succeeding generation 
is determined by equation (6). Although for a population problem p, is intrinsically positive, 
the solution (6) is also valid if p, is negative for some or all values of n. Note, however, that 
if pn is zero for some n, then y,,; and all succeeding values of y are zero; in other words, the 
species has become extinct. 

If the reproduction rate p, has the same value p for each n, then the difference equation 
(5) becomes 


Yn+1 = PYn (7) 


and its solution is 


Yn = p"Yo- (8) 
Equation (7) also has an equilibrium solution, namely, y, = O for all n, corresponding to the 


initial value yy — 0. The limiting behavior of y, is easy to determine from equation (8). In 
fact, 
0, if |p| < 1; 
lim Yn = 4 Yo if p =1; (9) 


n—oo . " 
does not exist, otherwise. 


In other words, the equilibrium solution y, = 0 is asymptotically stable for |p| < 1 and 
unstable for |p| > 1. 


Now we will modify the population model represented by equation (5) to include the 
effect of immigration or emigration. If b, is the net increase in population in year n due to 
immigration, then the population in year n + 1 is the sum of the part of the population resulting 
from natural reproduction and the part due to immigration. Thus 


Yn+1 = PYn + b, n=O0,1,2,..., (10) 
where we are now assuming that the reproduction rate p is constant. We can solve equation (10) 
by iteration in the same manner as before. We have 
yi = pyo + bo, 
y2 = p(pyo + bo) ^ bi = p?^yo + pbo + bi, 
ys = po^ yo + pbo + bi) +b = p? yo + p?bo + phi + bz, 


and so forth. In general, we obtain 
n-l 
Yn = pP” yo + p" bo +--+ + pbn- tbn-1 = p"yo >, p" bj. (11) 
j=0 
Note that the first term on the right-hand side of equation (11) represents the descendants of 
the original population, while the other terms represent the population in year n resulting from 


immigration in all preceding years. 
In the special case where b, = b ¥ 0 for all n, the difference equation is 


Yn41 = PYn + b, (12) 
and from equation (11) its solution is 
Ya = p"yot (lt p p? +--+ p™ yb. (13) 
If p # 1, we can write this solution in the more compact form 
» 1 m p" 
Yn = p yo b, (14) 
1—p 


where again the two terms on the right-hand side are the effects of the original population and 
of immigration, respectively. Rewriting equation (14) as 


A b b 
Yn — p | Yo + (15) 
1—p 1—p 


makes the long-time behavior of y, more evident. It follows from equation (15) that 
Yn > b/(1—p) if |p| < 1.If|p| > lorif p = —1 then y, has no limit unless yo = b/(1—p). 
The quantity b/(1— p), for p ¥ 1, is an equilibrium solution of equation (12), as can readily 
be seen directly from that equation. Of course, equation (14) is not valid for p = 1. To deal 
with that case, we must return to equation (13) and let p — 1 there. It follows that 


Yn = yo + nb, (16) 


so in this case y, becomes unbounded as n — oo. 

The same model also provides a framework for solving many problems of a financial 
character. For such problems, y, is the account balance in the nth time period, p, = 1 + rn, 
where r, is the interest rate for that period, and b, is the amount deposited or withdrawn. The 
following example is typical. 


EXAMPLE 1 


A recent college graduate takes out a $10,000 loan to purchase a car. If the interest rate is 12%, what 
monthly payment is required to pay off the loan in 4 years? 


Solution: 


The relevant difference equation is equation (12), where y, is the loan balance outstanding in the n™ 
month, p = 1 +r, where r is the interest rate per month, and b is the effect of the monthly payment. 
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Note that p = 1.01, corresponding to a monthly interest rate of 1%. Since payments reduce the loan 
balance, b must be negative; the actual payment is |b|. 

The solution of the difference equation (12) with this value for p and the initial condition 
yo — 10,000 is given by equation (15); that is, 


Yn = ( 1.01)” (10,000 + 1005) — 100b. (17) 
The value of b needed to pay off the loan in 4 years is found by setting y4s = 0 and solving for b. 
This gives 
1.01)*8 
100 ( ) — —263.34. (18) 
(1.01)48 — 1 


The total amount paid on the loan is 48 times |b|, or $12,640.32. Of this amount, $10,000 is repayment 
of the principal and the remaining $2640.32 is interest. 


NonlinearEquations. Nonlinear difference equations are much more complicated and have 
much more varied solutions than linear equations. We will restrict our attention to a single 
equation, the logistic difference equation 


Yn 
Yn+1 = Yn (1 me =), (19) 
k 
which is analogous to the logistic differential equation 
dy y 
= 1— =) 20 
= =ry(l- 5 (20) 


that was discussed in Section 2.5. Note that if the derivative dy / dt in equation (20) is replaced 
by the difference quotient (y,4; — Yn) / h, then equation (20) reduces to equation (19) with 
p — l- hr and k = (1+ hr) K / hr. To simplify equation (19) a little more, we can scale the 
variable y, by introducing the new variable un = y,/ k. Then equation (19) becomes 


Unt = pus(l1 — us), (21) 


where p is a positive parameter. 

We begin our investigation of equation (21) by seeking the equilibrium, or constant, 
solutions. These can be found by setting u,, , equal to un in equation (21), which corresponds 
to setting dy /dt equal to zero in equation (20). The resulting equation is 


Un = pug — pur, (22) 
so it follows that the equilibrium solutions of equation (21) are 


U,=0, Un = pe Q3) 
p 

The next question is whether the equilibrium solutions are asymptotically stable or 
unstable. That is, for an initial condition near one of the equilibrium solutions, does the 
resulting solution sequence approach or depart from the equilibrium solution? 

One way to examine this question is by approximating equation (21) by a linear equation 
in the neighborhood of an equilibrium solution. For example, near the equilibrium solution 
Un = 0, the quantity Un is small compared to u, itself, so we assume that we can neglect the 
quadratic term in equation (21) in comparison with the linear terms. This leaves us with the 
linear difference equation 


Un+1 = pUn, (24) 


which is presumably a good approximation to equation (21) for un sufficiently near zero. 
However, equation (24) is the same as equation (7), and we have already concluded, in 
equation (9), that un — Oasn — oo if and only if |p| < 1, or (since p must be positive) 
forO < p < 1. Thus the equilibrium solution u, = 0 is asymptotically stable for the linear 
approximation (24) for this set of p values, so we conclude that itis also asymptotically stable 
for the full nonlinear equation (21). 

The previous conclusion is correct, although our argument is not complete. What is 
lacking is a theorem stating that the solutions of the nonlinear equation (21) resemble those of 


the linear equation (24) near the equilibrium solution u,, = 0. We will not take time to discuss 
this issue here; the same question is treated for differential equations in Section 9.3. 
Now consider the other equilibrium solution u, = (p — 1) /p. To study solutions in the 
neighborhood of this point, we write 
-1 
un = 5 + vas 25) 
p 
where we assume that v, is small. By substituting from equation (25) in equation (21) and 
simplifying the resulting equation, we eventually obtain 


Vn+1 = (2 — p)Vn — pv;. (26) 


Since v, is small, we again neglect the quadratic term in comparison with the linear terms and 
thereby obtain the linear equation 


Vat = (2 — p) vn. (27) 


Referring to equation (9) once more, we find that v, — 0 as n — oo for |2 — p| < 1, orin 
other words for 1 « p < 3. Therefore, we conclude that for this range of values of p, the 
equilibrium solution uv, = (p — 1)/p is asymptotically stable. 

Figure 2.9.1 contains the graphs of solutions of equation (21) for p — 0.8, p — 1.5, 
and p — 2.8, respectively. Observe that the solution converges to zero for p — 0.8 and to 
the nonzero equilibrium solution for p — 1.5 and p — 2.8. The convergence is (eventually) 
monotone for p — 0.8 and for p — 1.5 and is oscillatory for p — 2.8. The graphs shown are 
for particular initial conditions, but the graphs for other initial conditions are similar. 


Un 


0.8 
0.6 
0.4 


0.2 


| FIGURE 2.9.1 | Solutions of u, 4, = pus(1 — un): (a) p = 0.8; (b) p = 1.5; (c) p = 2.8. 


Another way of displaying the solution of a difference equation is shown in Figure 2.9.2. 
In each part of this figure, the graphs of the parabola y — px(1 — x) and of the straight line 
y — x are shown. The equilibrium solutions correspond to the points of intersection of these 
two curves. The piecewise linear graph consisting of successive vertical and horizontal line 
segments, sometimes called a stairstep or cobweb diagram, represents the solution sequence. 
The sequence starts at the point uo on the x-axis. The vertical line segment drawn upward 
to the parabola at uo corresponds to the calculation of puo(1 — ug) = u. This value is then 
transferred from the y-axis to the x-axis; this step is represented by the horizontal line segment 
from the parabola to the line y — x. Then the process is repeated over and over again. Clearly, 
the sequence converges to the origin in Figure 2.9.2a and to the nonzero equilibrium solution 
in the other two cases. 

To summarize our results so far: the difference equation (21) has two equilibrium 
solutions, 4, = 0 and u, = (p — 1)/p; the former is asymptotically stable for 0 < p < 1, 
and the latter is asymptotically stable for 1 « o « 3. When p — 1, the two equilibrium 
solutions coincide at u = 0; this solution can be shown to be asymptotically stable. In 
Figure 2.9.3 the parameter p is plotted on the horizontal axis and u on the vertical axis. The 
equilibrium solutions u — 0 and u — (p — 1)/p are shown. The intervals in which each one 
is asymptotically stable are indicated by the solid portions of the curves. There is an exchange 
of stability from one equilibrium solution to the other at p — 1. 
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0.2 0.4 0.6 0.8 lx 


-— 
-— 
-—— 


Asymptotically stable 


Unstable 


| FIGURE 2.9.3 | Exchange of stability for u, 4 = pu,(1—uy,). 


For p > 3, neither ofthe equilibrium solutions is stable, and the solutions of equation (21) 
exhibit increasing complexity as p increases. For p somewhat greater than 3, the sequence un 
rapidly approaches a steady oscillation of period 2; that is, un oscillates back and forth between 
two distinct values. For p — 3.2, a solution is shown in Figure 2.9.4. For n greater than about 


0.5130 0.7995 


0.2 0.4 0.6 0.8 
(b) 


A solution of u,4; = pun(l — un) for p = 3.2; period 2. 
(a) un versus n; (b) the cobweb diagram shows the iterates are in a two-cycle. 


20, the solution alternates between the values 0.5130 and 0.7995. The graph is drawn for the 
particular initial condition uo = 0.3, but itis similar for all other initial values between 0 and 1. 
Figure 2.9.4b also shows the same steady oscillation as a rectangular path that is traversed 
repeatedly in the clockwise direction. 

At about p = 3.449, each state in the oscillation of period 2 separates into two distinct 
states, and the solution becomes periodic with period 4; see Figure 2.9.5, which shows a 
solution of period 4 for p = 3.5. As p increases further, periodic solutions of period 8, 
16, ... appear. The transition from solutions with one period to solutions with a new period 
that occurs at a certain parameter value is called a bifurcation; the value of the parameter 
where the bifurcation occurs is called a bifurcation value of the parameter. 

The values of p at which the successive period doublings occur approach a limit that is 
approximately 3.57. For p > 3.57, the solutions possess some regularity but no discernible 
detailed pattern for most values of p. For example, a solution for p — 3.65 is shown in Figure 
2.9.6. It oscillates between approximately 0.3 and 0.9, but its fine structure is unpredictable. 
The term chaotic is used to describe this situation. One of the features of chaotic solutions 
is extreme sensitivity to the initial conditions. This is illustrated in Figure 2.9.7, where 
two solutions of equation (21) for p — 3.65 are shown. One solution is the same as that in 
Figure 2.9.6 and has the initial value uo — 0.3, while the other solution has the initial value 
ug = 0.305. For about 15 iterations the two solutions remain close and are hard to distinguish 
from each other in the figure. After that, although they continue to wander about in 
approximately the same set of values, their graphs are quite dissimilar. It would certainly not 
be possible to use one of these solutions to estimate the value of the other for values of n larger 
than about 15. 
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It is only comparatively recently that chaotic solutions of difference and differential 
equations have become widely known. Equation (20) was one of the first instances of 
mathematical chaos to be found and studied in detail, by Robert May”? in 1974. On the basis 


3Robert M. May (1936-) was born in Sydney, Australia, and received his education at the University of Sydney with 
a doctorate in theoretical physics in 1959. His interests soon turned to population dynamics and theoretical ecology; 
the work cited in the text is described in two papers listed in the References at the end of this chapter. He has held 
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(b) 


A solution of u,44 = pus(1 — un) for 


p = 3.5; period 4. (a) u, versus n; (b) the cobweb diagram shows 
the iterates are in a four-cycle. 


Un A 
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A solution of u,,, = pu,(1 — un) for 


p = 3.65; a chaotic solution. 


professorships at Sydney, at Princeton, at Imperial College (London), and (since 1988) at Oxford. 
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Problems 99 


60n 


Stet) 13 PAAA Two solutions of un+1 = pu,(1 — un) for 


p = 3.65; ug = 0.3 and up = 0.305. 


of his analysis of this equation as a model of the population of certain insect species, May 
suggested that if the growth rate p is too large, then it will be impossible to make effective 
long-range predictions about these insect populations. The occurrence of chaotic solutions 
in seemingly simple problems has stimulated an enormous amount of research, but many 
questions remain unanswered. It is increasingly clear, however, that chaotic solutions are much 
more common than was suspected at first and that they may be a part of the investigation of a 


wide range of phenomena. 


Problems 


In each of Problems 1 through 4, solve the given difference equation 
in terms of the initial value yọ. Describe the behavior of the solution 


as n — oo. 
1. Yat = —0.9y, 
[n 3 
2. Yn4l = rea 
3. Yn+1 = (—1)” yn 
4. yu 05t 


5. An investor deposits $1000 in an account paying interest at a 
rate of 8%, compounded monthly, and also makes additional deposits 
of $25 per month. Find the balance in the account after 3 years. 


6. A certain college graduate borrows $8000 to buy a car. The 
lender charges interest at an annual rate of 10%. What monthly 
payment rate is required to pay off the loan in 3 years? Compare your 
result with that of Problem 7 in Section 2.3. 


7. A homebuyer takes out a mortgage of $100,000 with an interest 
rate of 9%. What monthly payment is required to pay off the loan in 
30 years? In 20 years? What is the total amount paid during the term 
of the loan in each of these cases? 


8. Ifthe interest rate on a 20-year mortgage is fixed at 1076 and 
if a monthly payment of $1000 is the maximum that the buyer can 
afford, what is the maximum mortgage loan that can be made under 
these conditions? 


9. A homebuyer wishes to finance the purchase with a $95,000 
mortgage with a 20-year term. What is the maximum interest rate the 
buyer can afford if the monthly payment is not to exceed $900? 


The Logistic Difference Equation. Problems 10 through 15 deal with 
the difference equation (21), un} = pus(1— un). 


10. Carry out the details in the linear stability analysis of the 
equilibrium solution u, — (p — 1)/p. That is, derive the difference 
equation (26) in the text for the perturbation vp. 


11. Qa. For p = 3.2, plot or calculate the solution of the logistic 
equation (21) for several initial conditions, say, uy = 0.2, 0.4, 
0.6, and 0.8. Observe that in each case the solution approaches a 
steady oscillation between the same two values. This illustrates 
that the long-term behavior of the solution is independent of the 
initial conditions. 
@ b. Make similar calculations and verify that the nature of the 
solution for large n is independent of the initial condition for 
other values of p, such as 2.6, 2.8, and 3.4. 


12. Assume that p > 1 in equation (21). 


Q a. Draw a qualitatively correct stairstep diagram and thereby 
show that if ug < 0, then u, — —coasn — oo. 

Q b. In a similar way, determine what happens as n. — oo if 
up l1 
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13. The solutions of equation (21) change from convergent 
sequences to periodic oscillations of period 2 as the parameter p passes 
through the value 3. To see more clearly how this happens, carry out 
the following calculations. 
@ a. Plot or calculate the solution for p = 2.9, 2.95, and 
2.99, respectively, using an initial value u of your choice in the 
interval (0, 1). In each case estimate how many iterations are 
required for the solution to get “very close" to the limiting value. 
Use any convenient interpretation of what “very close" means in 
the preceding sentence. 
(9 b. Plot or calculate the solution for p = 3.01, 3.05, and 
3.1, respectively, using the same initial condition as in part a. 
In each case estimate how many iterations are needed to reach a 
steady-state oscillation. Also find or estimate the two values in 
the steady-state oscillation. 


@ 14. By calculating or plotting the solution of equation (21) for 
different values of p, estimate the value of p at which the solution 
changes from an oscillation of period 2 to one of period 4. In the same 
way, estimate the value of p at which the solution changes from period 
4 to period 8. 


@ 15. Let p, be the value of p at which the solution of 
equation (21) changes from period 2*-! to period 2*. Thus, as noted 
in the text, o; = 3, p2 = 3.449, and p3 = 3.544. 
a. Using these values of p1, p2, and p3, or those you found in 
Problem 14, calculate (p2 — p1)/(pa — p2). 


Chapter Review Problems 


Miscellaneous Problems. One of the difficulties in solving first- 
order differential equations is that there are several methods of 
solution, each of which can be used on a certain type of equation. 
It may take some time to become proficient in matching solution 
methods with equations. The first 24 of the following problems are 
presented to give you some practice in identifying the method or 
methods applicable to a given equation. The remaining problems 
involve certain types of equations that can be solved by specialized 
methods. 


Ineach of Problems 1 through 24, solve the given differential equation. 
If an initial condition is given, also find the solution that satisfies it. 


1. dy x!-2y 
dx x 

2 dy _ 1+cosx 

“dx 2—siny 
dy 2x +y 

3. ————— 0) =0 
d 34-a 78 
d 

à 24 6x + y — 2xy 
dx 

god. 2xy - y?1 

` dx x? +2xy 
d 

6. x2 +xy=1-y, y(1) =0 
dx 
d , 

7. x my P y= 
dx x 


dy | 2xycl 
dx — x? + 2y 


d 
9, (x2y Txy—-y)-ct (x2y — 267 =0 


b. Let 6, = (Pn — Pn—1)/(Pn41 — Pn). It can be shown that 6, 
approaches a limit 6 as n — oo, where 6 = 4.6692 is known as 
the Feigenbaum* number. Determine the percentage difference 
between the limiting value 6 and 65, as calculated in part a. 

C. Assume that 63; = 6 and use this relation to estimate p4, the 
value of p at which solutions of period 16 appear. 

O d. By plotting or calculating solutions near the value of p4 
found in part c, try to detect the appearance of a period 16 
solution. 

e. Observe that 


Pn = P1 Coa — pi) + Cpa — pa) t t Con — Pn-1)- 
Assuming that 


D4— P3=(p3—p2)6', ps—pa-(ps— p36 7, 


and so forth, express p, as a geometric sum. Then find the limit 
Pn as n — co. This is an estimate of the value of p at which the 
onset of chaos occurs in the solution of the logistic equation (21). 


This result for the logistic difference equation was discovered in August 
1975 by Mitchell Feigenbaum (1944—-), while he was working at the Los 
Alamos National Laboratory. Within a few weeks he had established that the 
same limiting value also appears in a large class of period-doubling difference 
equations. Feigenbaum, who has a doctorate in physics from M.LT., is now at 
Rockefeller University. 


d 
10. (x2+y) race) =0 
dx 


d 
11. (x+y) + (x + 2y) > =0, 2-3 
d 
12. (e*4+1) aa y — ye 
dx 
B dy | e *cosy— e” cos x 
“dx /— —e™ sin y + 2e2” sinx 
d 
14. 2 =e% 43y 
dx 
d 
15. Z 42y =e-*, y(0) =3 
dx 
dy 3x? —2y-— y? 
16. Y x gem 
dx 2x + 3xy? 
17. ye 
18 dy | 2y?+6xy-4 _ 
"o dx 3x2 4xy + 3y? 
dy 
19. t= 4+(t+Dy = e” 
di +(t+1)y=e 
20. xy =y4+xe/* 
d 
21. i. NM E Hint: Let u — x?. 
dx x?y + y? 
d 
pm, .rtty 
dx x-y 


d 
23. (3y? + 2xy) — (2xy Ban =0 
A. 


24. xy’+y—yre*=0, y(1)22 


25. Riccati Equations. The equation 


dy 2 

q^ GM Pe UD qub 

is known as a Riccati?? equation. Suppose that some particular solution 
y, of this equation is known. A more general solution containing one 
arbitrary constant can be obtained through the substitution 


yy + — : 
Show that v(t) satisfies the first-order linear equation 
dv 
2 —(q2 + 2q3y) v — qa. 


Note that v(t) will contain a single arbitrary constant. 


26. Verify thatthe given function is a particular solution of the given 
Riccati equation. Then use the method of Problem 25 to solve the 
following Riccati equations: 


a. y —1-412—2ty +y; y(t) «t 
1 y 1 
b. y =--> 2. t)-2- 
y zur HY yl) - i 
d 2 t — sin t +y? 
c. edid SEIEN >; y(t) —sint 
dt 2cost 


27. The propagation of a single action in a large population (for 
example, drivers turning on headlights at sunset) often depends 
partly on external circumstances (gathering darkness) and partly on 
a tendency to imitate others who have already performed the action 
in question. In this case the proportion y(t) of people who have 
performed the action can be described?Ó by the equation 


dy/dt = (1 — y)(x(t) + by), (28) 


where x(t) measures the external stimulus and b is the imitation 
coefficient. 
a. Observe that equation (28) is a Riccati equation and that 
yı(t) = 1 is one solution. Use the transformation suggested in 
Problem 25, and find the linear equation satisfied by v(t). 
b. Find v(t) in the case that x(t) = at, where a is a constant. 
Leave your answer in the form of an integral. 


5Riccati equations are named for Jacopo Francesco Riccati (1676-1754), a 
Venetian nobleman, who declined university appointments in Italy, Austria, 
and Russia to pursue his mathematical studies privately at home. Riccati 
studied these equations extensively; however, it was Euler (in 1760) who 
discovered the result stated in this problem. 

?6See Anatol Rapoport, “Contribution to the Mathematical Theory of Mass 
Behavior: I. The Propagation of Single Acts,” Bulletin of Mathematical 
Biophysics 14 (1952), pp. 159-169, and John Z. Hearon, “Note on the Theory 
of Mass Behavior,” Bulletin of Mathematical Biophysics 17 (1955), pp. 7-13. 
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Some Special Second-Order Differential Equations. Second-order 
differential equations involve the second derivative of the unknown 
function and have the general form y" = f(t, y, y'). Usually, 
such equations cannot be solved by methods designed for first-order 
equations. However, there are two types of second-order equations 
that can be transformed into first-order equations by a suitable change 
of variable. The resulting equation can sometimes be solved by the 
methods presented in this chapter. Problems 28 through 37 deal with 
these types of equations. 


Equations with the Dependent Variable Missing. For a second- 
order differential equation of the form y" = f(t, y’), the substitution 
v = y’, v' = y" leads to a first-order differential equation of the 
form v’ = f(t, v). If this equation can be solved for v, then y can be 
obtained by integrating dy /dt = v. Note that one arbitrary constant 
is obtained in solving the first-order equation for v, and a second is 
introduced in the integration for y. In each of Problems 28 through 
3], use this substitution to solve the given equation. 


28. ?y'-2ty -120, t>0 
29. ty"+y'=1, t>0 

30. y’ +t(y’)? =0 

31. 2y" c (y)? =2ty’, t»0 


Equations with the Independent Variable Missing. Consider 
second-order differential equations of the form y”= f(y, y’), in 
which the independent variable t does not appear explicitly. If 
we let v 2 y', then we obtain dv/dt — f(y, v). Since the right- 
hand side of this equation depends on y and v, rather than on 
t and v, this equation contains too many variables. However, if 
we think of y as the independent variable, then by the chain 
rule, dv/dt =(dv/dy)(dy/dt) =v(dv/dy). Hence the original 
differential equation can be written as v(dv/dy) = f(y, v). Provided 
that this first-order equation can be solved, we obtain v as a function 
of y. A relation between y and f results from solving dy/dt — v( y), 
which is a separable equation. Again, there are two arbitrary constants 
in the final result. In each of Problems 32 through 35, use this method 
to solve the given differential equation. 


32. yy" +y) =0 

33. y”"+y=0 

34. yy" —(y')? =0 

35. y” +y)? = 2e 

Hint: In Problem 35 the transformed equation is a Bernoulli equation. 
See Problem 23 in Section 2.4. 

In each of Problems 36 through 37, solve the given initial value 
problem using the methods of Problems 28 through 35. 

36. yy”=2, y(0)=1, y(0 =2 

37. (124 15y" -2ty' +312 20, y(1)=2, wd) 2-1 
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CHAPTER 3 


Second-Order Linear 
Differential Equations 


Linear differential equations of second order are of crucial importance in the study of 
differential equations for two main reasons. The first is that linear equations have a rich 
theoretical structure that underlies a number of systematic methods of solution. Further, 
a substantial portion of this structure and of these methods is understandable at a fairly 
elementary mathematical level. In order to present the key ideas in the simplest possible 
context, we describe them in this chapter for second-order equations. The second reason 
to study second-order linear differential equations is that they are vital to any serious 
investigation of the classical areas of mathematical physics. One cannot go very far in 
the development of fluid mechanics, heat conduction, wave motion, or electromagnetic 
phenomena without finding it necessary to solve second-order linear differential equations. 
We illustrate this at the end of this chapter with a discussion of the oscillations of some basic 
mechanical and electrical systems. 


3.1 Homogeneous Differential Equations 
with Constant Coefficients 


Many second-order ordinary differential equations have the form 
d^y dy 
Uus 7 t, 27, |}? 1 
dü f ( » (1) 


where f is some given function. Usually, we will denote the independent variable by ¢ since 
time is often the independent variable in physical problems, but sometimes we will use x 
instead. We will use y, or occasionally some other letter, to designate the dependent variable. 
Equation (1) is said to be linear if the function f has the form 


(: 2) = Bp C. sat) 2 
f D r =g( PO 3, 710 (2) 


that is, if f is linear in y and dy/dt. In equation (2) g, p, and q are specified functions of the 
independent variable t but do not depend on y. In this case we usually rewrite equation (1) as 


y" + p(t)y’+q(t)y = g(t), (3) 


where the primes denote differentiation with respect to t. Instead of equation (3), we sometimes 
see the equation 


P(t) y" + Q()y' + R(Q)y = G(t). (4) 
Of course, if P(t) + 0, we can divide equation (4) by P(t) and thereby obtain equation (3) 
with 
| a) 
~ P(t)’ 


| R(t) 
~ P(t)’ 


| Git) 


p(t) = PO ` 


q(t) g(t) (5) 
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In discussing equation (3) and in trying to solve it, we will restrict ourselves to intervals in 
which p, q, and g are continuous functions.! 

If equation (1) is not of the form (3) or (4), then it is called nonlinear. Analytical 
investigations of nonlinear equations are relatively difficult, so we will have little to say about 
them in this book. Numerical or geometical approaches are often more appropriate, and these 
are discussed in Chapters 8 and 9. 

An initial value problem consists of a differential equation such as equations (1), (3), or 
(4) together with a pair of initial conditions 


y(t) = yo, y'(to) = yo (6) 


where yo and y, are given numbers prescribing values for y and y’ at the initial point fo. 
Observe that the initial conditions for a second-order differential equation identify not only a 
particular point (to, yo) through which the graph of the solution must pass, but also the slope 
yo of the graph at that point. It is reasonable to expect that two initial conditions are needed for 
a second-order differential equation because, roughly speaking, two integrations are required 
to find a solution and each integration introduces an arbitrary constant. Presumably, two initial 
conditions will suffice to determine values for these two constants. 

A second-order linear differential equation is said to be homogeneous if the term g(t) in 
equation (3), or the term G(t) in equation (4), is zero for all t. Otherwise, the equation is called 
nonhomogeneous. Alternatively, the nonhomogeneous term g(t), or G(t), is sometimes 
called the forcing function because in many applications it describes an externally applied 
force. We begin our discussion with homogeneous equations, which we will write in the form 


P(t)y” + Q(t)y' + R(t)y =0. (7) 


Later, in Sections 3.5 and 3.6, we will show that once the homogeneous equation has been 
solved, it is always possible to solve the corresponding nonhomogeneous equation (4), or 
at least to express the solution in terms of an integral. Thus the problem of solving the 
homogeneous equation is the more fundamental one. 

In this chapter we will concentrate our attention on equations in which the functions P, 
Q, and R are constants. In this case, equation (7) becomes 


ay" +by'+cy =0, (8) 


where a, b, and c are given constants. It turns out that equation (8) can always be solved easily 
in terms of the elementary functions of calculus. On the other hand, it is usually much more 
difficult to solve equation (7) if the coefficients are not constants, and a treatment of that case 
is deferred until Chapter 5. Before taking up equation (8), let us first gain some experience by 
looking at a simple example that in many ways is typical. 


EXAMPLE 1 


Solve the equation 


y —y=0. (9) 
Also find the solution that satisfies the initial conditions 


y(0 2, y'(0)=-l1. (10) 


Solution: 


Observe that equation (9) is just equation (8) with a = 1, b = 0, and c = —1. In words, 
equation (9) says that we seek a function with the property that the second derivative of the function 
is the same as the function itself. Do any of the functions that you studied in calculus have this 
property? A little thought will probably produce at least one such function, namely, y(t) = e’, the 
exponential function. A little more thought may also produce a second function, y(t) = e^'. Some 
further experimentation reveals that constant multiples of these two solutions are also solutions. 


l There is a corresponding treatment of higher-order linear equations in Chapter 4. If you wish, you may read the 
appropriate parts of Chapter 4 in parallel with Chapter 3. 


3.1 Homogeneous Differential Equations with Constant Coefficients 


For example, the functions 2e! and 5e~ also satisfy equation (9), as you can verify by calculating 
their second derivatives. In the same way, the functions c; y;(t) = c;e' and cy y2(t) = ce satisfy 
the differential equation (9) for all values of the constants c, and c2. 

Next, it is vital to notice that the sum of any two solutions of equation (9) is also a solution. 
In particular, since c; yj(t) and cz y(t) are solutions of equation (9) for any values of c, and c», so 
is the function 

Y = oayi(t) + c2y2(t) = ciel +e". (11) 
Again, this can be verified by calculating the second derivative y” from equation (11). We have 
y'= qe — coe! and y" = cje! + cpe~'; thus y" is the same as y, and equation (9) is satisfied. 

Let us summarize what we have done so far in this example. Once we notice that the 
functions y|(t) = e' and y;(t) = e™ are solutions of equation (9), it follows that the general 
linear combination (11) of these functions is also a solution. Since the coefficients c, and c; in 
equation (11) are arbitrary, this expression represents an infinite two-parameter family of solutions 
of the differential equation (9). 

We now turn to the task of picking out a particular member of this infinite family of solutions 
that also satisfies the given pair of initial conditions (10). In other words, we seek the solution that 
passes through the point (0, 2) and at that point has the slope — 1. First, to ensure the solution passes 
through the point (0, 2), we set t = 0 and y = 2 in equation (11); this gives the equation 


cy tc. —2. (12) 
Next, we differentiate equation (11) with the result that 
y =e — caet. (13) 
Then, to enforce the condition that the slope at (0, 2) is — 1, we set t = 0 and y’ = —1 in equation 
(13); this yields the equation 
cj — c? — l. (14) 
By solving equations (12) and (14) simultaneously for c, and c», we find that 
1 3 
= y. 2S7 (45) 
Finally, inserting these values in equation (11), we obtain 
1 3 
ya ge se, (16) 


the solution of the initial value problem consisting of the differential equation (9) and the initial 
conditions (10). 


What conclusions can we draw from the preceding example that will help us to deal with 

the more general equation (8), 
ay" + by! - cy =0, 

whose coefficients a, b, and c are arbitrary (real) constants? In the first place, in the example 
the solutions were exponential functions. Further, once we had identified two solutions, we 
were able to use a linear combination of them to satisfy the given initial conditions as well as 
the differential equation itself. 

It turns out that by exploiting these two ideas, we can solve equation (8) for any values of 
its coefficients and also satisfy any given set of initial conditions for y and y’. 

We start by seeking exponential solutions of the form y = e", where r is a parameter to be 
determined. Then it follows that y’ = re” and y" = r?e"'. By substituting these expressions 
for y, y’, and y" in equation (8), we obtain 


(ar? + br +c)e™ — 0. 
Since e" + 0, this condition is satisfied only when the other factor is zero: 
ar? +br+c=0. (17) 


Equation (17) is called the characteristic equation for the differential equation (8). Its 
significance lies in the fact that if r is a root of the polynomial equation (17), then y = e” 
is a solution of the differential equation (8). Since equation (17) is a quadratic equation with 
real coefficients, it has two roots, which may be real and different, complex conjugates, or real 
but repeated. We consider the first case here and the latter two cases in Sections 3.3 and 3.4, 
respectively. 
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Assuming that the roots of the characteristic equation (17) are real and different, let them 
be denoted by rı and rz, where rı Æ r2. Then y;(t) = e" and y(t) = e” are two solutions 
of equation (8). Just as in Example 1, it now follows that 


y = cyi(t) + eoyx(f) = cie! + coe! (18) 


is also a solution of equation (8). To verify that this is so, we can differentiate the expression 
in equation (18); hence 


y 2 qre + corre” (19) 
and 
y 2n 
y" = cre" + core". (20) 


Substituting these expressions for y, y’, and y” in equation (8) and rearranging terms, we obtain 
ay" + by’ +cy=c, (ar? + bry + c) e"! ^F cs (ar; + bra + c) et, (21) 


The fact that r; is a root of equation (17) means that ar; + br, + c = 0. Since rp is also a root 


of the characteristic equation (17), it follows that ar; + br; + c = Q as well. This completes 
the verification that y as given by equation (18) is indeed a solution of equation (8). 

Now suppose that we want to find the particular member of the family of solutions (18) 
that satisfies the initial conditions (6) 


y(t.) = yo, y'(to) = yo 
By substituting t = tọ and y = yo in equation (18), we obtain 
cien” + coe? = yo. (22) 
Similarly, setting t = tọ and y’ = y, in equation (19) gives 
carye” + corre” = yo. (23) 
On solving equations (22) and (23) simultaneously for cı and c2, we find that 


/ 
fi 
yori Y0 i-r. 


/ 
— yor 
Yo — Yo 2 oor, 


ry — n P — FP 


C; = (24) 
Since the roots of the characteristic equation (17) are assumed to be different, rı —r? 4 0 so that 
the expressions in equation (24) always make sense. Thus, no matter what initial conditions 
are assigned— that is, regardless of the values of tọ, yo, and y, in equations (6)— it is always 
possible to determine c, and c» so that the initial conditions are satisfied. Moreover, there is 
only one possible choice of c, and c» for each set of initial conditions. With the values of c; 
and c» given by equation (24), the expression (18) is the solution of the initial value problem 


ay" +by’+cy=0, y(to = yo, y'(to) = yo. (25) 


It is possible to show, on the basis of the fundamental theorem cited in the next section, 
that all solutions of equation (8) are included in the expression (18), at least for the case in 
which the roots of equation (17) are real and different. Therefore, we call equation (18) the 
general solution of equation (8). The fact that any possible initial conditions can be satisfied 
by the proper choice of the constants in equation (18) makes more plausible the idea that this 
expression does include all solutions of equation (8). 

Let us now look at some further examples. 


EXAMPLE 2 


Find the general solution of 


y" + 5y' + 6y = 0. (26) 


3.1 Homogeneous Differential Equations with Constant Coefficients 


Solution: 


We assume that y = e”, and it then follows that r must be a root of the characteristic equation 
r?+5r+6=(r+2)(r+3) =0. 


Thus the possible values of r are r; = —2 and rz = —3; the general solution of equation (26) is 


yc? ce *. (27) 


EXAMPLE 3 


Find the solution of the initial value problem 


y'45y -6y 20, y(0) =2, y'(0) =3. (28) 


Solution: 


The general solution of the differential equation was found in Example 2 and is given by 
equation (27). To satisfy the first initial condition, we set t = 0 and y = 2 in equation (27); thus c, 
and c; must satisfy 


cy tc) —2. (29) 


To use the second initial condition, we must first differentiate equation (27). This gives 


y! 2 —2c,e 7” — 3c; ?'. Then, setting t = 0 and y’ = 3, we obtain 

—2c; — 3e; = 3. (30) 
By solving equations (29) and (30), we find that c; — 9 and c; — —7. Using these values in the 
expression (27), we obtain the solution 

y = 9e” — Te! (31) 


of the initial value problem (28). The graph of the solution is shown in Figure 3.1.1. 


0.5 1 1.5 2 


Solution of the initial value problem (28): 
y" +5y'+6y =0, y(0) = 2, y'(0) =3. 


EXAMPLE 4 


Find the solution of the initial value problem 


1 
4y"—8y'+3y=0, y(0) 22, Y( = 7. (32) 


Solution: 


If y = e” , then we obtain the characteristic equation 


4r! — 8r +3 =0 
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3 1 
whose roots are r = z andr = 5 Therefore, the general solution of the differential equation is 


y= ce + coe!!?, (33) 


Applying the initial conditions, we obtain the following two equations for c, and c»: 


+ 2 z + : : 
=2, acd... 
C; t C2 2 1 2€ 2 
. . . 1 5 í v 
The solution of these equations is c; — — y C2 = =, 50 the solution of the initial value problem (32) 
is 
l 32 9 2 
= —-e Tze'*. 34 
y 2 2 (34) 


1 5 
y=- letl 4 Set? 


2 


Solution of the initial value problem (32): 
4y" — 8y' --3y = 0, y(0) 22, y'(0) = 1/2. 


EXAMPLE 5 


The solution (31) of the initial value problem (28) initially increases (because its initial slope 
is positive), but eventually approaches zero (because both terms involve negative exponential 
functions). Therefore, the solution must have a maximum point, and the graph in Figure 3.1.1 
confirms this. Determine the location of this maximum point. 


Solution: 


The coordinates of the maximum point can be estimated from the graph, but to find them more 
precisely, we seek the point where the solution has a horizontal tangent line. By differentiating the 
solution (31), y = 9e?! — 7e, with respect to t, we obtain 


y! = —18e^? + 21e. (35) 


Setting y’ equal to zero and multiplying by &?', we find that the critical value t, satisfies e! = 7/6; 
hence 


tm = 1n(7/6) = 0.15415. (36) 


The corresponding maximum value ym is given by 


5 —2tm —3tm _ 108 ~ 
Ym = 9673 — en — = 2.20408. (37) 


In this example the initial slope is 3, but the solution of the given differential equation behaves 
in a similar way for any other positive initial slope. In Problem 19 you are asked to determine how 
the coordinates of the maximum point depend on the initial slope. 


3.1 Homogeneous Differential Equations with Constant Coefficients 


Returning to the equation ay” + by’ + cy = O0 with arbitrary coefficients, recall 
that when rı Æ 7r», its general solution (18) is the sum of two exponential functions. 
Therefore, the solution has a relatively simple geometrical behavior: as f increases, the 
magnitude of the solution either tends to zero (when both exponents are negative) or else 
exhibits unbounded growth (when at least one exponent is positive). These two cases 
are illustrated by the solutions of Examples 3 and 4, which are shown in Figures 3.1.1 
and 3.1.2, respectively. Note that whether a growing solution approaches +00 or —oo as 
t — oo is determined by the sign of the coefficient of the exponential for the larger root 
of the characteristic equation. (See Problem 21.) There is also a third case that occurs 
less often: the solution approaches a constant when one exponent is zero and the other is 
negative. 

In Sections 3.3 and 3.4, respectively, we return to the problem of solving the equation 
ay" +by'+cy = 0 when the roots of the characteristic equation either are complex conjugates 
or are real and equal. In the meantime, in Section 3.2, we provide a systematic discussion 
of the mathematical structure of the solutions of all second-order linear homogeneous 
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equations. 


Problems 


In each of Problems 1 through 6, find the general solution of the given 
differential equation. 


1. y'42y —3y-0 
2. y'3y 42y 20 
3. 6y”—y’-y=0 
4. y"’+5y'=0 
5. 4y"—9y 20 
6. y/"—2y'—2y 20 


In each of Problems 7 through 12, find the solution of the given 
initial value problem. Sketch the graph of the solution and describe 
its behavior as ¢ increases. 


@ 7. y"+y'-2y=0, (0) =1, y(0-21 

Q 8. y’+4y'+3y=0, y(0) =2, y(0 2-1 

@ 9. y’+3y'=0, y(0) =-2, y(0) =3 

Q 10. 2,"^-y—4y20, y(0-20, y(0 =1 

O 11. y"+8y'-9y=0, y(1)=1, y(1)=0 

@ 12. 4y’-y=0, y(-2)=1, y(-2) =-1 

13. Find a differential equation whose general solution is 

y = cie? +ee. 

© 14. Find the solution of the initial value problem 
y’-y=0, y(0) = >, y(0) = - 

Plot the solution for 0 < t < 2 and determine its minimum value. 


15. Find the solution of the initial value problem 
H $ 1 1 
2y —3y +y=0, y(0) =2, y0) =z: 


Then determine the maximum value of the solution and also find the 
point where the solution is zero. 


16. Solve the initial value problem y” — y’ — 2y = 0, y(0) = q, 
y'(0) = 2. Then find a so that the solution approaches zero as 
t — oo. 


In each of Problems 17 and 18, determine the values of a, if any, for 
which all solutions tend to zero as  — oo; also determine the values 
of a, if any, for which all (nonzero) solutions become unbounded as 
t — oo. 


17. y'— (2a — Dy'-a(a — 1D)y 20 

18. y"-(3—o)y'-2(a —1)y «0 

19. Consider the initial value problem (see Example 5) 
y'45y -6y 20, y(0) =2, y(0) =, 


where B > 0. 
a. Solve the initial value problem. 
b. Determine the coordinates 7,, and ym of the maximum point 
of the solution as functions of 5. 
c. Determine the smallest value of 3 for which ym > 4. 
d. Determine the behavior of tm and ym as 8 — oc. 


20. Consider the equation ay" + by’ + cy = d, where a, b, c, and d 
are constants. 
a. Findall equilibrium, or constant, solutions of this differential 
equation. 
b. Let ye denote an equilibrium solution, and let Y = y — ye. 
Thus Y is the deviation of a solution y from an equilibrium 
solution. Find the differential equation satisfied by Y. 


21. Consider the equation ay" + by’ + cy = 0, where a, b, and c 
are constants with a > 0. Find conditions on a, b, and c such that the 
roots of the characteristic equation are: 

a. real, different, and negative. 

b. real with opposite signs. 

C. real, different, and positive. 
In each case, determine the behavior of the solution as t — oo. 
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32. Solutions of Linear Homogeneous 
Equations; the Wronskian 


In the preceding section we showed how to solve some differential equations of the form 
ay" + by! cy =0, 


where a, b, and c are constants. Now we build on those results to provide a clearer picture of 
the structure of the solutions of all second-order linear homogeneous equations. In turn, this 
understanding will assist us in finding the solutions of other problems that we will encounter 
later. 

To discuss general properties of linear differential equations, it is helpful to introduce 
a differential operator notation. Let p and q be continuous functions on an open interval 
I —that is, fora < t < 0. The cases for a = —oo, or 3 = co, or both, are included. Then, 
for any function $ that is twice differentiable on /, we define the differential operator L by 
the equation 


L[ó] = 9" + pó' -- 46. (1) 


It is important to understand that the result of applying the operator L to a function $ is another 
function, which we refer to as L[@]. The value of L[@] at a point t is 


LIPIH = ó"(1) + PODEA + q(0)90(). 
For example, if p(t) = t°, q(t) = 1 +t, and ¢(t) = sin3r, then 
L[ó (t) = (sin3t)” + 17(sin 32)’ + (1 +t) sin3t 
= —9 sin 3t + 3t’ cos 3t + (1 f) sin3t. 


The operator L is often written as L = D? + pD +q, where D is the derivative operator, that 
is, D[ó] = ¢’. 

In this section we study the second-order linear homogeneous differential equation 
L[¢](t) = 0. Since it is customary to use the symbol y to denote @(t), we will usually 
write this equation in the form 


L[y] 2 y" + p(t)y’ + q(t)y =0. (2) 
With equation (2) we associate a set of initial conditions 
y(t.) = yo. y'(to) = Yo (3) 


where f is any point in the interval 7, and yo and y, are given real numbers. We would like 
to know whether the initial value problem (2), (3) always has a solution, and whether it may 
have more than one solution. We would also like to know whether anything can be said about 
the form and structure of solutions that might be helpful in finding solutions of particular 
problems. Answers to these questions are contained in the theorems in this section. 

The fundamental theoretical result for initial value problems for second-order linear 
equations is stated in Theorem 3.2.1, which is analogous to Theorem 2.4.1 for first-order linear 
equations. The result applies equally well to nonhomogeneous equations, so the theorem is 
stated in that form. 


Theorem3.2.1 | (Existence and Uniqueness Theorem) 


Consider the initial value problem 
y" + p(t)y' -q(0y = a(t),  y(to) = yo. y'(to) = Yo» (4) 


where p, q, and g are continuous on an open interval 7 that contains the point tọ. This problem has 
exactly one solution y = ¢ (t), and the solution exists throughout the interval 7. 
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We emphasize that the theorem says three things: 


1. The initial value problem das a solution; in other words, a solution exists. 
2. The initial value problem has only one solution; that is, the solution is unique. 


3. The solution $ is defined throughout the interval I where the coefficients are continuous 
and is at least twice differentiable there. 


For some problems some of these assertions are easy to prove. For instance, we found in 
Example 1 of Section 3.1 that the initial value problem 


y’-y=0, y(0)=2, y(0 —-1 (5) 
has the solution 
y= 15 + 3 ct. (6) 
2 2 


The fact that we found a solution certainly establishes that a solution exists for this initial value 
problem. Further, the solution (6) is twice differentiable, indeed differentiable any number of 
times, throughout the interval ( —oo, oo) where the coefficients in the differential equation are 
continuous. On the other hand, it is not obvious, and is more difficult to show, that the initial 
value problem (5) has no solutions other than the one given by equation (6). Nevertheless, 
Theorem 3.2.1 states that this solution is indeed the only solution of the initial value 
problem (5). 

For most problems of the form (4), it is not possible to write down a useful expression for 
the solution. This is a major difference between first-order and second-order linear differential 
equations. Therefore, all parts of the theorem must be proved by general methods that do not 
involve having such an expression. The proof of Theorem 3.2.1 is fairly difficult, and we do not 
discuss it here.2 We will, however, accept Theorem 3.2.1 as true and make use of it whenever 
necessary. 


EXAMPLE 1 


Find the longest interval in which the solution of the initial value problem 
(à —30y" E ty' -—(t+3)y=0, yO) 22, y (1 21 


is certain to exist. 


Solution: 


If the given differential equation is written in the form of equation (4), then 
1 t+3 
tj) =~, t) = -———, 
p(t) 5 q(t) Ht —3) 
The only points of discontinuity of the coefficients are t = 0 and t = 3. Therefore, the longest open 


interval, containing the initial point t = 1, in which all the coefficients are continuous is 0 < t < 3. 
Thus this is the longest interval in which Theorem 3.2.1 guarantees that the solution exists. 


and g(t) =0. 


EXAMPLE 2 


Find the unique solution of the initial value problem 
y" * p()y' +q(thy =0, y(t) =0, y(t) = 0, 


where p and q are continuous in an open interval / containing fo. 


Solution: 


The function y = $(f) = O for all t in Z certainly satisfies the differential equation and initial 
conditions. By the uniqueness part of Theorem 3.2.1, it is the only solution of the given problem. 


2A proof of Theorem 3.2.1 can be found, for example, in Chapter 6, Section 8 of the book by Coddington listed in 
the references at the end of this chapter. 
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Let us now assume that y; and y are two solutions of equation (2); in other words, 
L|yil = Xr + py; + qvi — 0, 


and similarly for y2. Then, just as in the examples in Section 3.1, we can generate more 
solutions by forming linear combinations of y, and y2. We state this result as a theorem. 


Theorem3.2.2 | (Principle of Superposition) 


If y, and y» are two solutions of the differential equation (2), 
Liy] =y" + p(t)y’ +4(t)y — 0, 


then the linear combination c; y; + czy is also a solution for any values of the constants c, and c». 


A special case of Theorem 3.2.2 occurs if either c, or c? is zero. Then we conclude that 
any constant multiple of a solution of equation (2) is also a solution. 
To prove Theorem 3.2.2, we need only substitute 


y =ciyi(t) + coy2(t) (7) 
for y in equation (2). By calculating the indicated derivatives and rearranging terms, we 
obtain 
[ciy1 + c2y2]" + pIeiyi + coy) + gleiy  eoyl 
= Cyr + cry; + c1 py, + copy, + eqyi + eaqy» 
erly! + py, + ayil + cly} + py) + qy] 
= cıL[yı] + c2L[y2]. 


L[ciy1 + c2y2] 


Since L[y;] = 0 and L[y2] = 0, it follows that L[c; y; +c2y2] = 0 also. Therefore, regardless 
of the values of c, and c», the function y as given by equation (7) satisfies the differential 
equation (2), and the proof of Theorem 3.2.2 is complete. 

Theorem 3.2.2 states that, beginning with only two solutions of equation (2), we can 
construct an infinite family of solutions by means of equation (7). The next question is whether 
all solutions of equation (2) are included in equation (7) or whether there may be other solutions 
of a different form. We begin to address this question by examining whether the constants c; 
and c; in equation (7) can be chosen so as to satisfy the initial conditions (3). These initial 
conditions require c, and c» to satisfy the equations 


€1y1 C49) + c2y2(t0) = yo. 


/ f / (8) 
c1y1 (10) + c2y5(10) = Yo- 
The determinant of coefficients of the system (8) is 
vito) — y2(to) " : 
= = t to) — t to). 9 
y(t) Xp) Yı (to) Y2(t0) — y1 (to) y2(to) (9) 


If W + 0, then equations (8) have a unique solution (c1, c2) regardless of the values of 
yo and yj. This solution is given by 


yo¥s(to) — yoy2(to) yoy; (to) + yoyiCto) s 
C] = E C2 = > 
yi(to) ¥5(to) — yq (to) yaCto) yi(to) ¥5(to) — yi (to) y2(to) 
or, in terms of determinants, 
Yo Y2(to) yi(to) yo 
Yo Yo(to) yio) X 
ee, ms cicer Pene (11) 


(to) — yo(1o) 
yio) — y5(to) 


1(fo) — y2(to) 
yi(to) y5(to) 


With these values for cı and c», the linear combination y = c1yı(t) + c2y2(t) satisfies the 
initial conditions (3) as well as the differential equation (2). Note that the denominator in the 
expressions for c, and c is the nonzero determinant W. 
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On the other hand, if W = 0, then the denominators appearing in equations (10) and (11) 
are zero. In this case equations (8) have no solution unless yo and y, have values that also 
make the numerators in equations (10) and (11) equal to zero. Thus, if W = 0, there are many 
initial conditions that cannot be satisfied no matter how c, and c3 are chosen. 

The determinant W is called the Wronskian? determinant, or simply the Wronskian, 
of the solutions y; and y2. Sometimes we use the more extended notation W[y, y5](1o) to 
stand for the expression on the right-hand side of equation (9), thereby emphasizing that the 
Wronskian depends on the functions y; and y», and that it is evaluated at the point tọ. The 
preceding argument establishes the following result. 


Theorem 3.2.3 


Suppose that y, and y» are two solutions of equation (2) 
L[y] = y" + p(t)y' + q(0y — 0, 
and that the initial conditions (3) 
y(t) = yo y(t) = Yo 
are assigned. Then it is always possible to choose the constants c4, c» so that 
y =ci y(t) + coya(t) 
satisfies the differential equation (2) and the initial conditions (3) if and only if the Wronskian 


W[yi, yal = yiy5 — Y1)2 


is not zero at fo. 


EXAMPLE 3 


In Example 2 of Section 3.1 we found that y(t) = e^?' and y(t) = e^ are solutions of the 
differential equation 


y" +5y’ + 6y = 0. 
Find the Wronskian of y, and y2. 


Solution: 


The Wronskian of these two functions is 


=e", 


—2e?! 3e 


Since W is nonzero for all values of t, the functions y; and yz can be used to construct solutions of 
the given differential equation, together with initial conditions prescribed at any value of t. One such 
initial value problem was solved in Example 3 of Section 3.1. 


The next theorem justifies the term “general solution" that we introduced in Section 3.1 
for the linear combination c; y; + c2y». 


3Wronskian determinants are named for Jósef Maria Hoéné-Wronski (1776-1853), who was born in Poland but spent 
most of his life in France. Wronski was a gifted but troubled man, and his life was marked by frequent heated disputes 
with other individuals and institutions. 
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Theorem 3.2.4 


Suppose that y, and y» are two solutions of the second-order linear differential equation (2), 
Lily] = y"  p()y' * q()y — 0. 
Then the two-parameter family of solutions 
y = c1y1 (t) + coyo(t) 


with arbitrary coefficients c, and c»? includes every solution of equation (2) if and only if there is a 
point tọ where the Wronskian of y; and y» is not zero. 


Let the function ¢ be any solution of equation (2). To prove the theorem, we must 
determine whether $ is included in the linear combinations c; y; + c2y2. That is, we must 
determine whether there are values of the constants c; and c» that make the linear combination 
the same as @. Let t; be a point where the Wronskian of y; and y; is nonzero. Then evaluate 
ġ and ¢’ at this point and call these values yo and y, respectively; that is, 


Yo = (to), yo = ó'(to). 
Next, consider the initial value problem 
y" t p()y'-q()y =0, yo) = yo. y (9) = Yo- (12) 


The function ó is certainly a solution of this initial value problem. Further, because we are 
assuming that W[y,, y2](fọ) is nonzero, it is possible (by Theorem 3.2.3) to choose cı and 
C2 such that y = c,y,(t) + c2y2(t) is also a solution of the initial value problem (9). In 
fact, the proper values of c, and c; are given by equations (10) or (11). The uniqueness part 
of Theorem 3.2.1 guarantees that these two solutions of the same initial value problem are 
actually the same function; thus, for the proper choice of c, and c», 


P(t) =ciyi(t) + exyx (t). (13) 


and therefore $ is included in the family of functions c)y; + czy. Finally, since @ is an 
arbitrary solution of equation (2), it follows that every solution of this equation is included in 
this family. 

Now suppose that there is no point tọ where the Wronskian is nonzero. Thus 
Wlyi, y2](to) = O for every point tọ. Then (by Theorem 3.2.3) there are values of yo and Yo 
such that no values of cı and c» satisfy the system (8). Select a pair of such values for yọ and 
Yo and choose the solution ¢(t) of equation (2) that satisfies the initial condition (3). Observe 
that this initial value problem is guaranteed to have a solution by Theorem 3.2.1. However, this 
solution is not included in the family y = c; y; +c2y2. Thus, in cases where W[yi, y2](to) = 0 
for every to, the linear combinations of y; and y» do not include all solutions of equation (2). 
This completes the proof of Theorem 3.2.4. 

Theorem 3.2.4 states that the Wronskian of y, and y; is not everywhere zero if and only if 
the linear combination c, y; + c? y? contains all solutions of equation (2). It is therefore natural 
(and we have already done this in the preceding section) to call the expression 


y = cy (t) + c2y2(t) 


with arbitrary constant coefficients the general solution of equation (2). The solutions 
yı and y, are said to form a fundamental set of solutions of equation (2) if and only if their 
Wronskian is nonzero. 

We can restate the result of Theorem 3.2.4 in slightly different language: to find the 
general solution, and therefore all solutions, of an equation of the form (2), we need only 
find two solutions of the given equation whose Wronskian is nonzero. We did precisely this 
in several examples in Section 3.1, although there we did not calculate the Wronskians. You 
should now go back and do that, thereby verifying that all the solutions we called “general 
solutions” in Section 3.1 do satisfy the necessary Wronskian condition. 

Now that you have a little experience verifying the nonzero Wronskian condition for the 
examples from Section 3.1, the following example handles all second-order linear differential 
equations whose characteristic polynomial has two distinct real roots. 
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EXAMPLE 4 


Suppose that y;(1) = e" and y(t) = e"?! are two solutions of an equation of the form (2). Show 
that if rı Æ r2, then y, and yz form a fundamental set of solutions of equation (2). 


Solution: 


We calculate the Wronskian of y, and y»: 


ryt rot 
el e? 


W = ryt 
rye r9e 


= (r2 — ri) explGri + r2)t]. 


rot 


Since the exponential function is never zero, and since we are assuming that r2 — rı Æ 0, it follows 
that W is nonzero for every value of t. Consequently, yı and y» form a fundamental set of solutions 
of equation (2). 


EXAMPLE 5 


Show that y,(t) = t!/? and yo(t) = t7! form a fundamental set of solutions of 
2y" E3ty/' -y 20, tr» 0. (14) 


Solution: 


We will show how to solve equation (14) later (see Problem 25 in Section 3.3). However, at this 
stage we can verify by direct substitution that y; and y» are solutions of the differential equation 


1 1 
(14). Since yi(t) = pte and y/(t) = c du we have 


1 1 1 3 
OP e uoa ae ee | gy eg. 
( 4 Und 2*2 


Similarly, y}(t) = —t~? and y{(t) 221 2, so 
2? (20-7) +3r(-17) 2-6! 2(4-3- Dr! =0. 
Next we calculate the Wronskian W of y; and y2: 
11/2 pd 


E 3,32 
Wes cue. glee (15) 


Since W + 0 for t > 0, we conclude that y; and y; form a fundamental set of solutions there. Thus 
the general solution of differential equation (14) is y(t) = cit!/? + c5t7! for t > 0. 


In several cases we have been able to find a fundamental set of solutions, and therefore the 
general solution, of a given differential equation. However, this is often a difficult task, and the 
question arises as to whether a differential equation of the form (2) always has a fundamental 
set of solutions. The following theorem provides an affirmative answer to this question. 


Theorem 3.2.5 


Consider the differential equation (2), 
Liy] = y" + p(t)y'  q(t)y — 0, 


whose coefficients p and q are continuous on some open interval 7. Choose some point fg in Z. Let 
yı be the solution of equation (2) that also satisfies the initial conditions 


y(t) =1, — y'(9) — 0, 
and let y; be the solution of equation (2) that satisfies the initial conditions 
y(t) — 0,  y'(19) = 1. 


Then y, and yz form a fundamental set of solutions of equation (2). 
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First observe that the existence of the functions y; and y» is ensured by the existence 
part of Theorem 3.2.1. To show that they form a fundamental set of solutions, we need only 
calculate their Wronskian at fo: 


(1p) yo to) 
Nune n = : 


X4)  y(ta)| ~ [0 | 2 
Since their Wronskian is not zero at the point fp, the functions y, and y? do form a fundamental 
set of solutions, thus completing the proof of Theorem 3.2.5. 

Note that the potentially difficult part of this proof, demonstrating the existence of a pair 
of solutions, is taken care of by reference to Theorem 3.2.1. Note also that Theorem 3.2.5 does 
not address the question of how to find the solutions y, and y; by solving the specified initial 
value problems. Nevertheless, it may be reassuring to know that a fundamental set of solutions 
always exists. 


EXAMPLE 6 


Find the fundamental set of solutions y, and y»? specified by Theorem 3.2.5 for the differential 
equation 


y'— y-0, (16) 
using the initial point tọ = 0. 


Solution: 


In Section 3.1 we noted that two solutions of equation (16) are y,;(t) = e' and y,(t) = e '. The 
Wronskian of these solutions is W[y;, y2](t) = —2 ¥ 0, so they form a fundamental set of solutions. 
However, they are not the fundamental solutions indicated by Theorem 3.2.5 because they do not 
satisfy the initial conditions mentioned in that theorem at the point t = 0. 

To find the fundamental solutions specified by the theorem, we need to find the solutions 
satisfying the proper initial conditions. Let us denote by y3(t) the solution of equation (16) that 
satisfies the initial conditions 


x0) —1, y(0)-0. (7) 
The general solution of equation (16) is 
y= tc, (18) 


and the initial conditions (17) are satisfied if c; = 1/2 and c; = 1/2. Thus 


(t) La + pact ht 
= -e' + ce =cosht. 
y3 2 2 
Similarly, if y4(t) satisfies the initial conditions 
y(0) —0, y(0)=1, (19) 
then 
(t) Lr lesi ht 
= e — ~-e = sinht. 
M 2 2 


Since the Wronskian of y; and y4 is 
W[ys, yal(t) = cosh” t — sinh? t = 1, 


these functions also form a fundamental set of solutions, as stated by Theorem 3.2.5. Therefore, the 
general solution of equation (16) can be written as 


y = kı cosht + ky sinh t, (20) 


as well as in the form (18). We have used k; and kp for the arbitrary constants in equation (20) because 
they are not the same as the constants c, and c» in equation (18). One purpose of this example is to 
make it clear that a given differential equation has more than one fundamental set of solutions; indeed, 
it has infinitely many (see Problem 16). As a rule, you should choose the set that is most convenient. 
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In the next section we will encounter equations that have complex-valued solutions. The 
following theorem is fundamental in dealing with such equations and their solutions. 


Theorem 3.2.6 


Consider again the second-order linear differential equation (2), 
Lily] = y" + p(t)y' - q(1)y = 0, 
where p and q are continuous real-valued functions. If y = u(t) +iv(t) isa complex-valued solution 


of differential equation (2), then its real part u and its imaginary part v are also solutions of this 
equation. 


To prove this theorem, we substitute u(t) + iv(t) for y in L[y], obtaining 
LIYD = u(t) + iv"(1) + POUW + iv'()) + a) (u(t) + iv(0). (21) 


Then, by separating equation (21) into its real and imaginary parts— and this is where we need 
to know that p(t) and q(t) are real-valued — we find that 


LIYE) = (u"(t) + p(t)u'(t) + q(t)u(t)) t i(v" (2) + p(Ov'(£) -a(0v(n) 
= L[u](£) +iL[v](t). 


Recall that a complex number is zero if and only if its real and imaginary parts are both 
zero. We know that L[y] = 0 because y is a solution of equation (2). Therefore, both L[u] = 0 
and L[v] = 0; consequently, the two real-valued functions u and v are also solutions of 
equation (2), so the theorem is established. We will see examples of the use of Theorem 3.2.6 
in Section 3.3. 

Incidentally, the complex conjugate y of a solution y is also a solution. While this can 
be proved by an argument similar to the one just used to prove Theorem 3.2.6, it is also a 
consequence of Theorem 3.2.2 since y = u(t) —iv(t) is a linear combination of two solutions. 

Now let us examine further the properties of the Wronskian of two solutions of a second- 
order linear homogeneous differential equation. The following theorem, perhaps surprisingly, 
gives a simple explicit formula for the Wronskian of any two solutions of any such equation, 
even if the solutions themselves are not known. 


Theorem 3.2.7 | (Abel’s Theorem)* 


If y, and y» are solutions of the second-order linear differential equation 
Liy] = y" + p(t)y’ t q()y —0, Q2) 


where p and q are continuous on an open interval 7, then the Wronskian W[y;. y2](t) is given by 


Wl[yi, y21(£) = c exp (- [mo a) (23) 


where c is a certain constant that depends on y; and y», but not on t. Further, W[y,, y2](t) either is 
zero for all t in J (if c = 0) or else is never zero in J (if c #0). 


To prove Abel’s theorem, we start by noting that y, and y» satisfy 
yi t+ p(t)y, +9) v1 = 0, 


y»  p()y» + q(t) yo = 0. 


If we multiply the first equation by —y2, multiply the second by yı, and add the resulting 
equations, we obtain 


(24) 


Quy — yi y2) + P(t) ny — yi»2 = 0. (25) 


^The result in Theorem 3.2.7 was derived by the Norwegian mathematician Niels Henrik Abel (1802-1829) in 1827 
and is known as Abel's formula. Abel also showed that there is no general formula for solving a quintic, or fifth 
degree, polynomial equation in terms of explicit algebraic operations on the coefficients, thereby resolving a question 
that had remained unanswered since the sixteenth century. His greatest contributions, however, were in analysis, 
particularly in the study of elliptic functions. Unfortunately, his work was not widely noticed until after his death. The 
distinguished French mathematician Legendre called it a “monument more lasting than bronze." 
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Next, we let W(t) = W[y1, y2](t) and observe that 

W' = yyy — yy»a- Q6) 
Then we can write equation (25) in the form 

W' + p(t) W «0. (27) 


Equation (27) can be solved immediately since it is both a first-order linear differential 
equation (Section 2.1) and a separable differential equation (Section 2.2). Thus 


W(t) =cexp (- J p(t) a). (28) 
where c is a constant. 


The value of c depends on which pair of solutions of equation (22) is involved. However, 
since the exponential function is never zero, W(t) is not zero unless c = 0, in which case 
W(t) is zero for all t. This completes the proof of Theorem 3.2.7. 

Note that the Wronskians of any two fundamental sets of solutions of the same differential 
equation can differ only by a multiplicative constant, and that the Wronskian of any 
fundamental set of solutions can be determined, up to a multiplicative constant, without solving 
the differential equation. Further, since under the conditions of Theorem 3.2.7 the Wronskian 
W is either always zero or never zero, you can determine which case actually occurs by 
evaluating W at any single convenient value of t. 


EXAMPLE 7 


In Example 5 we verified that y,(t) = t!/? and yo(1) = t^ are solutions of the equation 
2y" p 3:ty y 20, t» 0. (29) 


Verify that the Wronskian of yı and y is given by Abel’s formula (23). 


Solution: 


3 
From the example just cited we know that W[y;. y2](t) = — "URS To use equation (23), we must 


write the differential equation (29) in the standard form with the coefficient of y" equal to 1. Thus 
we obtain 


li 


3 1 
Dk aa O: 


2t 


3 
so p(t) = 2 Hence 


WIyi, yalt) = c exp(- f Za) =cop( -3 imr) 


= cth., (30) 
Equation (30) gives the Wronskian of any pair of solutions of equation (29). For the particular 
3 


solutions given in this example, we must choose c = — z 


Summary. We can summarize the discussion in this section as follows: to find the general 
solution of the differential equation 


y' + p(t)y *q(0y 20, a<t< f, 


we must first find two functions yı and y» that satisfy the differential equation in o, < t < f. 
Then we must make sure that there is a point in the interval where the Wronskian W of yı and 
y2 is nonzero. Under these circumstances yı and y» form a fundamental set of solutions, and 
the general solution is 


y = cy) + ext), 


where c, and c» are arbitrary constants. If initial conditions are prescribed at a given point in 
a < t < p, then c, and c? can be chosen so as to satisfy these conditions. 


Problems 


In each of Problems 1 through 5, find the Wronskian of the given pair 
of functions. 


1. e? e 92 

2. cost, sint 

3. eO, te” 

4. e'sint, e'cost 

5. cos0, 1+ cos( 20) 


In each of Problems 6 through 9, determine the longest interval in 
which the given initial value problem is certain to have a unique twice- 
differentiable solution. Do not attempt to find the solution. 

6. ty”+3y=t, y())=1, y(1)=2 

7. t(t—4)y" -3ty' Ay 22, y(3 20, y (3 2-1 

8. y'c(cost)y' +3(In|t))y=0, y 23, y 21 

9. (x—2)y"+y/+(x—-2(tanx)y 20, »3)-1, y(3) 22 


10. Verify that y;(t) = 1? and y;(r) = t^! are two solutions 
of the differential equation t?y” — 2y = 0 fort > 0. Then show 
that y = cıt? + cot! is also a solution of this equation for any c, 
and C5. 


11. Verify that y;(t) = 1 and y(t) = t!/2 are solutions of the 
differential equation yy" + (y^)? = 0 for t > 0. Then show that 
y = cı + czt !/? is not, in general, a solution of this equation. Explain 
why this result does not contradict Theorem 3.2.2. 

12. Show that if y = $ (1) is a solution of the differential equation 
y" + p(t)y’ + q(t)y = g(t), where g(t) is not always zero, then 
y = có(t), where c is any constant other than 1, is not a solution. 
Explain why this result does not contradict the remark following 
Theorem 3.2.2. 

13. Can y = sin(1?) bea solution on an interval containing t = 0 of 
an equation y" + p(1) y' + q(t)y = 0 with continuous coefficients? 
Explain your answer. 


14. If the Wronskian W of f and g is 3e“, and if f(t) = e”, find 
g(t). 

15. If the Wronskian of f and g is tcost — sint, and if 
u = f +3g,v = f — g, find the Wronskian of u and v. 


16. Assume that y; and y are a fundamental set of solutions 
of y" + p(t)y' + q(t)y = O0 and let y3 = ayy; + azy and 
ya = byy, + boy», where ay, a», by, and bz are any constants. Show 
that 


W[ys. y4] = Ca1b2 — a3b1) W[y1, y2]. 
Are y3 and y4 also a fundamental set of solutions? Why or why not? 


In each of Problems 17 and 18, find the fundamental set of solutions 
specified by Theorem 3.2.5 for the given differential equation and 
initial point. 

17. y"-y'-2y-0, 
18. y"+4y’+3y =0, 
In each of Problems 19 through 21, verify that the functions y, and y2 


are solutions of the given differential equation. Do they constitute a 
fundamental set of solutions? 


19. y"+4y=0; y(t) =cos(2t), yo(t) = sin(2t) 
20. y’—2y’+y=0; y(t) =e", y(t) = te’ 

21. x?y" - x(x -2)y' +(x -2y 20, x > 0; 
y(x) =x, y(x) = xe* 


1j =0 
to =1 
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22. Consider the equation y" — y' — 2y =0. 

a. Show that y,(1) = e^! and y;(1) = e” form a fundamental 
set of solutions. 

b. Let y3(t) = —2e", y(t) = yi(t) + 2y2(t), and 
ys(t) = 2y,(t) — 2y3(t). Are y3(t), y4(t), and ys(t) also 
solutions of the given differential equation? 

c. Determine whether each of the following pairs forms a 
fundamental set of solutions: {yi(t), y(t) }; xt. y3(t) ji 


(nO. y(0 bs Lye), vs }. 
In each of Problems 23 through 25, find the Wronskian of two 


solutions of the given differential equation without solving the 
equation. 


23. ty” — t(t - 2)y' - (t -2)y 20 
24. (cost) y" + (sint) y - ty «0 


25. (1—x2)y" - 2xy' +a(a+1)y=0,  Legendre's equation 


26. Show that if p is differentiable and p(t) > 0, then the 
Wronskian W(t) of two solutions of [p(1) y'] + q(t)y = 0 is 
W(t) — c/ p(t), where c is a constant. 

27. Ifthe differential equation t y" -- 2y' 4- te! y = 0 has y; and y» as 


a fundamental set of solutions and if W[y;, y2](1) = 2, find the value 
of W[yi, y2](5). 


28. Ifthe Wronskian of any two solutions of y"-4- p(t) y'+q(t)y = 0 
is constant, what does this imply about the coefficients p and q? 


In Problems 29 and 30, assume that p and q are continuous and 
that the functions y; and y» are solutions of the differential equation 
y" + p(t) y' + q(t) y = 0 on an open interval /. 


29. Prove that if y; and y» are zero at the same point in Z, then they 
cannot be a fundamental set of solutions on that interval. 


30. Prove that if y; and y? have a common point of inflection fg in 
I, then they cannot be a fundamental set of solutions on / unless both 
p and q are zero at fo. 


31. Exact Equations. The equation 
P(x)y" + QG)y' + RG)y 20 
is said to be exact if it can be written in the form 
(PQ) y) + (F(x) y)' = 0. 


where f(x) is to be determined in terms of P(x), Q(x), and R(x). 
The latter equation can be integrated once immediately, resulting 
in a first-order linear equation for y that can be solved as in 
Section 2.1. By equating the coefficients of the preceding equations 
and then eliminating f(x), show that a necessary condition for 
exactness is 


P'(x) — Q'(x) + R(x) = 0. 


It can be shown that this is also a sufficient condition. 


In each of Problems 32 through 34, use the result of Problem 31 to 
determine whether the given equation is exact. If it is, then solve the 
equation. 


32. y’+xy'+y=0 
33. xy" —(cosx)y'- (sinx)y 20, x » 0 
34. x?y" pxy —-y 20, x 5 0 
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35. The Adjoint Equation. If a second-order linear homogeneous 
equation is not exact, it can be made exact by multiplying by an 
appropriate integrating factor u(x). Thus we require that u(x) be 
such that 

p) PG) y" + p(x) O(a) y+ w(x) RO) y = 0 
can be written in the form 

(u(x) PG) Y) + (f(a) y)! = 0. 

By equating coefficients in these two equations and eliminating f(x), 
show that the function jz must satisfy 


Pp" 4(2P' =a 4-( P" — Q' 4+ R)y e 0. 
This equation is known as the adjoint of the original equation and is 
important in the advanced theory of differential equations. In general, 


the problem of solving the adjoint differential equation is as difficult as 
that of solving the original equation, so only occasionally is it possible 
to find an integrating factor for a second-order equation. 

In each of Problems 36 and 37, use the result of Problem 35 to find the 
adjoint of the given differential equation. 


36. x?y" +xy' +(x? —v?)y=0, Bessel's equation 

37. y'—xy —0, Airy’s equation 

38. A second-order linear equation P(x) y" 4- Q(x) y'- R(x) y =0 
is said to be self-adjoint if its adjoint is the same as the original 
equation. Show that a necessary condition for this equation to be 


self-adjoint is that P'(x) — Q(x). Determine whether each of the 
equations in Problems 36 and 37 is self-adjoint. 


33 Complex Roots of the Characteristic 


Equation 


We continue our discussion of the second-order linear differential equation 


ay" +by' cy =0, (1) 


where a, b, and c are given real numbers. In Section 3.1 we found that if we seek solutions of 
the form y = e”, then r must be a root of the characteristic equation 


ar? 4- br -- c — 0. (2) 


We showed in Section 3.1 that if the roots rı and rz are real and different, which occurs 
whenever the discriminant b? — 4ac is positive, then the general solution of equation (1) is 


y= cue"! + ce”. (3) 


Suppose now that b? — 4ac is negative. Then the roots of equation (2) are conjugate 
complex numbers; we denote them by 


rp =At+ip, m~=A-ip, (4) 


where À and p are real. The corresponding expressions for y are 


yi(t) = exp(CA +ip)t), 


yo(t) = exp((A — ip)t). (5) 


Our first task is to explore what is meant by these expressions, which involve evaluating the 
exponential function for a complex exponent. For example, if A = —1, jj = 2, and t = 3, 


then from equation (5), 


yi3) = eti, (6) 


What does it mean to raise the number e to a complex power? The answer is provided by an 
important relation known as Euler’s formula. 


Euler’s Formula. 


To assign a meaning to the expressions in equations (5), we need to give 


a definition of the complex exponential function. Of course, we want the definition to reduce 
to the familiar real exponential function when the exponent is real. There are several ways 
to discover how this extension of the exponential function should be defined. Here we use a 
method based on infinite series; an alternative is outlined in Problem 20. 

Recall from calculus that the Taylor series for e' about t = 0 is 


; p t 
dS thee 


n oo 


Le- - 


— —00 « t « oo. (7) 
n! n! 
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If we now assume that we can substitute it for t in equation (7), then we have 


. oo ity” 
et = 5 uo (8) 


n=0 


To simplify this series, we write (it)” = i"t" and make use of the facts that i? = —1, 
i? = —i,i^ = 1, and so forth. When n is even, there is an integer k with n = 2k; in this 
case i” = i™ = (—1)*. And when n is odd, n = 2k + 1, so i” = i?**! = j( —1)*. This 
suggests separating the terms in the right-hand side of (8) into its real and imaginary parts. 
The result is? 


; oo ( —1)*67* oo (—1)k 12441 
ct —— 
e 2H (2k)! a (2k +1)!” e 


The first series in equation (9) is precisely the Taylor series for cost about t = 0, and the 
second is the Taylor series for sin? about t = 0. Thus we have 


e! = cost 4- i sint. (10) 


Equation (10) is known as Euler's formula and is an extremely important mathematical 
relationship. 

Although our derivation of equation (10) is based on the unverified assumption that the 
series (7) can be used for complex as well as real values of the independent variable, our 
intention is to use this derivation only to make equation (10) seem plausible. We now put 
matters on a firm foundation by adopting equation (10) as the definition of e. In other words, 
whenever we write e, we mean the expression on the right-hand side of equation (10). 

There are some variations of Euler's formula that are also worth noting. If we replace t 
by —t in equation (10) and recall that cos(—t) = cost and sin( —7) = — sint, then we have 


e" = cost — i sint. (11) 

Further, if t is replaced by ut in equation (10), then we obtain a generalized version of Euler's 
formula, namely, 

e"! = cos(ut) +isin(pt). (12) 

Next, we want to extend the definition of the exponential function to arbitrary complex 


exponents of the form ( A +i u) t. Since we want the usual properties of the exponential function 
to hold for complex exponents, we certainly want exp((À + ii) t) to satisfy 


eAtint = e^ ei", (13) 


Then, substituting for et”! from equation (12), we obtain 
etl = eò (cos( ut) +i sin(pt)) 
= e cos( ut) + ie" sin(yt). (14) 


We now take equation (14) as the definition of exp[(A + ipi)t]. The value of the exponential 
function with a complex exponent is a complex number whose real and imaginary parts are 
given by the terms on the right-hand side of equation (14). Observe that the real and imaginary 
parts of exp((A + ij4)t) are expressed entirely in terms of elementary real-valued functions. 
For example, the quantity in equation (6) has the value 


e 3+6 — 6-3 cos 6 + ie ? sin6 = 0.0478041 — 0.0139113i. 


With the definitions (10) and (14), it is straightforward to show that the usual laws of 
exponents are valid for the complex exponential function. You can also use equation (14) to 
verify that the differentiation formula 

d 


e a= ppt 
p ) —re (15) 


holds for complex values of r. 


Recall from calculus that the reordering of terms in the right-hand side of equation (9) is allowed because the series 
converges absolutely for all —oo < t < oo. 
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EXAMPLE 1 


Find the general solution of the differential equation 


y" +y  9.25y — 0. (16) 
Also find the solution that satisfies the initial conditions 

x0) 22, y(0)-—8, (17) 
and draw its graph for 0 < t < 10. 


Solution: 
The characteristic equation for equation (16) is 
r+r+9.25 =0 
so its roots are 
1 1 


pes EO ee 


Therefore, two solutions of equation (16) are 


1 
y(t) = ew( (-3 + si) J = e"?! (cos(3t) +i sin(3t)) (18) 
and 
boa zn -" 
yo(t) = exp sm 3i j£ | =e (cos(3t) — i sin(31)). (19) 
You can verify that the Wronskian W[y;, y2](t) = —6ie™', which is not zero, so the general 
solution of equation (15) can be expressed as a linear combination of y,(t) and y2(t) with arbitrary 
coefficients. 


However, the initial value problem (16), (17) has only real coefficients, and it is often desirable 
to express the solution of such a problem in terms of real-valued functions. To do this we can make 
use of Theorem 3.2.6, which states that the real and imaginary parts of a complex-valued solution 
of equation (16) are also solutions of the same differential equation. Thus, starting from y;(t), we 
obtain 


u(t) = e"? cos(3t), v(t) 2e"? sin(3t) (20) 


as real-valued solutions of equation (16). On calculating the Wronskian of u(t) and v(t), we find 
that W[u, v](t) = 3e™, which is not zero; thus u(t) and v(t) form a fundamental set of solutions, 
and the general solution of equation (16) can be written as 


y = cult) + c(t) = e™? (c cos(3t) + c» sin(3t) ), (21) 


where c; and c» are arbitrary constants. 
To satisfy the initial conditions (17), we first substitute t = 0 and y = 2 in the solution (20) 
with the result that c; = 2. Then, by differentiating equation (21), setting t = 0, and setting y’ = 8, 


1 
we obtain — 55 1 +3c2 = 8 so that c; = 3. Thus the solution of the initial value problem (16), (17) is 


y= e™ (2 cos(3t) + 3 sin(3r)). (22) 


The graph of this solution is shown in Figure 3.3.1. 

From the graph we see that the solution of this problem oscillates, with period 27/3 and a 
decaying amplitude. The sine and cosine factors control the oscillatory nature of the solution, and 
the negative exponential factor in each term causes the magnitude of the oscillations to decrease 
toward zero as time increases. 


Tf you are not completely sure that u(t) and v(t) are solutions of the given differential equation, you should 
substitute these functions into equation (16) and confirm that they satisfy it. (See Problem 23.) 
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Solution of the initial value problem (16), (17): 
y” + y! 4-9.25y =0, y(0) = 2, y(0) =8. 


Complex Roots; The General Case. The functions y(t) and y2(t), given by equations (5) 
and with the meaning expressed by equation (14), are solutions of equation (1) when the roots 
of the characteristic equation (2) are complex numbers A + iu. However, the solutions y, 
and y? are complex-valued functions, whereas in general we would prefer to have real-valued 
solutions because the differential equation itself has real coefficients. Just as in Example 1, we 
can use Theorem 3.2.6 to find a fundamental set of real-valued solutions by choosing the real 
and imaginary parts of either y,(t) or y2(t). In this way we obtain the solutions 


u(t) 2e" cos( ut), v(t) = e" sin( ut). Q3) 
By direct computation (see Problem 19), you can show that the Wronskian of u and v is 
Wiu, v(t) = pe. (24) 


Thus, as long as u ZZ 0, the Wronskian W is not zero, so u and v form a fundamental set of 
solutions. (Of course, if jj = 0, then the roots are real and equal and the discussions in this 
section, and in Section 3.1, are not applicable.) Consequently, if the roots of the characteristic 
equation are complex numbers A + iu, with u Æ 0, then the general solution of equation (1) 
is 


y = cie" cos( ut) + coe? sin( ut), (25) 


where cı and c» are arbitrary constants. Note that the solution (25) can be written down as 
soon as the values of A and yz are known. Let us now consider some further examples. 


EXAMPLE 2 


Find the solution of the initial value problem 


16y" -8y'--145y 20, y(0) ——2, y(0) — 1. Q6) 
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Solution: 


1 
The characteristic equation is 16r? — 8r + 145 = 0 and its roots are r = P + 3i. Thus the general 


solution of the differential equation is 
y(t) = cie"^ cos(3r) + cse" /^ sin(3r). Q7) 
To apply the first initial condition, we set t = 0 in equation (27); this gives 
y(0) = cı = -2. 


For the second initial condition, we must differentiate equation (27) before substituting t = 0. In this 
way we find that 


1 
y(0) = n +3e, = 1, 


1 
from which we determine that c; = 3 Using these values of c, and c» in the general solution (27), 


we obtain 
1 
y = —2e'/^ cos(3t) + ^ aii sin(3t) (28) 


as the solution of the initial value problem (26). The graph of this solution is shown in Figure 3.3.2. 

In this case we observe that the solution is a growing oscillation. Again the trigonometric factors 
in equation (28) determine the oscillatory part of the solution (again with period 27 /3), while the 
exponential factor (with a positive exponent this time) causes the magnitude of the oscillation to 
increase with time. 


y 


10 y = 2e" cos (3t) + je" sin (37) 


Solution of the initial value problem (26): 
16y" — 8y' + 145y = 0, y(0) = —2, y'(0 = I. 


EXAMPLE 3 


Find the general solution of 


y" +9y =0. (29) 


Solution: 


The characteristic equation is r? +9 = 0 with the roots r = 3i; thus À = 0 and jj = 3. The general 
solution is 


y = cı cos( 3t) + c2 sin(3t). (30) 
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Solutions of equation (29): y” + 9y = 0, with two sets of 
initial conditions: y(0) = 1, y'(0) = 2 (dashed, green) and y(0) = 2, y'(0) = 8 
(solid, blue). Both solutions have the same period, but different amplitudes and 
phase shifts. 


Note that if the real part of the roots is zero, as in this example, then there is no exponential factor 
in the solution. Figure 3.3.3 shows the graph of two solutions of equation (28) with different initial 
conditions. In each case the solution is a pure oscillation with period 27/3 but whose amplitude 
and phase shift are determined by the initial conditions. Since there is no exponential factor in the 
solution (30), the amplitude of each oscillation remains constant in time. 


Problems 


In each of Problems 1 through 4, use Euler's formula to write the given 
expression in the form a + ib. 


1. exp(2—3i) 


16. Consider the initial value problem 


14. y"+y=0, y(-/3) 22, y(r/3) 2 —4 
15. y’+2y’+2y=0, y(-/4) =2, y'(m/4) 2 —2 
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2. git 

; 3u" —u'+2u=0, u(0) =2, u'(0) — 0. 
3. e2? (T/Di 
4. 21-i a. Find the solution u(t) of this problem. 


In each of Problems 5 through 11, find the general solution of the given 
differential equation. 


5. y”—2y'+2y=0 
y" —2y'+6y =0 
y! + 2y' +2y 20 


6 
7 
8. y" 6y'-13y 20 
9 


11. 


y" +2y' -1.25y 20 
9y" + 9y' - 4y 20 
y" +4y’ + 6.25y =0 


In each of Problems 12 through 15, find the solution of the given 
initial value problem. Sketch the graph of the solution and describe 
its behavior for increasing t. 


Q 12. y"4+4y=0, x0) =0, y(0) =1 
O 13. y’—2y'+5y=0, y(»x/2 =0, y(a/2) =2 


b. Fort > 0, find the first time at which |u(t)| = 10. 

@ 17. Consider the initial value problem 
5u" --2u' -7u 20, u(0) 22, u'(0) — 1. 

a. Find the solution u(t) of this problem. 

b. Find the smallest T such that |u(t)| < 0.1 for allt > T. 
@ 18. Consider the initial value problem 

y’+2y'+6y=0, y(0)=2, y(0) =a 70. 
a. Find the solution y(t) of this problem. 


b. Find a such that y = 0 whent = 1. 
C. Find, as a function of a, the smallest positive value of t for 


which y — 0. 
d. Determine the limit of the expression found in part c as 
a> oo. 
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19. Show that W [eàt cos( ut), eò sin(ut)] = pe, 


20. In this problem we outline a different derivation of Euler’s 
formula. 
a. Show that yı(t) = cost and y(t) = sin t are a fundamental 
set of solutions of y”+ y = 0; that is, show that they are solutions 
and that their Wronskian is not zero. 


b. Show (formally) that y = e' is also a solution of y" +y = 0. 
Therefore, 


e! = c, cost +c) sint (31) 


for some constants c; and cy. Why is this so? 

C. Set¢ = 0 in equation (31) to show that c, = 1. 

d. Assuming that equation (15) is true, differentiate 
equation (31) and then set t = 0 to conclude that c; = i. Use the 
values of c, and c» in equation (31) to arrive at Euler's formula. 


21. Using Euler’s formula, show that 


elt + e ei! — e 
— = COSf, 


— =sint. 
2 2i 


rytr)t ryt rot 
(yf = utn 


22. Ife" is given by equation (14), show that e 
for any complex numbers rı and r2. 


23. Consider the differential equation 
ay" + by’ +cy =0, 


where b? —4ac < Oand the characteristic equation has complex roots 
A + iu. Substitute the functions 


u(t) = e" cos( ut) and v(t) = e sin( ut) 


for y in the differential equation and thereby confirm that they are 
solutions. 


24. If the functions y, and y are a fundamental set of solutions of 
y" + p(t) y' + q(t) y = 0, show that between consecutive zeros of y, 
there is one and only one zero of y2. Note that this result is illustrated 
by the solutions y;(7/) = cost and y:(t) = sint of the equation 
y" + y = 0. 

Hint: Suppose that t; and f; are two zeros of y, between which 
there are no zeros of y2. Apply Rolle's theorem to y;/ y? to reach a 
contradiction. 


Change of Variables. Sometimes a differential equation with variable 
coefficients, 


y” + p(t)y’ + q(t)y — 0, (32) 


can be put in a more suitable form for finding a solution by making 
a change of the independent variable. We explore these ideas in 
Problems 25 through 36. In particular, in Problem 25 we show that 
a class of equations known as Euler equations can be transformed 
into equations with constant coefficients by a simple change of the 
independent variable. Problems 26 through 31 are examples of this 
type of equation. Problem 32 determines conditions under which the 
more general equation (32) can be transformed into a differential 
equation with constant coefficients. Problems 33 through 36 give 
specific applications of this procedure. 


25. Euler Equations. An equation of the form 


d?y dy 
2 2 
t +at 
dt? St 


+6y=0, t»0, (33) 
where a and f) are real constants, is called an Euler equation. 
a. Let x = lnt and calculate dy/dt and d?y/dt? in terms of 
dy/dx and d?y/dx?. 


b. Use the results of part a to transform equation (33) into 


d?y 


dx? 


ddnde dpi. (34) 
dx 


Observe that differential equation (34) has constant coefficients. 
If yı(x) and y2(x) form a fundamental set of solutions of 
equation (34), then y,(Int) and y2(Int) form a fundamental set 
of solutions of equation (33). 


In each of Problems 26 through 31, use the method of Problem 25 to 
solve the given equation for t > 0. 


26. Ü?y'Lty +y=0 
27. t?y’ +4ty’+2y 0 
28. ?y"—A4ty' -6y 20 
29. ry" — Aty' - 6y =0 
30. ty’ +3ty’ -3y 20 
31. ty” +7ty' +10y 20 
32. In this problem we determine conditions on p and q that 
enable equation (32) to be transformed into an equation with constant 
coefficients by a change of the independent variable. Let x = u(t) be 
the new independent variable, with the relation between x and f to be 
specified later. 

a. Show that 


dy dx dy d?y (dx * Py d?x dy 
dt dt dx’ dit?  \ dt) dx? dt dx’ 


b. Show that the differential equation (32) becomes 


dx “dy Px "t: dy | uei 
Hide Vae bilge Oe 


c. In order for equation (35) to have constant coefficients, the 
coefficients of d*y/dx*, dy/dx, and y must all be proportional. 
If q(t) > 0, then we can choose the constant of proportionality 
to be 1; hence, after integrating with respect to t, 


x=u(t) = fao” dt. (36) 


d. With x chosen as in partc, show that the coefficient of dy / dx 
in equation (35) is also a constant, provided that the expression 


q'() + 2p(t)q() 
2(q(t))3/? 

is a constant. Thus equation (32) can be transformed into 
an equation with constant coefficients by a change of the 
independent variable, provided that the function (4' J-2pq) / q?/? 
is a constant. 
e. How must the analysis and results in d be modified if 
q(t) « 0? 


In each of Problems 33 through 36, try to transform the given equation 
into one with constant coefficients by the method of Problem 32. If this 
is possible, find the general solution of the given equation. 


33. y+ ty’ + ety =0, 
34. y"-r3ty' 4 y =0, 
35. ty” + - Dy 4 Py =0, 
36. y"+ty' — e? y =0 


(37) 


—oo«t« oo 
—oo«t« oo 


O< too 
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34 Repeated Roots; Reduction of Order 


In Sections 3.1 and 3.3 we showed how to solve the equation 
ay" + by cy =0 Q) 
when the roots of the characteristic equation 
ar? +br+c=0 (2) 


either are real and different or are complex conjugates. Now we consider the third possibility, 
namely, that the two roots r, and r» are equal. This case is transitional between the other two 
and occurs when the discriminant b? — 4ac is zero. Then it follows from the quadratic formula 
that 
: (3) 
n=Mn=-—. 
1 2 2d 

The difficulty is immediately apparent; both roots yield the same solution 

yi(t) = e PCO (4) 


of the differential equation (1), and it is not obvious how to find a second solution. 


EXAMPLE 1 


Solve the differential equation 


y" + Ay! +4y =0. (5) 


Solution: 


The characteristic equation is 
r? 4r F4 —(r-2)? 20 


sor; = r; = —2. Therefore, one solution of equation (5) is y;(t) = e~”. To find the general 
solution of equation (5), we need a second solution that is not a constant multiple of y;. This second 
solution can be found in several ways (see Problems 15 through 17); here we use a method originated 
by d’Alembert’ in the eighteenth century. Recall that since y,(t) is a solution of equation (1), so 
is cy; (1) for any constant c. The basic idea is to generalize this observation by replacing c by a 
function v(t) and then trying to determine v(t) so that the product v(t) y;(t) is also a solution of 
equation (1). 

To carry out this program, we substitute y = v(t) y,(t) in equation (5) and use the resulting 
equation to find v(t). Starting with 


y =O = vine, (6) 
we differentiate once to find 
y 2v'()e^? —2y(1)e? (7) 
and a second differentiation yields 
y! 2 v'(r)e 7 — Av" (ne! + Av(1)e 7. (8) 


By substituting the expressions in equations (6), (7), and (8) in equation (5) and collecting terms, we 
obtain 


(v(t) — Av'(1) + 4v(1) + Av'(t) — 8v(1) -4v(0))e 7 2 0, 


"Jean d'Alembert (1717-1783), a French mathematician, was a contemporary of Euler and Daniel Bernoulli and 
is known primarily for his work in mechanics and differential equations. D'Alembert's principle in mechanics 
and d'Alembert's paradox in hydrodynamics are named for him, and the wave equation first appeared in his paper 
on vibrating strings in 1747. In his later years he devoted himself primarily to philosophy and to his duties as 
science editor of Diderot's Encyclopédie. 
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which simplifies to 


v(t) — 0. (9) 
Therefore, 
v'(t) 2c 
and 
v(t) 2 eit +c, (10) 


where c, and c; are arbitrary constants. Finally, substituting for v(t) in equation (6), we obtain 
y 2 cte P 4 cet, (11) 


The second term on the right-hand side of equation (11) corresponds to the original solution 
yı(t) = exp(—2r), but the first term arises from a second solution, namely, yo(t) = t exp( —2t). 
We can verify that these two solutions form a fundamental set by calculating their Wronskian: 


ga ig * —At —4t —4t 
Wy, y2l(0) = —2e?  (1-20e|^ e" —2te " + 2te 
=e" 40, 
Therefore, 
yi(t) =e", y(t) = te? (12) 


form a fundamental set of solutions of equation (5), and the general solution of that equation is given 
by equation (11). Note that both y; (7) and y2(r) tend to zero as t — oo; consequently, all solutions 
of equation (5) behave in this way. The graphs of typical solutions are shown in Figure 3.4.1. 


y 
20-b*9, 


1.84 s 
| S 
1.64 s 
1.4d * 
1.24 x 
4 S 
1.0 
0.84 
0.64 


——T3 
0 0.5 1 1.5 2 2.5 


Three solutions of equation (5): 
y" +4y’ + 4y = 0, with different sets of initial conditions: 
y(0) = 2, y'(0) = 1 (blue, dashed); y(0) = 1, y'(0) = 1 (green, 
solid); y(0) = 1/2, y'(0) = 1 (red). 


The procedure used in Example 1 can be extended to a general equation whose 
characteristic equation has repeated roots. That is, we assume that the coefficients in equation 
(1) satisfy b? — 4ac = 0, in which case 


vit) = eto 


is a solution. To find a second solution, we assume that 
y =v(t) y(t) = v()e 9/02 (13) 


and substitute for y in equation (1) to determine v(t). We have 


b 
yl = v(t) e P / Q9 = ruo (14) 
a 
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and 


2 
y" = v'(r)e P / Q0 u b rper Qo + P yeyeto, (15) 
a 4a? 


Then, by substituting in equation (1), we obtain 


7 b / p? / b —bt/ (2 
al v'(t) — v(t) + Terie, C (f) — zo) +cv(t) Je 22 = 0. (16) 
a a a 


Canceling the factor e~’/(?” , which is nonzero, and rearranging the remaining terms, we find 
that 


p? p? 
av"(t) +(—b+ bv'(t) + | — — — +c |v(t) = 0. (17) 
4a 2a 


The term involving v'(f) is obviously zero. Further, the coefficient of v(f) is 
c — b? / (4a), which is also zero because b? — 4ac = 0 in the problem that we are considering. 
Thus, just as in Example 1, equation (17) reduces to 


v^(t) — 0, 
so 
v(t) =cy eot. 
Hence, from equation (13), we have 


y= cje ™/ 00 + cote ™™/ Qa). (18) 


Thus y is a linear combination of the two solutions 


nose... y(t) = teh e0, (19) 
The Wronskian of these two solutions is 
e PU Qa) te P /Qa) 
= — „—bt/a 
WOw»0- b qu (1 PY io 7€ (20) 
2a 2a 


Since W[yi, y2](t) is never zero, the solutions y; and y2 given by equation (19) are a 
fundamental set of solutions. Further, equation (18) is the general solution of equation (1) 
when the roots of the characteristic equation are equal. In other words, in this case there is one 
exponential solution corresponding to the repeated root and a second solution that is obtained 
by multiplying the exponential solution by t. 


EXAMPLE 2 


Find the solution of the initial value problem 


1 
y" =y + A =0, x0 =2, y) =z: (21) 


Solution: 


The characteristic equation is 
1 
2 
r°—r+—=0, 
4 
so the roots are r; = r2 = 1/2. Thus the general solution of the differential equation is 


y = cie? + catel? Q2) 
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The first initial condition requires that 


y(0) =c =2. 
To satisfy the second initial condition, we first differentiate equation (22) and then set t = 0. This 
gives 
j 1 1 
y (0) = gc te -3' 
so c; = —2/3. Thus the solution of the initial value problem is 
y 22277 — seh. (23) 


The graph of this solution is shown by the blue curve in Figure 3.4.2. 


AC 


y'O) 2 2: y = 2e? + tet? 


Solutions of y" — y' + y/4 = 0, y(0) = 2, 
with y'(0) = 1/3 (blue) and with y'(0) = 2 (red). 


Let us now modify the initial value problem (16) by changing the initial slope; to be specific, 
let the second initial condition be y'(0) = 2. The solution of this modified problem is 


y= 2e! /2 $ tel/2, 


and its graph is shown by the red curve in Figure 3.4.2. The graphs shown in this figure suggest that 
there is a critical initial slope, with a value between 1/3 and 2, that separates solutions that increase 
ast — oo from those that ultimately decrease as t — oo. In Problem 12 you are asked to determine 
this critical initial slope. 


The asymptotic behavior of solutions is similar in this case to that when the roots are real 
and different. If the exponents are either positive or negative, then the magnitude of the solution 
grows or decays accordingly; the linear factor t has little influence. A decaying solution is 
shown in Figure 3.4.1 and growing solutions in Figure 3.4.2. However, if the repeated root 
is zero, then the differential equation is y" = 0 and the general solution is a linear function 
of t. 


Summary. Wecannow summarize the results that we have obtained for second-order linear 
homogeneous equations with constant coefficients 


ay" + by! cy =0. (24) 
Let rı and r, be the roots of the corresponding characteristic equation 


ar? 4 br 4- c — 0. (25) 
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If r; and r3 are real but not equal, then the general solution of differential equation (24) is 


y — cue" + coe", (26) 


If rı and rz are complex conjugates A + ij, then the general solution is 
y = cie" cos( ut) + coe? sin( ut). (27) 
If rj; = 7, then the general solution is 
y = qe" + ete. (28) 


Reduction of Order. Itis worth noting that the procedure used in this section for equations 
with constant coefficients is more generally applicable. Suppose that we know one solution 
yı(t), not everywhere zero, of 


y" + p(thy’ +q(t)y =0. (29) 
To find a second solution, let 
y = v(t) y(t); (30) 
then 
y ez v'()yi() HOYO 
and 


y = v"(t)yi() * 2v'() yD) HvA yA). 
Substituting for y, y’, and y" in equation (29) and collecting terms, we find that 


yiv” + (2y, + pyDv' + (yt + pyi + qy)v = 0. (31) 


Since y, is a solution of equation (29), the coefficient of v in equation (31) is zero so that 
equation (31) becomes 


yy" + (2y, + py) v' = 0. (32) 


Despite its appearance, equation (32) is actually a first-order differential equation for the 
function v’ and can be solved either as a first-order linear equation or as a separable equation. 
Once v' has been found, then v is obtained by an integration. Finally, y is determined from 
equation (30). This procedure is called the method of reduction of order, because the crucial 
step is the solution of a first-order differential equation for v’ rather than the original second- 
order differential equation for y. Although it is possible to write down a formula for v(t), we 
will instead illustrate how this method works by an example. 


EXAMPLE 3 


Given that y,(t) = t7! is a solution of 


2y" +3ty’-y=0, t» 0, (33) 
find a fundamental set of solutions. 
Solution: 
We set y — v(t)t—!; then 
y= vtl — pt, y"= vt! —2y't? 4 ave. 
Substituting for y, y’, and y" in equation (33) and collecting terms, we obtain 
2r (vtl — Qv't? 207) + 3t(v't! — vt?) — ve! 
= 2tv" + (—4 + 3)v' + (4m! -3r !— t')y 
= 2tv" — y! = 0. (34) 
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Note that the coefficient of v is zero, as it should be; this provides a useful check on our algebraic 


calculations. 


If we let w = v’, then the second-order linear differential equation (34) reduces to the separable 
first-order differential equation 


2tw! —w = 0. 


Separating the variables and solving for w(t), we find that 


w(t) = v'(t) =cr!/?; 


then, one final integration yields 


It follows that 


2 
v(t) = 0^ +k. 


2 
y2wnDrz- Qm tC, (35) 


where c and k are arbitrary constants. The second term on the right-hand side of equation (35) is a 
multiple of y,(t) and can be dropped, but the first term provides a new solution y(t) = t!/?. You 
can verify that the Wronskian of y, and y; is 


WIy1, y2l(t) = a? #0 for t > 0. (36) 


Consequently, y, and y? form a fundamental set of solutions of equation (33) for t > 0. 


Problems 


In each of Problems 1 through 8, find the general solution of the given 
differential equation. 


1l. y”—2y’+y=0 
9y" c 6y' cy 20 
4y" — 4y' - 3y =0 
y" — 2y' - 10y 20 
y” — 6y’+9y =0 
4y" + 17y'’+4y =0 
16y" + 24y'+ 9y 20 
8. 2y”’+2y'+y=0 
In each of Problems 9 through 11, solve the given initial value 


problem. Sketch the graph of the solution and describe its behavior 
for increasing t. 


9. 9y" —12y' - 4y =0, 
10. y’—6y'+9y=0, y(0 20, y(0 22 
11. y"-4y-4y20, y(-10)-22, y(-1) 21 
12. Consider the following modification of the initial value problem 
in Example 2: 


Se oue D 


y(0) 22, y'(0) 2 -1 


Hn / y 
y'—y---.z0, y(0)=2, y(0) =b. 


4 
Find the solution as a function of b, and then determine the critical 
value of b that separates solutions that remain positive for all t > 0 
from those that eventually become negative. 


@ 13. Consider the initial value problem 


4y"+4y'+y=0, y(0)=1, y'(0) =2. 


a. Solve the initial value problem and plot the solution. 

b. Determine the coordinates (ty, yy) of the maximum point. 

C. Change the second initial condition to y'(0) = b > 0 and 

find the solution as a function of b. 

d. Find the coordinates (ty, yy) of the maximum point in terms 

of b. Describe the dependence of ty and yy on b as b increases. 
14. Consider the equation ay" + by’ + cy = 0. If the roots of the 
corresponding characteristic equation are real, show that a solution to 
the differential equation either is everywhere zero or else can take on 
the value zero at most once. 


Problems 15 through 17 indicate other ways of finding the second 

solution when the characteristic equation has repeated roots. 

15. a. Consider the equation y" + 2ay' + a?y = 0. Show that the 
roots of the characteristic equation are r} = r? = —a so that one 
solution of the equation is e ^'. 

b. Use Abel’s formula [equation (23) of Section 3.2] to show 
that the Wronskian of any two solutions of the given equation is 


W(t) 2 y(0y(0) — yD y2(t) = cie 7", 


where c, is a constant. 


C. Let y(t) = e™™ and use the result of part b to obtain 
a differential equation satisfied by a second solution yo(1). By 
solving this equation, show that y(t) = te ^. 
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16. Suppose that rı and r» are roots of ar? + br + c = 0 and that 
rı Æ r2; then exp(rıt) and exp(r;t) are solutions of the differential 
equation ay" + by’ + cy = 0. Show that 


is also a solution of the equation for r Æ r;. Then think of r; as fixed, 
and use l'Hópital's rule to evaluate the limit of ¢ (t; r1, r2) as r2 > ri, 
thereby obtaining the second solution in the case of equal roots. 


17. a. If ar? + br +c = 0 has equal roots r;, show that 


Lie") = ale)" + b(e")' + ce" 2 a(r =r)’ e". (37) 
Since the right-hand side of equation (37) is zero when r = r4, it 
follows that exp(r,1) is a solution of L[y] = ay" +by’+cy = 0. 
b. Differentiate equation (37) with respect to r, and interchange 
differentiation with respect to r and with respect to f, thus 
showing that 


= ate" (r — r1)? -2ae"(r —r). — (88) 


Since the right-hand side of equation (36) is zero when r = r4, 
conclude that t exp(r;1) is also a solution of L[y] = 0. 


In each of Problems 18 through 22, use the method of reduction of 
order to find a second solution of the given differential equation. 


18. ty’ —4ty’+6y=0, t» 0; y() =r 
19. fy’ +2ty’-2y=0, t>0; y(t) 5t 
20. ty"4+3ty’+y=0, t>0; y(t) =t! 
21. xy” — y +4xðy=0, x > 06; y(x) = sin(x?) 


22. 
23. 


xy" +xy'+(x?—0.25)y 20, x 2 6; y(x) = x I"? sinx 


The differential equation 
y" + 6(xy’+ y) 20 


arises in the study of the turbulent flow of a uniform stream past a 
circular cylinder. Verify that y,(x) = exp(—6x2/2) is one solution, 
and then find the general solution in the form of an integral. 

24. The method of Problem 15 can be extended to second-order 
equations with variable coefficients. If y; is a known nonvanishing 
solution of y” + p(t) y' + q(t) y = 0, show that a second solution y; 


3.5 


Undetermined Coefficients 
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satisfies ( y2/ y1)' = W[yi, yal y. where W [y;, y2] is the Wronskian 

of y, and y2. Then use Abel’s formula (equation (23) of Section 3.2) 

to determine y2. 

In each of Problems 25 through 27, use the method of Problem 24 to 

find a second independent solution of the given equation. 

25. y" 3ty py 20 t» 0; ye) =t! 

26. ty'—y'--APDy 20, t» 0; y(t) = sin(t?) 

27. x?y"+xy' +(x? -0.25)y 20, x > 0; 

Behavior of Solutions as ¢t — oo. Problems 28 through 30 are 

concerned with the behavior of solutions as t — oo. 

28. Ifa, b, and c are positive constants, show that all solutions of 

ay" + by’ + cy = 0 approach zero as t — oo. 

29. a. Ifa > O and c > 0, but b. = 0, show that the result of 
Problem 28 is no longer true, but that all solutions are bounded 
ast — oo. 
b. Ifa > O and b > 0, but c = 0, show that the result of 
Problem 28 is no longer true, but that all solutions 
approach a constant that depends on the initial conditions as 
t — oo. Determine this constant for the initial conditions 
(0) = yo, (0) = yo 


yix) = x12 sinx 


30. 


Show that y = sint is a solution of 
y" + (ksin t)y' + (1 — kcostsint)y = 0 


for any value of the constant k. If 0< k< 2, show that 
1 — kcostsint > O and ksin? t > 0. Thus observe that even though 
the coefficients of this variable-coefficient differential equation are 
nonnegative (and the coefficient of y' is zero only at the points 
t = 0,7,27, ...), it has a solution that does not approach zero 
ast — oo. Compare this situation with the result of Problem 28. 
Thus we observe a not unusual situation in the study of differential 
equations: equations that are apparently very similar can have quite 
different properties. 

Euler Equations. In each of Problems 31 through 34, use the 
substitution introduced in Problem 25 in Section 3.3 to solve the given 
differential equation. 


31. ?y'—3ty'--4y 20, t>0 
32. r?y" +2ty'+0.25y=0, t>0 
33. t?y”+3ty’+y=0, t>0 

34. 4t?y" —8ty'+9y=0, t»0 


Nonhomogeneous Equations; Method of 


We now turn our attention to the nonhomogeneous second-order linear differential equation 


Liy] = y" + p(t)y’ +4) y = g(t), 


(1) 


where p, q, and g are given (continuous) functions on the open interval 7. The equation 


Liy] = y"+ py’ - q(0y =9, 


(2) 
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in which g(t) = 0 and p and q are the same as in equation (1), is called the homogeneous 
differential equation corresponding to equation (1). The following two results describe the 
structure of solutions of the nonhomogeneous equation (1) and provide a foundation for 
constructing its general solution. 


Theorem 3.5.1 


If Y, and Y; are two solutions of the nonhomogeneous linear differential equation (1), then their 
difference Y, — Y; is a solution of the corresponding homogeneous differential equation (2). If, in 
addition, y, and y» form a fundamental set of solutions of equation (2), then 


Yi) — Y(t) = eciyi(1) + eyx (0), (3) 


where c, and c; are certain constants. 


To prove this result, note that Y; and Y; satisfy the equations 


LIV M(t) = g(t), LYI = g(t). (4) 
Subtracting the second of these equations from the first, we have 
LIYO — LPIA = g(t) — g(t) = 0. (5) 


However, 
L[Yi] — L[Y?] = LIY; — Y2], 
so equation (5) becomes 
LLY, — Yo\(t) = 0. (6) 


Equation (6) states that Y, — Y; is a solution of equation (2). Finally, since by Theorem 3.2.4 
all solutions of equation (2) can be expressed as linear combinations of a fundamental set of 
solutions, it follows that the solution Y; — Y»? can be so written. Hence equation (3) holds and 
the proof is complete. 


Theorem 3.5.2 


The general solution of the nonhomogeneous equation (1) can be written in the form 
y = O(t) = cyiG) + c2y2(t) + Y(t), (7) 


where y; and yz form a fundamental set of solutions of the corresponding homogeneous equation (2), 
c, and c» are arbitrary constants, and Y is any solution of the nonhomogeneous equation (1). 


The proof of Theorem 3.5.2 follows quickly from Theorem 3.5.1. Note that equation (3) 
holds if we identify Y, with an arbitrary solution o of equation (1) and Y; with the specific 
solution Y. From equation (3) we thereby obtain 


b(t) — Y(t) = ciy (2) + c2y2(t), (8) 


which is equivalent to equation (7). Since @ is an arbitrary solution of equation (1), the 
expression on the right-hand side of equation (7) includes all solutions of equation (1); thus it 
is natural to call it the general solution of equation (1). 

In somewhat different words, Theorem 3.5.2 states that to solve the nonhomogeneous 
equation (1), we must do three things: 


1. Find the general solution c; y;(1) + c2y2(t) of the corresponding homogeneous equation. 
This solution is frequently called the complementary solution and may be denoted by 
ye(t). 

2. Find any solution Y(t) of the nonhomogeneous equation. Often this solution is referred 
to as a particular solution. 

3. Form the sum of the functions found in steps 1 and 2. 


We have already discussed how to find y. (1), at least when the homogeneous equation 
(2) has constant coefficients. Therefore, in the remainder of this section and Section 3.6, we 
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will focus on finding a particular solution Y(t) of the nonhomogeneous linear differential 
equation (1). There are two methods that we wish to discuss. They are known as the method 
of undetermined coefficients (discussed here) and the method of variation of parameters (see 
Section 3.6). Each has some advantages and some possible shortcomings. 


Method of Undetermined Coefficients. The method of undetermined coefficients 
requires us to make an initial assumption about the form of the particular solution Y(t), 
but with the coefficients left unspecified. We then substitute the assumed expression into the 
nonhomogeneous differential equation (1) and attempt to determine the coefficients so as to 
satisfy that equation. If we are successful, then we have found a solution of the differential 
equation (1) and can use it for the particular solution Y(t). If we cannot determine the 
coefficients, then this means that there is no solution of the form that we assumed. In this 
case we may modify the initial assumption and try again. 

The main advantage of the method of undetermined coefficients is that it is straightforward 
to execute once the assumption is made about the form of Y(t). Its major limitation is 
that it is useful primarily for equations for which we can easily write down the correct 
form of the particular solution in advance. For this reason, this method is usually used 
only for problems in which the homogeneous equation has constant coefficients and the 
nonhomogeneous term is restricted to a relatively small class of functions. In particular, 
we consider only nonhomogeneous terms that consist of polynomials, exponential functions, 
sines, and cosines. Despite this limitation, the method of undetermined coefficients is useful for 
solving many problems that have important applications. However, the algebraic details may 
become tedious, and a computer algebra system can be very helpful in practical applications. 
We will illustrate the method of undetermined coefficients by several simple examples and 
then summarize some rules for using it. 


EXAMPLE 1 


Find a particular solution of 
y” — 3y' — 4y = 3e”. (9) 


Solution: 


We seek a function Y such that the combination Y"(t) — 3Y'(1) — 4Y (t) is equal to 3e?'. Since the 
exponential function reproduces itself through differentiation, the most plausible way to achieve the 
desired result is to assume that Y(t) is some multiple of e, 


Y(t) = Ae”, 


where the coefficient A is yet to be determined. To find A, we calculate the first two derivatives 
of Y: 


Y(t) =2Ae*, Y'(t) = AAe", 
and substitute for y, y’, and y" in the nonhomogeneous differential equation (9). We obtain 


Y" —3Y' — 4Y = (4A — 6A — 4A)e? = 3e”. 


1 
Hence —6Ae? must equal 3e”, so —6A = 3 and we conclude that A = > Thus a particular 


solution is 


Y(t) = le, (10) 
2 


EXAMPLE 2 


Find a particular solution of 


y" — 3y' — 4y = 2sint. (11) 
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Solution: 


By analogy with Example 1, let us assume that Y(t) = Asint, where A is a constant to be 
determined. On substituting this guess in equation (11) we obtain 


Y" — 3Y' — 4Y = —Asint — 3A cost — 4A sint = 2 sint, 


or, moving all terms to the left-hand side and collecting the terms involving sint and cost, we 
arrive at, 


(2+ 5A) sint + 3A cost = 0. (12) 
We want equation (12) to hold for all t. Thus it must hold for two specific points, such as t = 0 


andt = P At these points equation (12) reduces to 3A = 0 and 2 + 5A = 0, respectively. 


These contradictory requirements mean that there is no choice of the constant A that makes 


equation (12) true for t = 0 and t = > much less for all t. Thus we conclude that our assumption 


concerning Y (f) is inadequate. 
The appearance of the cosine term in equation (12) suggests that we modify our original 
assumption to include a cosine term in Y(t); that is, 


Y(t) — Asint 4 B cost, 
where A and B are the undetermined coefficients. Then 
Y'(t) = Acost — Bsint, Y"(t) = —Asint — B cost. 
By substituting these expressions for y, y', and y" in equation (11) and collecting terms, we obtain 


Y" — 3Y' — AY = (CA + 3B — 4A) sint +(—B — 3A — 4B) cost = 2sint. (13) 


Now, working exactly as with the first guess, move all terms to the left-hand side and evaluate t = 0 
and t = 3 to find that A and B must satisfy the equations 


—5A--3B—2-0, -3A—5B=0. 


5 3 
Solving these algebraic equations for A and B, we obtain A — — F and B = 17 hence a particular 


solution of equation (11) is 


5 3 
Y(t) = — py Sin + 1; Ost 


The method illustrated in the preceding examples can also be used when the right-hand 
side of the equation is a polynomial. Thus, to find a particular solution of 


y! — 3y' - 4y = 4? - 1, (14) 


we initially assume that Y (1) is a polynomial of the same degree as the nonhomogeneous term; 
that is, Y(t) = At? + Bt +C. 

To summarize our conclusions up to this point: if the nonhomogeneous term g(t) in 
differential equation (1) is an exponential function e^', then assume that Y(t) is proportional 
to the same exponential function; if g(t) is sin( 8t) or cos( Bt), then assume that Y(t) is a 
linear combination of sin( 8t) and cos( 8t); if g(t) is a polynomial of degree n, then assume 
that Y(t) is a polynomial of degree n. The same principle extends to the case where g(t) is a 
product of any two, or all three, of these types of functions, as the next example illustrates. 


EXAMPLE 3 


Find a particular solution of 


y" — 3y! — 4y = —8e! cos(2r). (15) 
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Solution: 


In this case we assume that Y(t) is the product of e' and a linear combination of cos(2r) and sin(2r); 
that is, 


Y(t) = Ae! cos(2t) + Be‘ sin(2r). 
The algebra is more tedious in this example, but it follows that 
Y'(t) = (A --2B)e' cos(2t) + (—2A + B)e' sin(2t) 
and 
Y"(r) = (—3A + AB)e' cos(2t) + (—4A —3B)e' sin(2r). 
By substituting these expressions in equation (15), we find that A and B must satisfy 


10A+2B=8, 2A-— 10B =0. 


10 2 
Hence A = 13 and B — EE therefore, a particular solution of equation (15) is 


Y(t) = a cos(2t) + 2 e sin(2t) 
13 13 l 


Now suppose that g(t) is the sum of two terms, g(t) = g;(t) + 22(t), and suppose that 
Y, and Y; are solutions of the equations 


ay" + by + cy = g(t) (16) 
and 

ay" + by’ + cy = g(t), (17) 
respectively. Then Y, + Y? is a solution of the equation 

ay" + by' + cy = g(t). (18) 


To prove this statement, substitute Yı(t) + Y2(t) for y in equation (18) and make use of 
equations (16) and (17). A similar conclusion holds if g(t) is the sum of any finite number of 
terms. The practical significance of this result is that for an equation whose nonhomogeneous 
function g(t) can be expressed as a sum, you can consider instead several simpler equations 
and then add together the results. The following example is an illustration of this procedure. 


EXAMPLE 4 


Find a particular solution of 


y” — 3y' — 4y = 3e? + 2sint — 8e! cos(2r). (19) 


Solution: 
By splitting up the right-hand side of equation (19), we obtain the three equations 
y" -3y' — 4y = 3e™, 
y" — 3y! — 4y = 2sint, 
and 
y" — 3y' — 4y = —8e' cos( 2t). 


Solutions of these three equations have been found in Examples 1, 2, and 3, respectively. Therefore, 
a particular solution of equation (19) is their sum, namely, 


1 3 5 10 2 
Y(t) = s” + T cost a sint + ad cos(2t) + Ta sin(2r). 


The procedure illustrated in these examples enables us to solve a fairly large class of 
problems in a reasonably efficient manner. However, there is one difficulty that sometimes 
occurs. The next example illustrates how it arises. 
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EXAMPLE 5 


Find a particular solution of 


y" —3y' —4y = 2e™. (20) 


Solution: 


Proceeding as in Example 1, we assume that Y(t) = Ae~‘. By substituting in equation (20), we 
obtain 


Y" — 3Y' - AY 2 (A-3A — AA)e '! = 2e”. (21) 


Since the left-hand side of equation (21) is zero, there is no choice of A for which 0 = 2e™. 
Therefore, there is no particular solution of equation (20) of the assumed form. The reason for this 
possibly unexpected result becomes clear if we solve the homogeneous equation 


y'—3y' —4y 20 (22) 


that corresponds to equation (20). The two functions in a fundamental set of solutions of equation 
(22) are y;(t) = e~ and y(t) = e*. Thus our assumed particular solution of equation (20) is 
actually a solution of the homogeneous equation (22); consequently, it cannot possibly be a solution 
of the nonhomogeneous equation (20). To find a solution of equation (20), we must therefore consider 
functions of a somewhat different form. 

At this stage, we have several possible alternatives. One is simply to try to guess the proper form 
of the particular solution of equation (20). Another is to solve this equation in some different way and 
then to use the result to guide our assumptions if this situation arises again in the future; see Problems 
22 and 27 for other solution methods. Still another possibility is to seek a simpler equation where 
this difficulty occurs and to use its solution to suggest how we might proceed with equation (20). 
Adopting the latter approach, we look for a first-order equation analogous to equation (20). One 
possibility is the linear equation 


y 4 y =e. (23) 


=f t 


If we try to find a particular solution of equation (23) of the form Ae", we will fail because e~‘ is 
a solution of the homogeneous equation y’ + y = 0. However, from Section 2.1 we already know 
how to solve equation (23). An integrating factor is u(t) = e', and by multiplying by u(t) and then 
integrating both sides, we obtain the solution 


y 22te ^! t ce f. (24) 
The second term on the right-hand side of equation (24) is the general solution of the homogeneous 
equation y’ + y = 0, but the first term is a solution of the full nonhomogeneous equation (23). 
Observe that it involves the exponential factor e^' multiplied by the factor t. This is the clue that we 
were looking for. 
t 


We now return to equation (20) and assume a particular solution of the form Y(t) = Ate '. 
Then 


Y(t) = Ae"! — Ate™, Y"(t) = —2Ae^ + Ate. Q5) 
Substituting these expressions for y, y', and y" in equation (20), we obtain 


Y" — 3Y' - 4Y 2 (2A — 3A)e™ - (A 3A — AA)te! —2e f. 


2 
The coefficient of te~' is zero, and from the terms involving e' we have —5A = 2, so A = — 5" 


Thus a particular solution of equation (20) is 


Y(t) = ~ ite (26) 


The outcome of Example 5 suggests a modification of the principle stated previously: 
if the assumed form of the particular solution duplicates a solution of the corresponding 
homogeneous equation, then modify the assumed particular solution by multiplying it by t. 
Occasionally, this modification will be insufficient to remove all duplication with the solutions 
of the homogeneous equation, in which case it is necessary to multiply by ¢ a second time. For 
a second-order equation, it will never be necessary to carry the process further than this. 


Summary. We now summarize the steps involved in finding the solution of an initial value 
problem consisting of a nonhomogeneous linear differential equation of the form 


ay" + by + cy = g(t), (27) 
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where the coefficients a, b, and c are constants, together with a given set of initial conditions. 


1. Find the general solution of the corresponding homogeneous equation. 

2. Make sure that the function g(t) in equation (27) belongs to the class of functions 
discussed in this section; that is, be sure it involves nothing more than exponential 
functions, sines, cosines, polynomials, or sums or products of such functions. If this is 
not the case, use the method of variation of parameters (discussed in Section 3.6). 


3. If g(t) = gi(f) + --- + gn(t) —that is, if g(t) is a sum of n terms—then form n 
subproblems, each of which contains only one of the terms gi(1), ... , gn(t). The i : 
subproblem consists of the equation 


ay" + by + cy = gi(t), 


where i runs from | to n. 


4. For the i^ subproblem, assume a particular solution Y;(1) consisting of the appropriate 
exponential function, sine, cosine, polynomial, or combination thereof. If there is any 
duplication in the assumed form of Y;(t) with the solutions of the homogeneous equation 
(found in step 1), then multiply Y;(t) by t, or (if necessary) by t?, so as to remove the 
duplication. See Table 3.5.1. 


TABLE 3.5.1 The Particular Solution of ay" + by’ + cy = gi(t) 


gi(t) Y; (t) 
Py(t) = aot” t ait" +--+ an PCAor" + Ast! +--+ An) 
P,(t)e™! PCAot" + At"! +--+ An) e"! 
i t 
HOES i (CAor" + Aye?! +--+ As) e?! cos( B) 


+ (Bot” + Bit”! +--+ + By)e?! sin( Bt) 


Notes: Here, s is the smallest nonnegative integer (s = 0, 1, or 2) that will ensure 
that no term in Y;(t) is a solution of the corresponding homogeneous equation. 
Equivalently, for the three cases, s is the number of times 0 is a root of the 
characteristic equation, œ is a root of the characteristic equation, and a + iĝ isa 
root of the characteristic equation, respectively. 


5. Find a particular solution Y;(t) for each of the subproblems. Then Y(t) +---+ Y,(t) is 
a particular solution of the full nonhomogeneous equation (27). 

6. Form the sum of the general solution of the homogeneous equation (step 1) and the 
particular solution of the nonhomogeneous equation (step 5). This is the general solution 
of the nonhomogeneous equation. 


7. When initial conditions are provided, use them to determine the values of the arbitrary 
constants remaining in the general solution. 


For some problems this entire procedure is easy to carry out by hand, but often the 
algebraic calculations are lengthy. Once you understand clearly how the method works, a 
computer algebra system can be of great assistance in executing the details. 

The method of undetermined coefficients is self-correcting in the sense that if you assume 
too little for Y(t), then a contradiction is soon reached that usually points the way to the 
modification that is needed in the assumed form. On the other hand, if you assume too many 
terms, then some unnecessary work is done and some coefficients turn out to be zero, but at 
least the correct answer is obtained. 


Proof of the Method of Undetermined Coefficients. In the preceding discussion we have 
described the method of undetermined coefficients on the basis of several examples. To prove 
that the procedure always works as stated, we now give a general argument, in which we 
consider three cases corresponding to different forms for the nonhomogeneous term g(t). 


Case 1: g(t) = P,(t) = aot” + at”! +-+- +a. In this case equation (27) becomes 


ay" + by + cy = agt" - at" 1 +--+ an. (28) 
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To obtain a particular solution, we assume that 
YD est E Aut 424 ut +A tds (29) 
Substituting in equation (28), we obtain 
a (ntn E EK 2An-2) Pb +--+ Ani) 
+ c(Aot" + Ait"! +++ An) = aot" +--+ H an. (30) 


Equating the coefficients of like powers of t, beginning with t”, leads to the following sequence 
of equations: 
CÀQ = a, 
cA; + nb A = d], 


CÅAn DA, , 24A, 5 = ag. 


Provided that c # 0, the solution of the first equation is Ag = ao/c, and the remaining 
equations determine A;, ... , A, successively. 

If c = 0 but b ¥ 0, then the polynomial on the left-hand side of equation (30) is of degree 
n — 1, and we cannot satisfy equation (30). To be sure that aY”(t) + bY'(t) is a polynomial 
of degree n, we must choose Y(t) to be a polynomial of degree n + 1. Hence we assume that 


Y(t) = t( Aot” E + Ap) 
Substituting this guess into equation (28), with c = 0, and simplifying yields 
aY" +bY' = bAgUn + 1)t" + (aAo(n + Da EbAqu)t +- 
= aot” tat” | +--+ an. 

There is no constant term in this expression for Y(t), but there is no need to include such a 
term since a constant is a solution of the homogeneous equation when c = 0. Since b 0, we 
find Ay = ao/ (b(n4- 1)), and the other coefficients Aj, ... , A, can be determined similarly. 

If both c and b are zero, then the characteristic equation is ar? = Oandr = Qisa 


repeated root. Thus y; = e% = 1 and y; = te” = t form a fundamental set of solutions of the 
corresponding homogeneous equation. This leads us to assume that 


Y(1) mi^ Ast? ER A. 


The term aY"(t) gives rise to a term of degree n, and we can proceed as before. Again the 
constant and linear terms in Y (t) are omitted since, in this case, they are both solutions of the 
homogeneous equation. 


Case 2: g(t) = e"'P,(f). The problem of determining a particular solution of 

ay" + by' + cy = e"! P (t) Q1) 
can be reduced to the preceding case by a substitution. Let 

Y(t) He" ut); 

then 

Y'(1) = e"'(u(t) + aut) 
and 

Y"(t) =e“ (u(t) + 2ou'(t) + o?u(t)). 


Substituting for y, y', and y" in equation (31), canceling the factor e^', and collecting terms, 
we obtain 


au" (t) + (2aa + b)u'(t) + (ao? + ba + dult) = P,(t). (32) 


The determination of a particular solution of equation (32) is precisely the same problem, 
except for the names of the constants, as solving equation (28). Therefore, if ao? + ba +c is 
not zero, we assume that u(t) = Aot" +- - -+ An; hence a particular solution of equation (31) 
is of the form 


Y(t) 2 e*'( Aot" + Aut" | +--+ + An). (33) 
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On the other hand, if aa? + ba +c is zero but 2aa + b is not, we must take u(t) to be 
of the form t ( Apt” +---+A,,). The corresponding form for Y(t) is t times the expression on 
the right-hand side of equation (33). Note that if aa? + ba +c is zero, then e is a solution 
of the homogeneous equation. 

If both aa? + ba +c and 2aa + b are zero (and this implies that both e^ and te’ are 
solutions of the homogeneous equation), then the correct form for u(t) is t?( Aot" 4- - - --E A). 
Hence Y (t) is t£? times the expression on the right-hand side of equation (33). 


Case 3: g(t) = e^'P, (t) cos ( Gt) ore^'P, (t) sin ( Gt). These two cases are similar, so we 
consider only the latter. We can reduce this problem to the preceding one by noting that, as a 
consequence of Euler's formula, sin( 91) = (etf! — e-/?') / (2i). Hence g(t) is of the form 


e O9 mu eg 0 7 i9t 


2i 


g(t) = P,(t) 


, 


and we should choose 
Y(t) = e D'CAor" E... + An) + OP" Bot" +--+ + By), 
or, equivalently, 
Y(t) = e" (Aot +--+ + An) cos(Bt) + e?' (ü Bot" +--+ + Bn) sin( St). 


Usually, the latter form is preferred because it does not involve the use of complex-valued 
coefficients. If a + iĝ satisfy the characteristic equation corresponding to the homogeneous 
equation, we must, of course, multiply each of the polynomials by f to increase their degrees 
by 1. 

If the nonhomogeneous function involves both cos(t) and sin( 6t), it is usually 
convenient to treat these terms together, since each one individually may give rise to the same 
form for a particular solution. For example, if g(t) = t sint --2cost, the form for Y(t) would 
be 


Y(t) = (Aot + Aj) sint + (Bot + Bj) cost, 


provided that sin t and cos f are not solutions of the homogeneous equation. 


Problems 


In each of Problems 1 through 10, find the general solution of the given 15. 
differential equation. 


1. y"—2y' — 3y = 3e” 


y" +2y' + 5y = 4e cos(2r), 


In each of Problems 16 through 21: 
a. Determine a suitable form for Y(t) if the method of 


y(0) 21, y'(0) =0 


y! — y! — 2y = 2t + AC 

y" + y' — 6y = 12e + 12e? 
y! — 2y! — 3y = —3te^! 

y" -- 2y! 2 3 + Asin(2t) 

y" +2y' +y =2e% 

y" + y = 3sin(2f) + t cos(2t) 


2 2 2 
u" + wou = cos(wt), w° Fwy 


O 2INaAnaYwD 


u" + wou = cos(wof) 
10. y”+y'+4y =2sinht Hint: sinht = (e — e )/2 


In each of Problems 11 through 15, find the solution of the given initial 
value problem. 


11. y"4-y'—-2y 22t, y(0) 20, y(0 =1 
12. y"-4y 2 -3e, y(0) =0, y'(0) =2 
13. y"—2y'-y-te' +4, y(0)=1, y(0)—1 
14. y’+4y —-3sin(2?), y(0) =2, y(0)——1 


undetermined coefficients is to be used. 
O b. Use a computer algebra system to find a particular solution 
of the given equation. 


16. y"--3y!' = 204 + 2e + sin(3t) 

17. y"—5y' + 6y = e' cos(21) + e" (3t +4) sint 

18. y'"--2y! + 2y = 3e™ + 2e™ cost + 4e ^! t? sint 

19. y'"--4y = P sin(2r) + (6t + 7) cos(2r) 

20. y” +3y’ + 2y = e! (t? + 1) sin(21) + 3e™ cost + 4e! 
21. y"-rF2y!--5y = 3te™ cos(2r) — 2te~* cost 

22. Consider the equation 


y" — 3y!' - 4y = 2e7 (34) 


from Example 5. Recall that y(t) = e™ and y(t) = e^ are 
solutions of the corresponding homogeneous equation. Adapting the 
method of reduction of order (Section 3.4), seek a solution of the 
nonhomogeneous equation of the form Y(t) = v(t) y(t) = v(t)e™! 
where v(t) is to be determined. 


, 
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a. Substitute Y(t), Y'(1), and Y’(t) into equation (34) and 
show that v(t) must satisfy v" — 5v’ = 2. 

b. Let w(t) = v(t) and show that w(t) satisfies w’ — 5w = 2. 
Solve this equation for w(t). 

C. Integrate w(t) to find v(t) and then show that 


2 1 
Y(t) = —zte* + soe" t ce. 


The first term on the right-hand side is the desired particular 
solution of the nonhomogeneous equation. Note that it is a 


product of t and e™. 


23. Determine the general solution of 
N 
y+ Ay = M am sinn), 
m=1 
where \ > OandA Z mz form=1,...,N. 


@ 24. In many physical problems the nonhomogeneous term may 
be specified by different formulas in different time periods. As an 
example, determine the solution y = o (t) of 


y! y- t, O<t<7, 
: D |) pet, tm, 


satisfying the initial conditions y(0) = 0 and y'(0) = 1. Assume that 
y and y’ are also continuous at t = 7. Plot the nonhomogeneous term 
and the solution as functions of time. Hint: First solve the initial value 
problem for t < 7 ; then solve for t > 7, determining the constants in 
the latter solution from the continuity conditions at t = 7. 


Behavior of Solutions as t — oo. In Problems 25 and 26, we 
continue the discussion started with Problems 28 through 30 of Section 
3.4. Consider the differential equation 


ay" + by' + cy = g(t), (35) 


where a, b, and c are positive. 


25. If Y(t) and Y3(t) are solutions of equation (35), show that 
Y\(t) — Yo(t) — 0 as t — œ. Is this result true if b = 0? 


26. Ifg(t) = d, aconstant, show that every solution of equation (35) 
approaches d/c as t — oo. What happens if c = 0? What if b = 0 
also? 


27. Inthis problem we indicate an alternative procedure? for solving 
the differential equation 


y" + by’ +cy 2 (D E bD t c)y = g(t), (36) 
where b and c are constants, and D denotes differentiation with respect 
to t. Let rj and rz be the zeros of the characteristic polynomial of the 
corresponding homogeneous equation. These roots may be real and 
different, real and equal, or conjugate complex numbers. 

a. Verify that equation (36) can be written in the factored form 


(D —ri)(D = r2)y = g(t), 


where r; +r = —b and rir; — c. 

b. Letu = (D — r;)y. Then show that the solution of 
equation (36) can be found by solving the following two first- 
order equations: 


(D =rı)u =8(t), (D-r)y-ut). 


In each of Problems 28 through 30, use the method of Problem 27 to 
solve the given differential equation. 


28. y” —3y’ — 4y =3e” (see Example 1) 
29. y"+2y’+y =2e (see Problem 6) 
30. y” +2y’ = 3 + 4sin(2t) (see Problem 5) 


SR. S. Luthar, “Another Approach to a Standard Differential Equation,” 
Two Year College Mathematics Journal 10 (1979), pp. 200-201. Also see 
D. C. Sandell and F. M. Stein, “Factorization of Operators of Second-Order 
Linear Homogeneous Ordinary Differential Equations,” Two Year College 
Mathematics Journal 8 (1977), pp. 132-141, for a more general discussion 
of factoring operators. 


36 Variation of Parameters 


In this section we describe a second method of finding a particular solution of a 
nonhomogeneous equation. This method, variation of parameters, is due to Lagrange and 
complements the method of undetermined coefficients rather well. The main advantage of 
variation of parameters is that it is a general method; in principle at least, it can be applied 
to any equation, and it requires no detailed assumptions about the form of the solution. In 
fact, later in this section we use this method to derive a formula for a particular solution of 
an arbitrary second-order linear nonhomogeneous differential equation. On the other hand, the 
method of variation of parameters eventually requires us to evaluate certain integrals involving 
the nonhomogeneous term in the differential equation, and this may present difficulties. Before 
looking at this method in the general case, we illustrate its use in an example. 


EXAMPLE 1 


Find the general solution of 


y’+4y =8tant —7/2<1t< 7/2. (1) 


Solution: 


Observe that this problem is not a good candidate for the method of undetermined coefficients, as 
described in Section 3.5, because the nonhomogeneous term g(t) = 8 tant involves a quotient (rather 
than a sum or a product) of sin t and cos t. Therefore, the method of undetermined coefficients cannot 
be applied; we need a different approach. 

Observe also that the homogeneous equation corresponding to equation (1) is 


y" +4y — 0, (2) 


and that the general solution of equation (2) is 
ye(t) = cı cos(2t) + c2 sin(2r). (3) 


The basic idea in the method of variation of parameters is similar to the method of reduction of 
order introduced at the end of Section 3.4. In the general solution (3), replace the constants c, and c; 
by functions u(t) and u(t), respectively, and then determine these functions so that the resulting 
expression 


y = u,(t) cos(2t) + u2(t) sin(2t) (4) 


is a solution of the nonhomogeneous equation (1). 

To determine u; and u2, we need to substitute for y from equation (4) in differential equation (1). 
However, even without carrying out this substitution, we can anticipate that the result will be a single 
equation involving some combination of u;, u2, and their first two derivatives. Since there is only 
one equation and two unknown functions, we can expect that there are many possible choices of 
uy, and u, that will meet our needs. Alternatively, we may be able to impose a second condition of 
our own choosing, thereby obtaining two equations for the two unknown functions u; and u2. We 
will soon show (following Lagrange) that it is possible to choose this second condition in a way that 
makes the computation markedly more efficient.? 

Returning now to equation (4), we differentiate it and rearrange the terms, thereby obtaining 


y' = —2u, (t) sin(21) + 2u9(t) cos(21) + u (t) cos(21) + u5(t) sin(21). (5) 


Keeping in mind the possibility of choosing a second condition on u, and u», let us require the sum 
of the last two terms on the right-hand side of equation (5) to be zero; that is, we require that 


u^ (t) cos(2t) + u5(t) sin(2r) = 0. (6) 
It then follows from equation (5) that 
y! = —2u (t) sin(2t) + 2u2(t) cos(2r). (7) 


Although the ultimate effect of the condition (6) is not yet clear, the removal of the terms involving 
u, and u, has simplified the expression for y". Further, by differentiating equation (7), we obtain 


y” = —4u (t) cos(2t) — 4u(t) sin(2t) — 2u‘ (t) sin(2r) + 2u5(t) cos(2r). (8) 
Then, substituting for y and y" in equation (1) from equations (4) and (8), respectively, we find that 


y" + 4y = — 4u,(t) cos(2t) — 4u(t) sin(2t) — 2u (t) sin(2t) + 2u5(t) cos(2t) 
+ Au4(t) cos(2t) + 4u2(t) sin(2t) = 8 tant. 


Hence u; and uz must satisfy 
—2u' (t) sin(2t) + 2u5(1) cos(2t) = 8tanr. (9) 


Summarizing our results to this point, we want to choose u, and u» so as to satisfy equations (6) 
and (9). These equations can be viewed as a pair of linear algebraic equations for the two unknown 
quantities ui (t) and us(t) . Equations (6) and (9) can be solved in various ways. For example, solving 
equation (6) for us(t), we have 

cos(2t) 
(f£) = -u (t) — — . 10 
u(t) ui ) inr) (10) 
Then, substituting for u(t) in equation (9) and simplifying, we obtain 
8tan t sin(2r) i 


u$ (t) = 5 = —8 sin? t. (41) 


?An alternate, and more mathematically appealing, derivation of the second condition can be found in Problems 
17 to 19 in Section 7.9. 


3.6 Variation of Parameters 


143 


144 


CHAPTER 3 Second-Order Linear Differential Equations 


Further, putting this expression for u(t) back in equation (10) and using the double-angle formulas, 
we find that 


f 8 sin? t cos(2r) sint(2cos? t — 1) . 1 

ust) = - =4 = 4sint | 2cost — —— |. (12) 

2 
sin(2t) cost cos t 


Having obtained u(t) and u(t), we next integrate so as to find u(t) and u2(t). The result is 
u(t) = 4sint cost — 4t + c4 (13) 
and 
u(t) = 4ln(cost) — 4cos? t + c5. (14) 
On substituting these expressions in equation (4), we have 
y = (4sint cost) cos(2t) + (4In(cost) — 4cos? t) sin(2t) + cı cos(2t) + c5 sin(2t). 
Finally, by using the double-angle formulas once more, we obtain 


y = —2sin(2t) — 4t cos( 2t) + AIn(cos t) sin(2t) + c; cos(2t) + cz sin(2t). (15) 


The terms in equation (15) involving the arbitrary constants c, and c are the general solution of the 
corresponding homogeneous equation, while the other three terms are a particular solution of the 
nonhomogeneous equation (1). Thus equation (15) is the general solution of equation (1). 


The particular solution identified at the end of Example 1 corresponds to choosing both 
cı, and c» to be zero in equation (15). Any other choice of c, and c; is also a particular solution 
of the same nonhomogeneous differential equation. Notice, in particular, that choosing c, = 0 
and c; — 2 in equation (15) yields a particular solution with only two terms: 


—4t cos(2t) + 4In(cos t) sin(2r). 


We conclude this first look at the method of variation of parameters with the observation that 
the particular solution involves terms that might be difficult to anticipate. This explains why 
the method of undetermined coefficients is not a good candidate for this problem, and why the 
method of variation of parameters is needed. 

In the preceding example the method of variation of parameters worked well in 
determining a particular solution, and hence the general solution, of equation (1). The next 
question is whether this method can be applied effectively to an arbitrary equation. Therefore, 
we consider 


y" + p(t)y’ t q()y = a(t), (16) 
where p, q, and g are given continuous functions. As a starting point, we assume that we know 
the general solution 


Ye(t) = cryi(t) + cry2(t) (17) 
of the corresponding homogeneous equation 
y" + p(t)y’ t q()y = 0. (18) 


This is a major assumption. So far we have shown how to solve equation (18) only if it 
has constant coefficients. If equation (18) has coefficients that depend on f, then usually the 
methods described in Chapter 5 must be used to obtain ye(t). 

The crucial idea, as illustrated in Example 1, is to replace the constants c, and c; in 
equation (17) by functions u(t) and u»(t), respectively; thus we have 


y — u(t) y(t) + u(t) yx (t). (19) 


Then we try to determine u(t) and u2(t) so that the expression in equation (19) is a solution 
of the nonhomogeneous equation (16) rather than the homogeneous equation (18). Thus we 
differentiate equation (19), obtaining 


y =u (yE) + u(t) y(t) + wy yo(t) + wy). (20) 


As in Example 1, we now set the terms involving ut) and u(t) in equation (20) equal to 
zero; that is, we require that 


wy) + us) yaQ) = 0. (21) 
Then, from equation (20), we have 
y'= w(t) yA) + u(t) yA). Q2) 
Further, by differentiating again, we obtain 
y" =u) y (0) + u(y) HUY) + walt ys). Q3) 


Now we substitute for y, y’, and y" in equation (16) from equations (19), (22), and (23), 
respectively. After rearranging the terms in the resulting equation, we find that 


uo (i) + OMOR ONMO) 


+ ur) (YC) + PCY) + aya) ) 
+ uy (y(t) + wy) = g(t). (24) 


Each of the expressions in parentheses in the first two lines of equation (24) is zero because 
both y; and yz are solutions of the homogeneous equation (18). Therefore, equation (24) 
reduces to 


ui y) + u5(t) yy) = gn. Q5) 


Equations (21) and (25) form a system of two linear algebraic equations for the derivatives 
uj) and u(t) of the unknown functions. They correspond exactly to equations (6) and (9) 
in Example 1. 

Solving the system of equations (21), (25), we obtain 


t t 
RZOGOES üO = ——————, (26) 
WIy1, y21(0) WIy1, y2](t) 
where W[y;, y2] is the Wronskian of y; and y2. Note that division by W[y,, y2] is permissible 


since y; and y» are a fundamental set of solutions, and therefore their Wronskian is nonzero. 
By integrating equations (26), we find the desired functions u(t) and u2(t), namely, 


__ f gH B yi(t) g(t) 
=| vial ^ v BAR wee se — 


If the integrals in equations (27) can be evaluated in terms of elementary functions, then we 
substitute the results in equation (19), thereby obtaining the general solution of equation (16). 
More generally, the solution can always be expressed in terms of integrals, as stated in the 
following theorem. 


Theorem 3.6.1 


Consider the nonhomogeneous second-order linear differential equation 


y" + p(t)y' +q(t)y = g(t). (28) 


jes ida 


If the functions p, q, and g are continuous on an open interval 7, and if the functions y, and yz form 
a fundamental set of solutions of the corresponding homogeneous equation 


y" + p(t)y’+q(thy — 0, (29) 


then a particular solution of equation (28) is 


yo(s) gCs) yis) g(s) 
Y(t) 2— + , 30 
T vof Won xi) 7^ sn f WS mu 


where fto is any conveniently chosen point in 7. The general solution is 
y=cyi(t) + c2y2(t) + Y(t), (31) 


as prescribed by Theorem 3.5.2. 
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By examining the expression (30) and reviewing the process by which we derived it, 
we can see that there may be two major difficulties in carrying out the method of variation 
of parameters. As we have mentioned earlier, one is the determination of functions y,(t) 
and y(t) that form a fundamental set of solutions of the homogeneous equation (29) when 
the coefficients in that equation are not constants. The other possible difficulty lies in the 
evaluation of the integrals appearing in equation (30). This depends entirely on the nature of the 
functions y1, y2, and g. In using equation (30), be sure that the differential equation is exactly 
in the form (28); otherwise, the nonhomogeneous term g(t) will not be correctly identified. 

A major advantage of the method of variation of parameters is that equation (30) provides 
an expression for the particular solution Y(t) in terms of an arbitrary forcing function g(t). 
This expression is a good starting point if you wish to investigate the effect of variations in the 
forcing function, or if you wish to analyze the response of a system to a number of different 
forcing functions. (See Problems 18 to 22.) 


Problems 


In each of Problems 1 through 3, use the method of variation of 
parameters to find a particular solution of the given differential 
equation. Then check your answer by using the method of 
undetermined coefficients. 


1l. y” — 5y + 6y = 2e 
2. y"— y — 2y = 2e” 
3. Ay" — Ay! +y = 16e'? 


In each of Problems 4 through 9, find the general solution of the 
given differential equation. In Problems 9, g is an arbitrary continuous 
function. 


4. y" yc-tant, 
y"! + 9y — 9 sec? (3t), 
y" + Ay! +4y = t 2e, 
4y" + y =2sec(t/2), 
y' -2y «ys ef?) 
9. y"—5y' + 6y = g(t) 

In each of Problems 10 through 15, verify that the given functions 
y, and y satisfy the corresponding homogeneous equation; then 


find a particular solution of the given nonhomogeneous equation. In 
Problems 14 and 15, g is an arbitrary continuous function. 


10. ry” —2y = 3i? — 1, WA 22, y(t) =27! 

11. ?y"—t(r-2)y -(t-2y 220, t > 0; 

yilt) =t, y(t) = te 

12. ry"—(1-0y'-y = te”, t» 0; y(t) 21-4, y(t) =e! 
13. x?2y" —3xy’+4y = x? lnx, x > 0; 
yo(x) = x?Inx 


0<t< 7/2 
0<t< 7/6 
t>0 
—m«t«m 


9e Sa QN Ut 


t > 0; 


yax) mat. 


14. x?y" 4+ xy' +(x? — py = 3x?) sinx, x > 0; 
yi GO = x 72 —1/2 
15. xy" xy + (x? — 025)y = g(x), x > 0; 
yi (x) = x V? sinx, y(x) = x712 
16. By choosing the lower limit of integration in equation (30) in the 
text as the initial point tọ, show that Y(t) becomes 
Y(t) = n yils) y2(t) — yi) yos) 
iy CD XG) = X G0) 2G) 


sinx, y(x) =x cos x 


COS X 


g(s) ds. 


Show that Y(t) is a solution of the initial value problem 


Lily] = g(t), y(t) =9, y'(to) =0. 


17. Show that the solution of the initial value problem 


Liy] = y”  p(t)y' t q(0y =8(t), y(to) = yo. y (t9) = yo 


(32) 
can be written as y = u(t) + v(t), where u and v are solutions of the 
two initial value problems 


L[u] — 0, 
Liv] = g(t), 


u'(to) = Yo. (33) 
v'(to) = 0, (34) 


u(to) = yo. 
v(to) = 0, 


respectively. In other words, the nonhomogeneities in the differential 

equation and in the initial conditions can be dealt with separately. 

Observe that u is easy to find if a fundamental set of solutions of 

L[u] = 0 is known. And, as shown in Problem 16, the function v 

is given by equation (30). 

18. a. Use the result of Problem 16 to show that the solution of the 
initial value problem 


y'cty-gü) Y) =0, y(t) =0 (35) 


y= J sin(t — s)g(s) ds. (36) 
to 


b. Use the result of Problem 17 to find the solution of the initial 
value problem 


y"+y=a(t), y(0)=yo, y(0) = yo. 


19. Use the result of Problem 16 to find the solution of the initial 
value problem 


Lily] = g(t), y(to) = 0, y'(to) =0, 


where L[y] = ( D —a) ( D — D) y for real numbers a and b with a Æ b. 
Note that L[y] = y" — (a + b) y' + aby. 

20. Use the result of Problem 16 to find the solution of the initial 
value problem 


Lily] = g(t), y(to) —0, y'(to) — 0, 


where L[y] = (D — (A + ip))(D — (A — ip))y; that is, 
L[y] = y"—2Ay’'+(A?+17) y. Note that the roots of the characteristic 
equation are \ tip. 


21. Use the result of Problem 16 to find the solution of the initial 
value problem 


Lily] = g(t), y(to) =0, y'(to) =0, 


where L[y] = (D — a)?y, that is, L[y] = y" — 2ay’ + a?y, and a is 
any real number. 


22. By combining the results of Problems 19 through 21, show that 
the solution of the initial value problem 


Liy] 2 (D? -bD t c)y 2 g(t), y(t) =0, y(t) =0, 


where b and c are constants, can be written in the form 
t 
y=¢(t) x K(t — s) g(s)ds, (37) 
to 


where the function K depends only on the solutions y, and y; of 
the corresponding homogeneous equation and is independent of the 
nonhomogeneous term. Once K is determined, all nonhomogeneous 
problems involving the same differential operator L are reduced to 
the evaluation of an integral. Note also that although K depends on 
both ¢ and s, only the combination t — s appears, so K is actually a 
function of a single variable. When we think of g(t) as the input to 
the problem and of o (t) as the output, it follows from equation (37) 
that the output depends on the input over the entire interval from the 
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initial point fo to the current value t. The integral in equation (37) is 
called the convolution of K and g, and K is referred to as the kernel. 


23. The method of reduction of order (Section 3.4) can also be used 
for the nonhomogeneous equation 


y" + p()y' c q(0ny = g(t), (38) 


provided one solution y, of the corresponding homogeneous equation 
is known. Let y = v(t) y;(t) and show that y satisfies equation (38) 
if v is a solution of 


yt)” (2x00 + p(t) y(t)’ = g(t). (39) 


Equation (39) is a first-order linear differential equation for v’. By 
solving equation (39) for v’, integrating the result to find v, and 
then multiplying by y,(t), you can find the general solution of 
equation (38). This method simultaneously finds both the second 
homogeneous solution and a particular solution. 


In each of Problems 24 through 26, use the method outlined in Problem 
23 to solve the given differential equation. 


24. ty” —2ty’+2y=4, t>0; y(t) =t 
25. t?y"47ty’+5y=t, t>0; y(t) =t! 
26. ty' —(1-Dy +y = te”, t>0; y(t) 21-t (see 


Problem 12) 


3.7 Mechanical and Electrical Vibrations 


One of the reasons why second-order linear differential equations with constant coefficients 
are worth studying is that they serve as mathematical models of many important physical 
processes. Two important areas of application are the fields of mechanical and electrical 
oscillations. For example, the motion of a mass on a vibrating spring, the torsional oscillations 
of a shaft with a flywheel, the flow of electric current in a simple series circuit, and many other 
physical problems are all described by the solution of an initial value problem of the form 


ay" + by’ + cy = g(t), 


y(0) = yo, y'(0) = yo. (1) 


This illustrates a fundamental relationship between mathematics and physics: many 
physical problems may have mathematically equivalent models. Thus, once we know how to 
solve the initial value problem (1), it is only necessary to make appropriate interpretations of 
the constants a, b, and c, and of the functions y and g, to obtain solutions of different physical 


problems. 


We will study the motion of a mass on a spring in detail because understanding the 
behavior of this simple system is the first step in the investigation of more complex vibrating 
systems. Further, the principles involved are common to many problems. 

Consider a mass m hanging at rest on the end of a vertical spring of original length /, 
as shown in Figure 3.7.1. The mass causes an elongation L of the spring in the downward 
(positive) direction. In this static situation there are two forces acting at the point where the 
mass is attached to the spring; see Figure 3.7.2. The gravitational force, or weight of the mass, 
acts downward and has magnitude w = mg, where g is the acceleration due to gravity. There 
is also a force F,, due to the spring, that acts upward. If we assume that the elongation L 
of the spring is small, the spring force is very nearly proportional to L; this is known as 
Hooke’s!” law. Thus we write F, = —kL, where the constant of proportionality k is called the 


!0Robert Hooke (1635-1703) was an English scientist with wide-ranging interests. His most important book, 
Micrographia, was published in 1665 and described a variety of microscopical observations. Hooke first published 
his law of elastic behavior in 1676 as ceiiinosssttuv; in 1678 he gave the interpretation ut tensio sic vis, which means, 


roughly, “as the force so is the displacement.” 
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spring constant, and the minus sign is due to the fact that the spring force acts in the upward 
(negative) direction. Since the mass is in equilibrium, the two forces balance each other, which 
means that 


w+ F,-—mg —kL =0. (2) 


For a given weight w = mg, you can measure L and then use equation (2) to determine k. 
Note that k has the units of force per unit length. 


F,--kL 


w-mg 
didis Force diagram for a spring-mass system. 


In the corresponding dynamic problem, we are interested in studying the motion of the 
mass when it is acted on by an external force or is initially displaced. Let u(t), measured 
positive in the downward direction, denote the displacement of the mass from its equilibrium 
position at time f; see Figure 3.7.1. Then u(t) is related to the forces acting on the mass through 
Newton's law of motion 


mu'(t) = f(t), (3) 
where u" is the acceleration of the mass and f is the net force acting on the mass. Observe 


that both u and f are functions of time. In this dynamic problem there are now four separate 
forces that must be considered. 


1. The weight w — mg of the mass always acts downward. 


2. The spring force F, is assumed to be proportional to the total elongation L + u of the 
spring and always acts to restore the spring to its natural position. If L + u > 0, then 
the spring is extended, and the spring force is directed upward. In this case 


F, = —k(L + u). (4) 


On the other hand, if L +u < 0, then the spring is compressed a distance |L + u|, and the 
spring force, which is now directed downward, is given by F; = k|L+u|. However, when 
L+u < 0, it follows that |L +u| = —(L+u), so F, is again given by equation (4). Thus, 
regardless of the position of the mass, the force exerted by the spring is always expressed 
by equation (4). 

3. The damping or resistive force F4 always acts in the direction opposite to the direction of 
motion of the mass. This force may arise from several sources: resistance from the air or 
other medium in which the mass moves, internal energy dissipation due to the extension 
or compression of the spring, friction between the mass and the guides (if any) that 
constrain its motion to one dimension, or a mechanical device (dashpot) that imparts a 
resistive force to the mass. In any case, we assume that the resistive force is proportional 
to the speed |du/dt| of the mass; this is usually referred to as viscous damping. If 
du/dt > 0, then u is increasing, so the mass is moving downward. Then Fy is directed 
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upward and is given by 
Falt) = —yu'(t), (5) 


where y is a positive constant of proportionality known as the damping constant. On the 
other hand, if du/dt < 0, then u is decreasing, the mass is moving upward, and F4 is 
directed downward. In this case, Fy = y|u’(t)|; since |u’(t)| = —u'(t), it follows that 
Fa(t) is again given by equation (5). Thus, regardless of the direction of motion of the 
mass, the damping force is always expressed by equation (5). 

The damping force may be rather complicated, and the assumption that it is modeled 
adequately by equation (5) may be open to question. Some dashpots do behave as 
equation (5) states, and if the other sources of dissipation are small, it may be possible 
to neglect them altogether or to adjust the damping constant y to approximate them. 
An important benefit of the assumption (5) is that it leads to a linear (rather than a 
nonlinear) differential equation. In turn, this means that a thorough analysis of the system 
is straightforward, as we will show in this section and in Section 3.8. 

4. An applied external force F(t) is directed downward or upward as F(t) is positive or 
negative. This could be a force due to the motion of the mount to which the spring is 
attached, or it could be a force applied directly to the mass. Often the external force 
is periodic. 


Taking account of these forces, we can now rewrite Newton's law (3) as 


mu"(t) =w + F(t) + Falt) + F(t) 


mg —k(L +u(t)) — yu'(t) + F(t). (6) 


Since mg — kL — 0 by equation (2), it follows that the equation of motion of the mass is 
mu" (t) + u(t) + ku(t) = F(t), (7) 


where the constants m, y, and k are positive. Note that equation (7) has the same form as 
equation (1), that is, it is a nonhomogeneous second-order linear differential equation with 
constant coefficients. 

It is important to understand that equation (7) is only an approximate equation for 
the displacement u(t). In particular, both equations (4) and (5) should be viewed as 
approximations for the spring force and the damping force, respectively. In our derivation 
we have also neglected the mass of the spring in comparison with the mass of the attached 
body. 

The complete formulation of the vibration problem requires that we specify two initial 
conditions, namely, the initial position uq and the initial velocity vo of the mass: 


u(0) uo, u'(0) = vo. (8) 


It follows from Theorem 3.2.1 that these conditions give a mathematical problem that has a 
unique solution for any values of the constants uo and vg. This is consistent with our physical 
intuition that if the mass is set in motion with a given initial displacement and velocity, then 
its position will be determined uniquely at all future times. The position of the mass is given 
(approximately) by the solution of the second-order linear differential equation (7) subject to 
the prescribed initial conditions (8). 


EXAMPLE 1 


A mass weighing 4 lb stretches a spring 2 in. Suppose that the mass is given an additional 6-in 
displacement in the positive direction and then released. The mass is in a medium that exerts a viscous 
resistance of 6 Ib when the mass has a velocity of 3 ft/s. Under the assumptions discussed in this 
section, formulate the initial value problem that governs the motion of the mass. 


Solution: 


The required initial value problem consists of the differential equation (7) and initial conditions (8), 
so our task is to determine the various constants that appear in these equations. The first step is to 
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choose the units of measurement. Based on the statement of the problem, it is natural to use the 
English rather than the metric system of units. The only time unit mentioned is the second, so we 
will measure ¢ in seconds. On the other hand, both the foot and the inch appear in the statement as 
units of length. It is immaterial which one we use, but having made a choice, we must be consistent. 
To be definite, let us measure the displacement u in feet. 

Since nothing is said in the statement of the problem about an external force, we assume that 
F(t) — 0. To determine m, note that 


ow 4b — llb.s 
£g 32ft/s? 8 ft 


The damping coefficient y is determined from the statement that yu’ is equal to 6 Ib when u' is 3 ft/s. 
Therefore, 
6 Ib Ib - 
) s 


TT ET ft 


The spring constant k is found from the statement that the mass stretches the spring by 2 in or : ft. 
Thus 
| 4lb  ,|b 
1/6 ft ft 


Consequently, differential equation (7) becomes 


1 " / = 
—-u +2u’ + 24u =0, 


8 
or 
u” + 16u' + 192u = 0. (9) 
The initial conditions are 
u(0) = > u'(0) =0. (10) 


The second initial condition is implied by the word “released” in the statement of the problem, which 
we interpret to mean that the mass is set in motion with no initial velocity. 


Undamped Free Vibrations. If there is no external force, then F(t) = O in equation (7). 
Let us also suppose that there is no damping so that ^; = 0; this is an idealized configuration of 
the system, seldom (if ever) completely attainable in practice. However, if the actual damping 
is very small, then the assumption of no damping may yield satisfactory results over short to 
moderate time intervals. In this case the equation of motion (7) reduces to 


mu" + ku = 0. (11) 
The characteristic equation for equation (11) is 


mr? +k=0 


and its roots are r = +i J/k/ m. Thus the general solution of equation (11) is 


u = Acos(wof) + Bsin(wof), (12) 
where 
k 
2 
OQ —. 13 
eee (13) 
The arbitrary constants A and B can be determined if initial conditions of the form (8) are 
given. 
In discussing the solution of equation (11), it is convenient to rewrite equation (12) in the 
form 
u = Rcos(wot — 6), (14) 
or 


u = Rcosó cos (wot) + Rsinó sin (wot). (15) 
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By comparing equation (15) with equation (12), we find that A, B, R, and 6 are related by the 
equations 


A = Rcosó, B -—Rsinó. (16) 
Thus 


B 
R = Vy A? + B?, tand = 7: (17) 


In calculating 6, we must take care to choose the correct quadrant; this can be done by checking 
the signs of cos 6 and sin ô in equations (16). 

The graph of equation (14), or the equivalent equation (12), for a typical set of initial 
conditions is shown in Figure 3.7.3. The graph is a displaced cosine wave that describes a 
periodic, or simple harmonic, motion of the mass. The period of the motion is 


T= = =2r (2). (18) 


The circular frequency wo = 4/k/ m, measured in radians per unit time, is called the natural 
frequency of the vibration. The maximum displacement R of the mass from equilibrium is the 
amplitude of the motion. The dimensionless parameter 6 is called the phase, or phase angle, 
and measures the displacement of the wave from its normal position corresponding to 6 = 0. 


EE ! 


R cos ó 


T [53 a n oe 


-R- = B 


didus Simple harmonic motion; u = R cos(wot — ô). 
Note that the horizontal axis is labeled as wt. 


Note that the motion described by equation (14) has a constant amplitude that does not 
diminish with time. This reflects the fact that, in the absence of damping, there is no way for the 
system to dissipate the energy imparted to it by the initial displacement and velocity. Further, 
for a given mass m and spring constant k, the system always vibrates at the same frequency 
Wo, regardless of the initial conditions. However, the initial conditions do help to determine 
the amplitude of the motion. Finally, observe from equation (18) that the period T increases as 
the mass m increases, so larger masses vibrate more slowly. On the other hand, the period T 
decreases as the spring constant k increases, which means that stiffer springs cause the system 
to vibrate more rapidly. 


EXAMPLE 2 


Suppose that a mass weighing 10 lb stretches a spring 2 in. If the mass is displaced an additional 2 in 
and is then set in motion with an initial upward velocity of 1 ft/s, determine the position of the mass 
at any later time. Also determine the period, amplitude, and phase of the motion. 


Solution: 


The spring constant is k = 10 Ib/2 in = 60 Ib/ft, and the mass is m = w/g = 10/32 Ib-s^/ft. Hence 
the equation of motion reduces to 


u" + 192u = 0, (19) 
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and the general solution is 


u = A cos(84/3t) + B sin(8/31). 
The solution satisfying the initial conditions u(0) = 1/6 ft and u'(0) = —1 ft/s is 


u = = cos( 8/31) - 1 sinca/38). (20) 


8/3 
The natural frequency is wọ = 8/3 = 13.856 rad/s, so the period is T = 2r /wọ = 0.453 s. 
The amplitude R and phase 6 are found from equations (17). We have 
R 1 1 19 


Sota o RS Oe. 
36 192 576 


The second of equations (17) yields tané = — V5 / ^. There are two solutions of this equation, one 
in the second quadrant and one in the fourth. In the present problem, cosó > 0 and sin < 0, so ó 
is in the fourth quadrant. In fact, 


3 
ó= (A) = — 0.40864 rad. 


The graph of the solution (20) is shown in Figure 3.7.4. 


u = 0.182 cos(8V3 t + 0.409) 


JAIE An undamped free vibration: 
u” + 192u — 0, u(0) = 1/6, u'(0) = —1. 
Note that the scale for the horizontal axis is wof. 


Damped Free Vibrations. When the effects of damping are included, the differential 
equation governing the motion of the mass is 


mu” + ^u! + ku — Q0. (21) 


We are especially interested in examining the effect of variations in the damping coefficient 
y for given values of the mass m and spring constant k. The corresponding characteristic 
equation is 


mr? 4 ^r -- k — 0, 


and its roots are 


ipa ims =144/1— — 22 
"ras 2m 2m J = 

Depending on the sign of y? — 4km, the solution u has one of the following forms: 
3? — 4km > 0, u = Ae"! + Be"; (23) 
y? — 4km =0, u= (A + Bt)e?/ Om: Q4) 


1 1/2 
4? — 4km < 0, u = e (A cos( ut) + B sin(ut)), u= 5; (4km = y?) > 0. 
m 


(25) 
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Since m, y, and k are positive, y? — 4km is always less than 7. Hence, if y~? — 4km > 0, then 
the values of r, and r, given by equation (22) are negative. If y? — 4km < 0, then the values 
of rı and rz are complex, but with negative real part. Thus, in all cases, the solution u tends to 
Zero as f — oo; this occurs regardless of the values of the arbitrary constants A and B —that 
is, regardless of the initial conditions. This confirms our intuitive expectation, namely, that 
damping gradually dissipates the energy initially imparted to the system, and consequently 
the motion dies out with increasing time. 

The most interesting case is the third one, which occurs when the damping is small. If we 
let A — Rcosó and B — Rsinó in equation (25), then we obtain 


u = Re 70m cos( ut — 6). (26) 


The displacement u lies between the curves u = +Re77' / Q") - hence it resembles a cosine 
wave whose amplitude decreases as ¢ increases. A typical example is sketched in Figure 3.7.5. 
The motion is called a damped oscillation or a damped vibration. The amplitude factor R 
depends on m, y, k, and the initial conditions. 


Re ?!/2m) 


quem N germ) 


3IdI T3524 Damped vibration; u = Re^?'/?" cos( jut —6). 


Note that the scale for the horizontal axis is jut. 


Although the motion is not periodic, the parameter jz determines the frequency with 
which the mass oscillates back and forth; consequently, jz is called the quasi-frequency. By 
comparing u with the frequency w of undamped motion, we find that 


u _ (4km — 42)! / (2m) _ ( y E y? 


1 ; 27 
Wo Vk/ m Akm en 


8km 
The last approximation is valid when y7/4km is small; we refer to this situation as “small 
damping.” Thus the effect of small damping is to reduce slightly the frequency of the 
oscillation. By analogy with equation (18), the quantity 7; = 27 / u is called the quasi-period 
of the motion. It is the time between successive maxima or successive minima of the position 
of the mass, or between successive passages of the mass through its equilibrium position while 
going in the same direction. The relation between 7; and T is given by 


-1/2 
T 2 2 
ata (1 7 mq Q8) 


4km 8km 


where again the last approximation is valid when y7/4km is small. Thus small damping 
increases the quasi-period. 

Equations (27) and (28) reinforce the significance of the dimensionless ratio 
y? / (4km). It is not the magnitude of y alone that determines whether damping is large or 
small, but the magnitude of y? compared to 4km. When ^?/(4km) is small, then damping 
has a small effect on the quasi-frequency and quasi-period of the motion. On the other hand, 
if we want to study the detailed motion of the mass for all time, then we can never neglect the 
damping force, no matter how small. 
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As y7/(4km) increases, the quasi-frequency u decreases and the quasi-period T; 
increases. In fact, uw — 0 and T; — oo asy — 2./km. As indicated by equations (23), 
(24), and (25), the nature of the solution changes as y passes through the value 2./km. 
The motion with y = 24/km is said to be critically damped. For larger values of y, 
y > 24km, the motion is said to be overdamped. In these cases, given by equations (24) 
and (23), respectively, the mass may pass through its equilibrium position at most once (see 
Figure 3.7.6) and then creep back to it. The mass does not oscillate about the equilibrium, as it 
does for small y. Two typical examples of critically damped motion are shown in Figure 3.7.6, 
and the situation is discussed further in Problems 15 and 16. 


Critically damped motions: u” + u’ + 0.25u = 0; 


u = (A + Bt)e™/?. The solid blue curve is the solution satisfying 
u(0) = 1/2, u'(0) = 7/4; the dashed green curve satisfies 
u(0) = 1/2, u'(0) = —7/4. 


EXAMPLE 3 


The motion of a certain spring-mass system is governed by the differential equation 


H 1 / 
u tg” +u =0, (29) 


where u is measured in feet and f in seconds. If u(0) = 2 and u'(0) = 0, determine the position of 
the mass at any time. Find the quasi-frequency and the quasi-period, as well as the time at which the 
mass first passes through its equilibrium position. Also find the time 7 such that |u(t)| < 0.1 for all 
> Tas 

Solution: 


The solution of equation (29) is 


u(t) = e/16 (^ ( : =) + zi : 2) 


To satisfy the initial conditions, we must choose A = 2 and B = 2/4/ 255; hence the solution of the 
initial value problem is 


4/255 4/255 
u = e™/!6 | 2cos t| + 2 sin t 
16 4/255 16 


2 J/255 
B Ae ee = ) (30) 
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u"+u=0 
u"+0.125u'+u=0 


- 
a m m e 


Vibration with small damping (solid blue curve) and with no damping 
(dashed green curve). Both motions have the same initial conditions: u(0) = 2, u/(0) = 0. 


where 6 is in the first quadrant with tan = 1/4/255, so 6 = 0.06254. The displacement of the 
mass as a function of time is shown in Figure 3.7.7. For purposes of comparison, we also show the 
motion if the damping term is neglected. 

The quasi-frequency is pp = 4/255/16 = 0.998, and the quasi-period is T; = 27 / u = 6.295 s. 
These values differ only slightly from the corresponding values (1 and 27, respectively) for the 
undamped oscillation. This is evident also from the graphs in Figure 3.7.7, which rise and fall almost 
together. The damping coefficient is small in this example: only one-sixteenth of the critical value, 
in fact. Nevertheless, the amplitude of the oscillation is reduced rather rapidly. 


-t16 V255 
2 et6cos (98; - 0.06254) 


felt: F733) Solution of Example 3 for 40 < t < 60; determination of the time 
T after which |u(t)| < 0.1. 


Figure 3.7.8 shows the graph of the solution for 40 < t < 60, together with the graphs of 
u = +0.1. From the graph it appears that 7 is about 47.5, and by a more precise calculation we find 


that 7 = 47.5149 s. 
To find the time at which the mass first passes through its equilibrium position, we refer to 


equation (30) and set 4/2551 / 16 — 6 equal to 7/2, the smallest positive zero of the cosine function. 
Then, by solving for t, we obtain 


1 
(mm) ~ 1.637 s. 


4/255 \ 2 
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Resistance R Capacitance C 


(1 Inductance L 


Impressed voltage E(t) 


AIEA A simple electric circuit. 


Electric Circuits. A second example of the occurrence of second-order linear differential 
equations with constant coefficients is their use as a model of the flow of electric current 
in the simple series circuit shown in Figure 3.7.9. The current 7, measured in amperes (A), 
is a function of time t. The resistance R in ohms (Q), the capacitance C in farads (F), and 
the inductance L in henrys (H) are all positive and are assumed to be known constants. The 
impressed voltage E in volts (V) is a given function of time. Another physical quantity that 
enters the discussion is the total charge Q in coulombs (C) on the capacitor at time t. The 
relation between charge Q and current / is 


4Q 
I = —_., 1 
dt x: 


The flow of current in the circuit is governed by Kirchhoff's!! second law: In a closed 


circuit the impressed voltage is equal to the sum of the voltage drops in the rest of the circuit. 
According to the elementary laws of electricity, we know that 


The voltage drop across the resistor is RT. 


The voltage drop across the capacitor is E 


dI 
The voltage drop across the inductor is L T 


Hence, by Kirchhoff s law, 


LE. gra lo E(t) (32) 
dt C= ` 
The units for voltage, resistance, current, charge, capacitance, inductance, and time are all 


related: 
1 volt = 1 ohm - 1 ampere = 1 coulomb/ 1 farad = 1 henry - 1 ampere/1 second. 


Substituting for J from equation (31), we obtain the differential equation 
1 
LQ" + RQ' + 5O = E(t) (33) 
for the charge Q. The initial conditions are 


Q(t) = Qo, Q'(to) = I (to) = Io. (34) 


Thus to know the charge at any time it is sufficient to know the charge on the capacitor and 
the current in the circuit at some initial time fo. 

Alternatively, we can obtain a differential equation for the current J by differentiating 
equation (33) with respect to 7, and then substituting for d Q/dt from equation (31). The 
result is 


1 
LIN+ RI + 5I = EQ), (35) 


with the initial conditions 


I(to.) = Io I'(to) = Ig. (36) 


"Gustav Kirchhoff (1824-1887) was a German physicist and professor at Breslau, Heidelberg, and Berlin. He 
formulated the basic laws of electric circuits about 1845 while still a student at Albertus University in his native 
Königsberg. In 1857 he discovered that an electric current in a resistanceless wire travels at the speed of light. He 
is also famous for fundamental work in electromagnetic absorption and emission and was one of the founders of 
spectroscopy. 


From equation (32) it follows that 
Bea 
1 G 
= : 
L 
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(37) 


Hence J is also determined by the initial charge and current, which are physically measurable 


quantities. 


The most important conclusion from this discussion is that the flow of current in the circuit 
is described by an initial value problem of precisely the same form as the one that describes the 
motion of a spring-mass system. This is a good example of the unifying role of mathematics: 
once you know how to solve second-order linear equations with constant coefficients, you can 
interpret the results in terms of mechanical vibrations, electric circuits, or any other physical 


situation that leads to the same problem. 


Problems 


In each of Problems | and 2, determine wo, R, and 6 so as to write the 
given expression in the form u = R cos(wot — 6). 

1. u-3cos(2t) + Asin(2t) 

2. u — —2cos(mt) —3sin(zt) 

3. A mass of 100 g stretches a spring 5 cm. If the mass is set in 
motion from its equilibrium position with a downward velocity of 
10 cm/s, and if there is no damping, determine the position u of the 
mass at any time t. When does the mass first return to its equilibrium 
position? 

4. A mass weighing 3 Ib stretches a spring 3 in. If the mass is 
pushed upward, contracting the spring a distance of 1 in and then set in 
motion with a downward velocity of 2 ft/s, and if there is no damping, 
find the position u of the mass at any time t. Determine the frequency, 
period, amplitude, and phase of the motion. 


@ 5. A mass of 20 g stretches a spring 5 cm. Suppose that the 
mass is also attached to a viscous damper with a damping constant of 
400 dyn-s/cm. If the mass is pulled down an additional 2 cm and then 
released, find its position u at any time t. Plot u versus t. Determine the 
quasi-frequency and the quasi-period. Determine the ratio of the quasi- 
period to the period of the corresponding undamped motion. Also find 
the time 7 such that |u(1) | < 0.05 cm for all t > 7. 


6. A spring is stretched 10 cm by a force of 3 N. A mass of 2 kg 
is hung from the spring and is also attached to a viscous damper that 
exerts a force of 3 N when the velocity of the mass is 5 m/s. If the mass 
is pulled down 5 cm below its equilibrium position and given an initial 
downward velocity of 10 cm/s, determine its position u at any time t. 
Find the quasi-frequency p and the ratio of u to the natural frequency 
of the corresponding undamped motion. 


7. A series circuit has a capacitor of 10? F, aresistor of 3 x 10? Q, 
and an inductor of 0.2 H. The initial charge on the capacitor is 1079 C 
and there is no initial current. Find the charge Q on the capacitor at 
any time f. 


8. A vibrating system satisfies the equation u” + yu’ +u = 0. Find 
the value of the damping coefficient y for which the quasi-period of 
the damped motion is 50% greater than the period of the corresponding 
undamped motion. 


9. Show that the period of motion of an undamped vibration of 
a mass hanging from a vertical spring is 27 ,/L/g, where L is the 
elongation of the spring due to the mass, and g is the acceleration due 
to gravity. 

10. Show that the solution of the initial value problem 


mu" + yu'+ku=0, | u(tg) — ug, u'(tg) = Uy 


can be expressed as the sum u = v + w, where v satisfies the initial 
conditions v(tg) = uo, v'(19) = 0, w satisfies the initial conditions 
w(to) — 0,w'(tg) = Ups and both v and w satisfy the same differential 
equation as u. This is another instance of superposing solutions of 
simpler problems to obtain the solution of a more general problem. 


11. a. Show that A cos (wot) +B sin(wof) can be written in the 
form r sin(wot — 0). Determine r and @ in terms of A and B. 
b. If Rcos(wot — 6) = rsin(wot — 0), determine the 
relationship among R, r, ô, and 0. 


12. Ifa series circuit has a capacitor of C = 0.8 x 107 F and an 
inductor of L = 0.2 H, find the resistance R so that the circuit is 
critically damped. 


13. Assume that the system described by the differential equation 
mu" + yu’ + ku = O is either critically damped or overdamped. Show 
that the mass can pass through the equilibrium position at most once, 
regardless of the initial conditions. 

Hint: Determine all possible values of t for which u = 0. 


14. Assume that the system described by the differential equation 
mu" 4-^u' 4-ku = O is critically damped and that the initial conditions 
are u(0) = ug, u/(0) = vo. If vy = 0, show that u — O0 ast — oo 
but that u is never zero. If uo is positive, determine a condition on vo 
that will ensure that the mass passes through its equilibrium position 
after it is released. 


15. Logarithmic Decrement. a. For the damped oscillation 

described by equation (26), show that the time between successive 

maxima is Ty = 27 / p. 
b. Show that the ratio of the displacements at two successive 
maxima is given by exp(^T4/(2m)). Observe that this ratio 
does not depend on which pair of maxima is chosen. The natural 
logarithm of this ratio is called the logarithmic decrement and 
is denoted by A. 
c. Show that A = v y/(my). Since m, jz, and A are quantities 
that can be measured easily for a mechanical system, this result 
provides a convenient and practical method for determining 
the damping constant of the system, which is more difficult to 
measure directly. In particular, for the motion of a vibrating 
mass in a viscous fluid, the damping constant depends on the 
viscosity of the fluid; for simple geometric shapes the form of 
this dependence is known, and the preceding relation allows the 
experimental determination of the viscosity. This is one of the 
most accurate ways of determining the viscosity of a gas at high 
pressure. 
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16. Referring to Problem 15, find the logarithmic decrement of the 
system in Problem 5. 


17. The position of a certain spring-mass system satisfies the initial 
value problem 


3 
-u'--ku-0, u(0) =2, u'(0) =v. 


2 
If the period and amplitude of the resulting motion are observed to be 
7 and 3, respectively, determine the values of k and v. 


18. Consider the initial value problem 


mu" +yu'+ku=0, | u(0) 2 ug, u'(0) = vg. 


Assume that y? < 4km. 
a. Solve the initial value problem. 
b. Write the solution in the form u(t) = Re~1"/(2” cos(ut—ô). 
Determine R in terms of m, y, k, uo, and vo. 
C. Investigate the dependence of R on the damping coefficient 
^y for fixed values of the other parameters. 


19. A cubic block of side / and mass density p per unit volume is 
floating in a fluid of mass density po per unit volume, where po > p. 
If the block is slightly depressed and then released, it oscillates in the 
vertical direction. Assuming that the viscous damping of the fluid and 
air can be neglected, derive the differential equation of motion and 
determine the period of the motion. 

Hint: Use Archimedes’!? principle: an object that is completely or 
partially submerged in a fluid is acted on by an upward (buoyant) force 
equal to the weight of the displaced fluid. 


20. The position of a certain undamped spring-mass system satisfies 
the initial value problem 


u"4-2u 20, u(0) =0, u'(0) =2. 


a. Find the solution of this initial value problem. 

(G] b. Plot u versus f and u' versus f on the same axes. 

Q c. Plot wv’ versus u; that is, plot u(t) and u/(1) parametrically 
with t as the parameter. This plot is known as a phase plot, and 
the uu'-plane is called the phase plane. Observe that a closed 
curve in the phase plane corresponds to a periodic solution u(t). 
What is the direction of motion on the phase plot as t increases? 


21. The position of a certain spring-mass system satisfies the initial 
value problem 


u” + ju +2u=0, u(0) =0, w(0) —2. 

a. Find the solution of this initial value problem. 

@ b. Plot u versus t and u’ versus t on the same axes. 

Q c. Plot u’ versus u in the phase plane (see Problem 20). 
Identify several corresponding points on the curves in parts b and 
c. What is the direction of motion on the phase plot as t increases? 


22. Inthe absence of damping, the motion of a spring-mass system 
satisfies the initial value problem 


mu" --ku =0, u(0) =a, u'(0) =b. 


a. Show that the kinetic energy initially imparted to the mass 
is mb?/2 and that the potential energy initially stored in the 
spring is ka?/2, so initially the total energy in the system is 
(ka? + mb?) /2 


b. Solve the given initial value problem. 

c. Using the solution in part b, determine the total energy in the 
system at any time t. Your result should confirm the principle of 
conservation of energy for this system. 


23. Suppose that a mass m slides without friction on a horizontal 
surface. The mass is attached to a spring with spring constant k, as 
shown in Figure 3.7.10, and is also subject to viscous air resistance 
with coefficient y. Show that the displacement u(t) of the mass 
from its equilibrium position satisfies equation (21). How does the 
derivation of the equation of motion in this case differ from the 
derivation given in the text? 


u(t) 


dicU sem A spring-mass system. 


24. Inthespring-mass system of Problem 23, suppose that the spring 
force is not given by Hooke's law but instead satisfies the relation 


F; = —(ku + eu’), 


where k > 0 and e is small but may be of either sign. The spring is 
called a hardening spring if € > O and a softening spring if € < 0. 
Why are these terms appropriate? 
a. Show that the displacement u(t) of the mass from its 
equilibrium position satisfies the differential equation 


mu" + yu! +ku+ eu? =0. 
Suppose that the initial conditions are 


u(0) =0, u'(0) =1. 


In the remainder of this problem, assume that m — 1, k — 1, and 
y =0. 

b. Find u(t) when e = 0 and also determine the amplitude and 
period of the motion. 

Qc. Let e = 0.1. Plot a numerical approximation to the 
solution. Does the motion appear to be periodic? Estimate the 
amplitude and period. 

O d. Repeat part c for e = 0.2 and e = 0.3. 

Q e. Plot your estimated values of the amplitude A and the 
period T versus e. Describe the way in which A and T, 
respectively, depend on e. 

O f. Repeat parts c, d, and e for negative values of c. 


!2 Archimedes (287-212 BCE) was the foremost of the ancient Greek 
mathematicians. He lived in Syracuse on the island of Sicily. His most notable 
discoveries were in geometry, but he also made important contributions to 
hydrostatics and other branches of mechanics. His method of exhaustion is a 
precursor of the integral calculus developed by Newton and Leibniz almost 
two millennia later. He died at the hands of a Roman soldier during the Second 
Punic War. 


38 Forced Periodic Vibrations 


We will now investigate the situation in which a periodic external force is applied to a spring- 
mass system. The behavior of this simple system models that of many oscillatory systems with 
an external force due, for example, to a motor attached to the system. We will first consider 
the case in which damping is present and will look later at the idealized special case in which 
there is assumed to be no damping. 


Forced Vibrations with Damping. The algebraic calculations can be fairly complicated in 
this kind of problem, so we will begin with a relatively simple example. 


EXAMPLE 1 


Suppose that the motion of a certain spring-mass system satisfies the differential equation 


5 
u” +u + 7u = 3 cost (1) 
and the initial conditions 
u(0) =2, u'(0) =3. (2) 
Find the solution of this initial value problem and describe the behavior of the solution for 
large t. 
Solution: 


The homogeneous equation corresponding to equation (1) has the characteristic equation 
r +r+ 1 = 0 with roots r = -5 + i. Thus a general solution u(t) of this homogeneous 


equation is 


u(t) = ce!" cost + coe? sint. (3) 
A particular solution of equation (1) has the form U(t) = Acost + Bsint, where A and B 
are found by substituting U(t) for u in equation (1). We have U'(f) = —Asint + B cost and 
U"(t) = —Acost — B sint. Thus, from equation (1) we obtain 


1 1 
(is + 2) cost + (-4 + e) sint = 3 cost. 


Consequently, A and B must satisfy the equations 


1 1 
-A+B=3, -A+-B=0, 
ras *3 


2 48 
with the result that A — 17 and B — 17 Therefore, the particular solution is 


UC) 12 "T 48 , i (4) 
= — cos — sint, 
17 17 
and the general solution of equation (1) is 
12 48 
u — u(t) + U(t) = ce"? cost 4 coe sint + 15 cos f + T sin f. (5) 

The remaining constants c; and c; are determined by the initial conditions (2). From 

equation (5), and its first derivative, we have 
48 


12 1 
üy-e e» woe 2S es 
u(0) ati u (0) Q9 teti 


22 14 
soc, = i and c; = 1T Thus we finally arrive at the solution of the given initial value problem 


(1), Q), namely, 


we eg de obeundo eee an, (6 
u = 17° cos 17° sin 17 COS 17 sın f. 


The graph of the solution (6) is shown by the green curve in Figure 3.8.1. 
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It is important to note that the solution consists of two distinct parts. The first two terms on the 
right-hand side of equation (6) contain the exponential factor e^'/?; as a result, they rapidly approach 
zero. It is customary to call these terms the transient solution. The remaining terms in equation (6) 
involve only sines and cosines, so they represent an oscillation that continues indefinitely. We refer 
to them as the steady-state solution. The dotted red and dashed blue curves in Figure 3.8.1 show 
the transient and the steady-state parts of the solution, respectively. The transient part comes from 
the solution of the homogeneous equation corresponding to equation (1) and is needed to satisfy the 
initial conditions. The steady-state solution is the particular solution of the full nonhomogeneous 
equation. After a fairly short time, the transient solution is vanishingly small and the full solution is 


essentially indistinguishable from the steady state. 


Full solution 


Steady-state 
solution 


Transient 
solution 


Solution of the initial value problem (1), (2): 
u” +u'+5u/4 = 3cost, u(0) = 2, u'(0) = 3. The full solution (solid green) 
is the sum of the transient solution (dotted red) and steady-state solution (dashed 
blue). 


The equation of motion of a general spring-mass system subject to an external force F(t) 
is equation (7) in Section 3.7: 


mu" (t) + yu'(t) +ku(t) = F(t), (7) 


where m, y, and k are the mass, damping coefficient, and spring constant of the spring-mass 
system. Suppose now that the external force is given by Fo cos(wt), where Fo and w are 
positive constants representing the amplitude and frequency, respectively, of the force. Then 
equation (7) becomes 


mu^ + yu' + ku = Focos(wt). (8) 


Solutions of equation (8) behave very much like the solution in the preceding example. 
The general solution of equation (8) must have the form 


u = cqu4(t) + cous(t) + Acos(wt) + Bsin(wt) = uc(t) + U(t). (9) 


The first two terms on the right-hand side of equation (9) are the general solution u,(t) of the 
homogeneous equation corresponding to equation (8), and the latter two terms are a particular 
solution U (t) of the full nonhomogeneous equation. The coefficients A and B can be found, as 
usual, by substituting these terms into the differential equation (8), while the arbitrary constants 
c, and c» are available to satisfy initial conditions, if any are prescribed. The solutions u(t) 


and u(t) of the homogeneous equation depend on the roots rı and r2 of the characteristic 
equation mr? + yr +k = 0. Since m, y, and k are all positive, it follows that rı and r» either 
are real and negative or are complex conjugates with a negative real part. In either case, both 
u(t) and u(t) approach zero as f — oo. Since u,(t) dies out as f increases, it is called the 
transient solution. In many applications, it is of little importance and (depending on the value 
of y) may well be undetectable after only a few seconds. 

The remaining terms in equation (9) — namely, U (t) = A cos(wt) + B sin(wt) —do not 
die out as f increases but persist indefinitely, or as long as the external force is applied. They 
represent a steady oscillation with the same frequency as the external force and are called the 
steady-state solution or the forced response of the system. The transient solution enables 
us to satisfy whatever initial conditions may be imposed. With increasing time, the energy 
put into the system by the initial displacement and velocity is dissipated through the damping 
force, and the motion then becomes the response of the system to the external force. Without 
damping, the effect of the initial conditions would persist for all time. 

It is convenient to express U(t) as a single trigonometric term rather than as a sum 
of two terms. Recall that we did this for other similar expressions in Section 3.7. Thus we 
write 


U(t) = Rcos(wt — 6). (10) 
The amplitude R and phase 6 depend directly on A and B and indirectly on the parameters in 


the differential equation (8). It is possible to show, by straightforward but somewhat lengthy 
algebraic computations, that 


2 2 
F, m(wg — w°) 
R=, cos § = —* and sind =, (11) 
where 
A= mw —w?)?+72w? and w = Ls (12) 
0 t 


Recall that wọ is the natural frequency of the unforced system in the absence of damping. 

We now investigate how the amplitude R of the steady-state oscillation depends on the 
frequency w of the external force. Substituting from equation (12) into the expression for R 
in equation (11) and executing some algebraic manipulations, we find that 


-1/2 
2\ 2 2 2 
Rk w w y? 
— — 1— | — +r| — where l = T (13) 
Fo Wo Wo mk 


Observe that the quantity Rk / Fo is the ratio of the amplitude R of the forced response to Fo/ k, 
the static displacement of the spring produced by a force Fo. 

For low frequency excitation— that is, as w — 0—it follows from equation (13) that 
Rk/Fy — lor R — Fo/k. At the other extreme, for very high frequency excitation, 
equation (13) implies that R — O0 as w — oo. At an intermediate value of w the amplitude 
may have a maximum. To find this maximum point, we can differentiate R with respect to w 
and set the result equal to zero. In this way we find that the maximum amplitude occurs when 
W = Wmax, Where 


2 2 
2 2 2 ^Y 
W max — Yo T 2m2 = wg ( E 5) E (14) 


Note that Wimax < wo and that w max is close to wo when y is small. The maximum value 
of R is 


Fo 


mE ge 
= 1+ : (15) 
ywoy 1 — (7? /4mk) Vwo 8mk 


Rmax = 
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where the last expression is an approximation that is valid when » is small (see Problem 5). If 


2 
E > 2, then Wimax as given by equation (14) is imaginary; in this case the maximum value 
m 


of R occurs for w = 0, and R is a monotone decreasing function of w. Recall that critical 


2 
damping occurs when E = 4. 


m 


^ 


For small y it follows from equation (15) that Rmax = =, Thus, for lightly damped 
0 

systems, the amplitude R of the forced response when w is near wy is quite large even 
for relatively small external forces, and the smaller the value of y, the more pronounced 
is this effect. This phenomenon is known as resonance, and it is often an important design 
consideration. Resonance can be either good or bad, depending on the circumstances. It must 
be taken very seriously in the design of structures, such as buildings and bridges, where it can 
produce instabilities that might lead to the catastrophic failure of the structure. On the other 
hand, resonance can be put to good use in the design of instruments, such as seismographs, 
that are intended to detect weak periodic incoming signals. 


RRIFyA 
10+ 


T = 0.015625 
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| FIGURE 3.8.2 | Forced vibration with damping: amplitude of steady-state 
response versus frequency of driving force for several values of the 


dimensionless damping parameter T = y7/ mk. 


Rk 
Figure 3.8.2 contains some representative graphs of E versus = for several values of 
0 
2 


| = L We refer to I' as a damping parameter, as the following examples will explain. 
m 

The graph corresponding to [ = 0.015625 is included because this is the value of I that 

occurs in Example 2 below. Note particularly the sharp peak in the curve corresponding to 

T = 0.015625 near — = 1. The limiting case as Il — 0 is also shown. It follows from 


wo 
F, 
equation (13), or from equations (11) and (12), that R —> 7 2 as y — 0 and hence 
m lwo — "d 


R 
E is asymptotic to the vertical line w = wo, as shown in the figure. As the damping in the 
0 

system increases, the peak response gradually diminishes. 


Figure 3.8.2 also illustrates the usefulness of dimensionless variables. You can easily 

verify that each of the quantities - — and I’ is dimensionless (see Problem 9d). The 
0 Wo 

importance of this observation is that the number of significant parameters in the problem has 

been reduced to three rather than the five that appear in equation (8). Thus only one family of 

curves, of which a few are shown in Figure 3.8.2, is needed to describe the response-versus- 

frequency behavior of all systems governed by equation (8). 

The phase angle ó also depends in an interesting way on w. For w near zero, it follows 
from equations (11) and (12) that cos ó = 1 and sinô = 0. Thus 6 = 0, and the response is 
nearly in phase with the excitation, meaning that they rise and fall together and, in particular, 
assume their respective maxima nearly together and their respective minima nearly together. 
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For w = wọ we find that cos ô = 0 and sin ô = 1, so 6 = 7/2. In this case the response lags 
behind the excitation by 7 /2; that is, the peaks of the response occur 7 /2 later than the peaks 
of the excitation, and similarly for the valleys. Finally, for w very large, we have cos 6 = —1 
and sin ó = 0. Thus 6 = s so that the response is nearly out of phase with the excitation; this 
means that the response is minimum when the excitation is maximum, and vice versa. Figure 
3.8.3 shows the graphs of 6 versus w / wo for several values of I’. For small damping, the phase 
transition from near 6 = 0 to near 6 = m occurs rather abruptly, whereas for larger values of 
the damping parameter, the transition takes place more gradually. 


0 1 2 3 4 ola, 


| FIGURE 3.8.3 | Forced vibration with damping: phase of steady-state 


response versus frequency of driving force for several values of the 
dimensionless damping parameter T = y? / mk. 


EXAMPLE 2 


Consider the initial value problem 


1 
u" + g” +u = 3cos(wt), u(0)=2, u'(0) =0. (16) 


Show plots of the solution for different values of the forcing frequency w, and compare them with 
corresponding plots of the forcing function. 


Solution: 


For this system we have wọ = 1 and I = 1/64 = 0.015625. Its unforced motion was discussed in 
Example 3 of Section 3.7, and Figure 3.7.7 shows the graph of the solution of the unforced problem. 
Figures 3.8.4, 3.8.5, and 3.8.6 show the solution of the forced problem (16) for w = 0.3, w = 1, and 
w = 2, respectively. The graph of the corresponding forcing function is also shown in each figure. 
In this example the static displacement, Fọ/ k, is equal to 3. 

Figure 3.8.4 shows the low frequency case, w/w» = 0.3. After the initial transient response 
is substantially damped out, the remaining steady-state response is essentially in phase with the 
excitation, and the amplitude of the response is somewhat larger than the static displacement. To 
be specific, R = 3.2939 and 6 = 0.041185. 

The resonant case, w /wg = 1, is shown in Figure 3.8.5. Here, the amplitude of the steady-state 
response is eight times the static displacement, and the figure also shows the predicted phase lag of 
T /2 relative to the external force. 

The case of comparatively high frequency excitation is shown in Figure 3.8.6. Observe that the 
amplitude of the steady forced response is approximately one-third the static displacement and that 
the phase difference between the excitation and the response is approximately 7 . More precisely, we 
find that R = 0.99655 and that 6 = 3.0585. 
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Solution Forcing function 


A forced vibration with damping; the solution (solid blue) 


of equation (16) with w = 0.3: u" + iu +u = 3cos(0.3r), 
u(0) = 2, u'(0) = 0. The dashed red curve is the external force: 
F(t) =3cos(0.3r). 


-20 


Forcing function Solution 


| FIGURE 3.8.5 | A forced vibration with damping; the solution (solid blue) 
of equation (16) with w = 1: u" + m +u = 3cost, u(0) = 2, 
u'(0) = 0. The dashed red curve is the external force: F(t) = 3 cost. 
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Forcing function Solution 


A forced vibration with damping; the solution (solid blue) 


of equation (16) with w = 2: u" + gu +u = 3cos(2t), u(0) = 2, 
u'(0) = 0. The dashed red curve is the external force: F(t) = 3 cos(2r). 


Forced Vibrations Without Damping. We now assume that y = 0 in equation (8), thereby 
obtaining the equation of motion of an undamped forced oscillator, 


mu" + ku = Focos(wt). (17) 


The form of the general solution of equation (17) is different, depending on whether the forcing 

frequency w is different from or equal to the natural frequency wo = ./k/m of the unforced 

system. First consider the case w z wo; then the general solution of equation (17) is 

Fo 

s cos(wt). (18) 
2 


u = cı cos(wot) + c» sin(wot) + 
m(wo — w?) 


The constants c, and cz are determined by the initial conditions. The resulting motion is, in 
general, the sum of two periodic motions of different frequencies (wo and w) and different 
amplitudes as well. 

It is particularly interesting to suppose that the mass is initially at rest so that the initial 
conditions are u(0) = 0 and u'(0) = 0. Then the energy driving the system comes entirely 
from the external force, with no contribution from the initial conditions. In this case it turns 
out that the constants c, and c» in equation (18) are given by 


Fo 


c= — €? = 0, (19) 


mwg — w?) i 
and the solution of equation (17) is 
Fo 
u= — ~ (cos(wt) — cos(wot)). (20) 
m(wo — w?) 
This is the sum of two periodic functions of different periods but the same amplitude. Making 
1 1 
use of the trigonometric identities for cos( A+B) with A = zí wo+w)t and B = (wow), 
we can write equation (20) in the form 
2R 1 1 
u = (wgw?) sin( 5 (0-94) sin( zwoor) Q1) 


If |wo — w| is small, then wọ + w is much greater than |wo — w|. Consequently, 


1 1 
sin (x Wy w) r) is a rapidly oscillating function compared to sin G (Wo — w) r) . Thus the 


motion is a rapid oscillation with frequency z(o + w) but with a slowly varying sinusoidal 


sin( 5 Gs 23! 


This type of motion, possessing a periodic variation of amplitude, exhibits what is called a 
beat. For example, such a phenomenon occurs in acoustics when two tuning forks of nearly 
equal frequency are excited simultaneously. In this case the periodic variation of amplitude is 
quite apparent to the unaided ear. In electronics, the variation of the amplitude with time is 
called amplitude modulation. 


amplitude 
2Fo 


2 2 
mug —c 


EXAMPLE 3 


Solve the initial value problem 


u" +u = 5 cos(0.80) u(0) =0, u'(0) = 0, (22) 
and plot the solution. 
Solution: 
In this case wọ = 1,w = 0.8, and Fo = 5 so from equation (21) the solution of the given problem is 


u = 2.778 sin(0.1t) sin(0.9r). (23) 


3.8 Forced Periodic Vibrations 


165 


166 CHAPTER3 Second-Order Linear Differential Equations 


A graph of this solution is shown in Figure 3.8.7. The amplitude variation has a slow frequency of 
0.1 and a corresponding slow period of 27 /0.1 = 207. Note that a half-period of 107 corresponds 
to a single cycle of increasing and then decreasing amplitude. The displacement of the spring-mass 
system oscillates with a relatively fast frequency of 0.9, which is only slightly less than the natural 
frequency wy. 

Now imagine that the forcing frequency w is increased, say, to w = 0.9. Then the slow 
frequency is halved to 0.05, and the corresponding slow half-period is doubled to 207 . The multiplier 
2.7778 also increases substantially, to 5.263. However, the fast frequency is only marginally 
increased, to 0.95. Can you visualize what happens as w takes on values closer and closer to the 
natural frequency wo = 1? 


u = 2.778 sin (0.1¢) 
u = 2.778 sin (0.1¢) sin (0.9¢) 


1 
A beat; the solution (solid blue) of equation (22): u” + u = 3 cos(0.81), 
u(0) = 0, u'(0) = Ois u = 2.778 sin (0.17) sin(0.9r). The dashed red curve is the external 
1 
force F(t) = 5 cos(0.8f). 


Now let us return to equation (17) and consider the case of resonance, where w = wo; 
that is, the frequency of the forcing function is the same as the natural frequency of the system. 
Then the nonhomogeneous term Fo cos(wt) is a solution of the homogeneous equation. In this 
case the solution of equation (17) is 


Fo 


u = c, COS Wot + c» sin wot + t sin(wot). (24) 


Ip 


Consider the following example. 


EXAMPLE 4 


Solve the initial value problem 


1 
u" +u = z cost, u(0)=0, u(0)=0, (25) 
and plot the graph of the solution. 


Solution: 


The general solution of the differential equation is 


: t 
u = cı cost + ca sint + 49H 
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and the initial conditions require that c; = c5 = 0. Thus the solution of the given initial value 


problem is 
t, 
u = — sint. 
4 
The graph of the solution is shown in Figure 3.8.8. 


uA 


10 


(26) 


-10 


Resonance; the solution (solid blue) of equation (25): 


1 t 
u" +u = z cost, u(0) —0,u'(0) = Qis u = z Sint. 


Because of the term ft sin(wof), the solution (24) predicts that the motion will become 
unbounded as t — oo regardless of the values of c, and cz, and Figure 3.8.8 bears this 
out. Of course, in reality, unbounded oscillations do not occur, because the spring cannot 
stretch infinitely far. Moreover, as soon as u becomes large, the mathematical model on 
which equation (17) is based is no longer valid, since the assumption that the spring force 
depends linearly on the displacement requires that u be small. As we have seen, if damping 
is included in the model, the predicted motion remains bounded; however, the response to the 
input function Fy cos(wf) may be quite large if the damping is small and w is close to wọ. 


Problems 


In each of Problems 1 through 3, write the given expression as a 
product of two trigonometric functions of different frequencies. 


l. sin(7t) — sin( 6t) 

2. cos(rt) + cos(2z1) 

3. sin(3f) + sin(4r) 

4. A mass of 5 kg stretches a spring 10 cm. The mass 
is acted on by an external force of 10sin(t/2) N (newtons) 
and moves in a medium that imparts a viscous force of 2 N 
when the speed of the mass is 4 cm/s. If the mass is set in motion from 


its equilibrium position with an initial velocity of 3 cm/s, formulate 
the initial value problem describing the motion of the mass. 


5. a. Find the solution of the initial value problem in 
Problem 4. 
b. Identify the transient and steady-state parts of the solution. 
Q c. Plot the graph of the steady-state solution. 


@ d. If the given external force is replaced by a force of 
2cos(wt) of frequency w, find the value of w for which the 
amplitude of the forced response is maximum. 


@ 6. A mass that weighs 8 Ib stretches a spring 6 in. The system 
is acted on by an external force of 8 sin( 8+) Ib. If the mass is pulled 
down 3 in and then released, determine the position of the mass at any 
time. Determine the first four times at which the velocity of the mass 
is zero. 

7. A spring is stretched 6 in by a mass that weighs 8 Ib. The mass 
is attached to a dashpot mechanism that has a damping constant of 


1 
z Ib-s/ft and is acted on by an external force of 4cos(21) Ib. 


a. Determine the steady-state response of this system. 

b. If the given mass is replaced by a mass m, determine the 
value of m for which the amplitude of the steady-state response 
is maximum. 


168 CHAPTER3 Second-Order Linear Differential Equations 


8. Aspring-mass system has a spring constant of 3 N/m. A mass of 
2 kg is attached to the spring, and the motion takes place in a viscous 
fluid that offers a resistance numerically equal to the magnitude of the 
instantaneous velocity. If the system is driven by an external force 
of (3cos(3t) — 2sin(3r)) N, determine the steady-state response. 
Express your answer in the form R cos(wt — 6). 


9. Inthis problem we ask you to supply some of the details in the 
analysis of a forced damped oscillator. 

a. Derive equations (10), (11), and (12) for the steady-state 
solution of equation (8). 
b. Derive the expression in equation (13) for Rk/ Fo. 
C. Show that sx and Rmax are given by equations (14) and 
(15), respectively. 
d. Verify that Rk/ Fo, w/wo, and T = ?/(mk) are all 
dimensionless quantities. 


10. Find the velocity of the steady-state response given by 
equation (10). Then show that the velocity is maximum when w = wọ. 


11. Find the solution of the initial value problem 


u" +u = F(t), u(0) =0, u'(0) 20, 
where 
Fot, 0<t<n, 
F(t) = 4 Fo(2a — t), Se 
0, Amd 


Hint: Treat each time interval separately, and match the solutions in 
the different intervals by requiring u and u’ to be continuous functions 
of t. 


@ 12. A series circuit has a capacitor of 0.25 x 1076F, a resistor 
of 5 x 10? Q, and an inductor of 1 H. The initial charge on the capacitor 
is zero. If a 12 V battery is connected to the circuit and the circuit is 
closed at t = 0, determine the charge on the capacitor at t = 0.001 s, 
att = 0.01 s, and at any time r. Also determine the limiting charge as 
t — oo. 


@ 13. Consider the forced but undamped system described by the 
initial value problem 


u” +u=3cos(wt), u(0)=0, u'(0) =0. 


a. Find the solution u(t) forw Æ 1. 

@ b. Plot the solution u(t) versus t for w = 0.7, w = 0.8, and 
w = 0.9. Describe how the response u(t) changes as w varies in 
this interval. What happens as w takes on values closer and closer 
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to 1? Note that the natural frequency of the unforced system is 
Qo = l. 
14. Consider the vibrating system described by the initial value 
problem 


u” +u -3cos(wt), u(0)=1, u'(0)-2l. 


a. Find the solution for w Æ 1. 

(3 b. Plot the solution w(t) versus t for w = 0.7, w = 0.8, and 
w = 0.9. Compare the results with those of Problem 13; that is, 
describe the effect of the nonzero initial conditions. 


Q 15. For the initial value problem in Problem 13, plot u’ versus 
u forw = 0.7, w = 0.8, and w = 0.9. (Recall that such a plot is 
called a phase plot.) Use a t interval that is long enough so that the 
phase plot appears as a closed curve. Mark your curve with arrows to 
show the direction in which it is traversed as f increases. 


Problems 16 through 18 deal with the initial value problem 


1 
u" + gt + 4u = F(t), u(0) =2, u'(0) =0. 


In each of these problems: 

@ a. Plot the given forcing function F(t) versus t, and also plot 
the solution u(t) versus t on the same set of axes. Use at interval 
that is long enough so the initial transients are substantially 
eliminated. Observe the relation between the amplitude and 
phase of the forcing term and the amplitude and phase of the 
response. Note that wọ = \/k/m = 2. 

Q b. Draw the phase plot of the solution; that is, plot u’ versus 
u. 


16. F(t) —3cos(t/4) 

17. F(t) = 3cos(2t) 

18. F(t) = 3cos(6t) 

Q 19. A spring-mass system with a hardening spring (Problem 24 
of Section 3.7) is acted on by a periodic external force. In the absence 


of damping, suppose that the displacement of the mass satisfies the 
initial value problem 


1 
“tut su =coswt, u(0)=0, u'(0) =0. 
a. Let w = 1 and plot a computer-generated solution of the 
given problem. Does the system exhibit a beat? 
b. Plot the solution for several values of w between 1/2 and 2. 
Describe how the solution changes as w increases. 
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CHAPTER4 


Higher-Order Linear 
Differential Equations 


The theoretical structure and methods of solution developed in the preceding chapter for 
second-order linear equations extend directly to linear equations of third and higher order. 
In this chapter we briefly review this generalization, taking particular note of those instances 
where new phenomena may appear, because of the greater variety of situations that can occur 
for equations of higher order. 


41 General Theory of n^ Order 
Linear Differential Equations 


An n® order linear differential equation is an equation of the form 


d" y diy dy 
Po(t + P(t Toce P,_1(t) M + P,(t)y = G(t). 1 
Kr PD s (NF PG) = GC) (1) 
We assume that the functions Po, ... , Pa, and G are continuous real-valued functions on 


some interval J: a < t < f, and that Po is nowhere zero in this interval. Then, dividing 
equation (1) by Po(t), we obtain 


n d” 1 


d"y 7 dy 
t) —— eee Ac) — n(t)y = g(t). 2 
dus TO i Teeg i) Fe + Pay g(t) (2) 


The linear differential operator L of order n defined by equation (2) is similar to the second- 
order operator introduced in Chapter 3. The mathematical theory associated with equation (2) 
is completely analogous to that for the second-order linear equation; for this reason we simply 
state the results for the n" order problem. The proofs of most of the results are also similar to 
those for the second-order equation and are usually left as exercises. 

Since equation (2) involves the n™ derivative of y with respect to f, it will, so to speak, 
require n integrations to solve equation (2). Each of these integrations introduces an arbitrary 
constant. Hence we expect that to obtain a unique solution it is necessary to specify n initial 
conditions 


L[y] = 


= ESI 
y(t) = yo, y'(fo) = yh- YP) = yf”, (3) 


where fọ may be any point in the interval J and yo, yo. ..., "n are any prescribed real 


constants. The following theorem, which is similar to Theorem 3.2.1, guarantees that the initial 
value problem (2), (3) has a solution and that it is unique. 


Theorem 4.1.1 


If the functions p1, p2, ... , Pn, and g are continuous on the open interval /, then there exists exactly 
one solution y = ġ(t) of the differential equation (2) that also satisfies the initial conditions (3), 
where tọ is any point in /. This solution exists throughout the interval 7. 
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We will not give a proof of this theorem here. However, if the coefficients pi, ... , Pn 
are constants, then we can construct the solution of the initial value problem (2), (3) much as 
in Chapter 3; see Sections 4.2 through 4.4. Even though we may find a solution in this case, 
we do not know that it is unique without the use of Theorem 4.1.1. A proof of the theorem can 
be found in Ince (Section 3.32) or Coddington (Chapter 6). 


The Homogeneous Equation. As in the corresponding second-order problem, we first 
discuss the homogeneous equation 


L[y] = y + pity? + + pray’ + Pal y = 0. (4) 


If the functions y1, y2, ... , Yn are solutions of the differential equation (4), then it follows by 
direct computation that the linear combination 


y = c1 y(t) + eoyo(1) +--+ + CnYn(t), (5) 


where c1, ... , c, are arbitrary constants, is also a solution of equation (4). It is then natural 
to ask whether every solution of equation (4) can be expressed as a linear combination 
of yj, ..., Yn. This will be true if, regardless of the initial conditions (3) that are 
prescribed, it is possible to choose the constants c1, ... , c, so that the linear combination 
(5) satisfies the initial conditions. That is, for any choice of the point tọ in Z, and for any 


—1) 


choice of yo, yg, ... , ye" , we must be able to determine c1, ... , c, so that the equations 


C1y1(to) +--+ + Cn Yn(to) = Yo 


cy, (fo) +--+ + cay) (toe) = yo 
, (6) 


cry? (fg) 4- ey" (to) = yt? 


are satisfied. The system (6) of n linear algebraic equations can be solved uniquely for the n 


constants cj, ... , Cn, provided that the determinant of the coefficient matrix is not zero. On 
the other hand, if the determinant of the coefficient matrix is zero, then it is always possible to 
choose values of yo, yo, ... , ye” so that equations (6) do not have a solution. Therefore a 
necessary and sufficient condition for the existence of a solution of equations (6) for arbitrary 
values of yo, yj, -.. Xj ” is that the Wronskian 
MI y» Ut Yn 
yi 35 $e Y, 
Wliyi, --->Ml=| - . . (7) 
(n-1) (n-1) (n-1) 
1 Y2 eee Yn 
is not zero at t = tọ. Since tọ can be any point in the interval /, it is necessary and sufficient 
that W[yj, y2, ... , Yn] be nonzero at every point in the interval. Just as for the second-order 
linear equation, it can be shown that if yj, y», ... , y, are solutions of equation (4), then 
WL), yo. --- , Yn] either is zero for every ¢ in the interval Z or else is never zero there; see 


Problem 15. Hence we have the following theorem. 


Theorem 4.1.2 


If the functions pi, p2, ... , Pn are continuous on the open interval /, if the functions y1, yo. ... , Yn 
are solutions of equation (4), and if W[y1, yo, ... , y4](*) + 0 for at least one point in Z, then every 
solution of equation (4) can be expressed as a linear combination of the solutions y1, yo. ... , Yn- 


Solutions y;, ... , y, of equation (4) whose Wronskian is nonzero are said to form 
a fundamental set of solutions. The existence of a fundamental set of solutions can be 
demonstrated in the same way as for the second-order linear equation (see Theorem 3.2.5). 


4.1 General Theory of nt? Order Linear Differential Equations 


Since all solutions of the homogeneous n" order linear differential equation (4) are of the 
form (5), we use the term general solution to refer to an arbitrary linear combination of any 
fundamental set of solutions of equation (4). 


Linear Dependence and Independence. We now explore the relationship between 
fundamental sets of solutions and the concept of linear independence, a central idea in the 


study of linear algebra. The functions fi, fo, ... , fn are said to be linearly dependent on an 
interval 7 if there exists a set of constants ky, k2, ... , kn, not all zero, such that 

hf 9 ko fa() t t E f(t) =0 (8) 
for all t in J. The functions fi, ... , fn are said to be linearly independent on / if they are 


not linearly dependent there. 


EXAMPLE 1 


Determine whether the functions f(t) = 1, fo(t) = t, and f3(t) = t? are linearly independent or 
dependent on the interval 1: —oo < t < oo. 


Solution: 


Form the linear combination 
ky fi(f) + ko fa(1) + ks fa(t) = ky + kot + kat’, 
and set it equal to zero to obtain 
ky thot + kat? = 0. (9) 


If equation (9) is to hold for all t in 7, then it must certainly be true at any three distinct points in Z. 
Any three points will serve our purpose, but it is convenient to choose t = 0, t = 1, and t = —1. 
Evaluating equation (9) at each of these points, we obtain the system of equations 


ki = 0; 
kı +k +k; = 0, (10) 
kı — ko + kz = 0. 


From the first of equations (10) we note that k; = 0; then from the other two equations it 
follows that k; = k3 = 0 as well. Therefore, there is no set of constants kı, kz, k3, not all zero, for 
which equation (9) holds even at the three chosen points, much less throughout 7. Thus the given 
functions are not linearly dependent on /, so they must be linearly independent. Indeed, they are 
linearly independent on any interval. This can be established just as in this example, possibly using 
a different set of three points. 


EXAMPLE 2 


Determine whether the functions 


A@ =1, fot) =24+t, f(t) 23—0, and f(t) =4t4+2° 


are linearly independent or dependent on any interval /. 


Solution: 
Form the linear combination 


ky f(t) + o f(t) + ka fa(£) + ka falt) = ky +ko(2 +t) +3(3 — 02) + kalt +t’) 
= (ky + 2k; + 3k3) + (ko + Ak4)t + (-kg + Kg). (0) 
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For this expression to be zero throughout an interval, it is certainly sufficient to require that 
ky + 2k; + 3k3 => 0, ky + 4k4 => 0, =k; + k4 = 0. 


These three equations, with four unknowns, have many solutions. For instance, if k4 = 1, then 
k3 = 1, kọ = —4, and kı = 5. If we use these values for the coefficients in equation (11), then these 
functions satisfy the linear relation 


5AA) — 4fo(0) + fa(1) + falt) = 0 


for each value of t. Thus the given functions are linearly dependent on every interval. 


The concept of linear independence provides an alternative characterization of 
fundamental sets of solutions of the homogeneous equation (4). Suppose that the functions 
Yr «ee , Yn are solutions of equation (4) on an interval 7, and consider the equation 


ky) +--+ +knyn(t) = 0. (12) 


By differentiating equation (12) repeatedly, we obtain the additional n — 1 equations 
kiyi) + +--+ Ey) = 0, 


: (13) 
n—1 n— 
kay Pt) +o + kay MC) = 0. 
The system consisting of equations (12) and (13) is a system of n linear algebraic equations for 


the n unknowns Kj, ... , kn. The determinant of coefficients for this system is the Wronskian 
Wlyi, --- > yndX() of y1, ... , Yn. This leads to the following theorem. 


Theorem 4.1.3 


If y,(t), ...,yn(t) form a fundamental set of solutions of the homogeneous n" order linear 
differential equation (4) 


L[y] = Y? + pithy"? +++) + pi a(0y + pa(t)y = 0 


on an interval 7, then y;(t), ... , yn(t) are linearly independent on 7. Conversely, if y;(t), ..., 
ya (t) are linearly independent solutions of equation (4) on 7, then they form a fundamental set of 
solutions on J. 


To prove this theorem, first suppose that y(t), ... , y,(t) form a fundamental set 
of solutions of the homogeneous differential equation (4) on J. Then the Wronskian 
Wlyi, --->¥nl(t) Æ O for every t in J. Hence the system (12), (13) has only the solution 
kı =--- =k, = 0 for every t in IZ. Thus y\(t), ... , y,(t) cannot be linearly dependent on 7 
and must therefore be linearly independent there. 

To demonstrate the converse, let y;(t), ... , Y,(t) be linearly independent on J. To show 
that they form a fundamental set of solutions, we need to show that their Wronskian is never 
zero in J. Suppose that this is not true; then there is at least one point tọ where the Wronskian is 


zero. At this point the system (12), (13) has a nonzero solution; let us denote it by ky, ... , ky. 
Now form the linear combination 
b(t) = KEY) +++ + ey). (14) 
Then y = ¢ (t) satisfies the initial value problem 
Lly]=0, y(t) =0, y'(to) —0, ... "P (i) — 0. (15) 


The function ¢ satisfies the differential equation because it is a linear combination of solutions; 
it satisfies the initial conditions because these are just the equations in the system (12), 
(13) evaluated at tọ. However, the function y(t) = O for all t in Z is also a solution of 
this initial value problem, and by Theorem 4.1.1, the solution to the initial value problem 
(15) is unique. Thus ¢(t) = O for all t in J. Consequently, yi(1), ... , yn(t) are linearly 
dependent on 7, which is a contradiction. Hence the assumption that there is a point where 
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the Wronskian is zero is untenable. Therefore, the Wronskian is never zero on J, as was to be 
proved. 

Note that for a set of functions fi, ... , fa that are not solutions of the homogeneous 
linear differential equation (4), the converse part of Theorem 4.1.3 is not necessarily true. 
They may be linearly independent on J even though the Wronskian is zero at some points, 
or even every point, but with different sets of constants Kj, ... , kn at different points. See 
Problem 18 for an example. 


The Nonhomogeneous Equation. Now consider the nonhomogeneous equation (2) 
Liy] = y  p( t) y? +--+ pa(t)y = a(t). 


If Y; and Y, are any two solutions of equation (2), then it follows immediately from the linearity 
of the operator L that 


LIY — Y;f(t) = LYO — LIYN) = g(t) — g(t) = 0. 


Hence the difference of any two solutions of the nonhomogeneous equation (2) is a solution of 
the homogeneous differential equation (4). Since any solution of the homogeneous equation 
can be expressed as a linear combination of a fundamental set of solutions y1, ... , Yn, it 
follows that any solution of the nonhomogeneous differential equation (2) can be written as 


y=cyyi(t) + eyo(1) +--+ + enyn(t) + Y(t), (16) 


where Y is some particular solution of the nonhomogeneous differential equation (2). 
The linear combination (16) is called the general solution of the nonhomogeneous 
equation (2). 

Thus the primary problem is to determine a fundamental set of solutions { Vip see yn] 
of the homogeneous n" order linear differential equation (4). If the coefficients are constants, 
this is a fairly simple problem; it is discussed in the next section. If the coefficients are not 
constants, it is usually necessary to use numerical methods such as those in Chapter 8 or series 
methods similar to those in Chapter 5. These tend to become more cumbersome as the order 
of the equation increases. 

To find a particular solution Y(t) in equation (16), the methods of undetermined 
coefficients and variation of parameters are again available. They are discussed and illustrated 
in Sections 4.3 and 4.4, respectively. 

The method of reduction of order (Section 3.4) also applies to n® order linear differential 
equations. If y; is one solution of equation (4), then the substitution y = v(t) y,(t) leads to 
a linear differential equation of order n — 1 for v' (see Problem 19 for the case when n — 3). 
However, if n > 3, the reduced equation is itself at least of second order, and only rarely will 
it be significantly simpler than the original equation. Thus, in practice, reduction of order is 
seldom useful for equations of higher than second order. 


Problems 


In each of Problems 1 through 4, determine intervals in which 7. fit) 22t —3, fot) 2? +1, BO 222 — t, 
solutions are sure to exist. fat) =? +t 


(4) 7 = 
1L y" +4y"4+3y =t In each of Problems 8 through 11, verify that the given functions are 


2. t(t— ly ety" Ay =0 solutions of the differential equation, and determine their Wronskian. 
3. (x — Dy +(x + Dy” (tanx)y 20 8. yO+y"=0; 1, t, cost, sint 
4 (x? ER 4) y (9 + xy" +9y=0 9. y" J 2y" _ y! — 2y 20; e, e, e? 


In each of Problems 5 through 7, determine whether the given 10. xy"—y"—20; 1, x, x? 
functions are linearly dependent or linearly independent. If they are 
linearly dependent, find 2s relation pie ne ! MH. °y” +x’y"—2xy' +2y= 0; x, x 1/x 
12. a. Show that W[5, sin? t, cos(27)] = 0 for all t by directly 
5. fit) 22t —3, A) =P? M, BA) 222 -t evaluating the Wrosnkian. 
6. fit) —2t—3, fo(t) = 21? +1, Bt) = 30 t b. Establish the same result without direct evaluation of the 
Wronskian. 
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13. Verify that the differential operator defined by 
L[y] = Y? + py() 97? +++ + pay 
is a linear differential operator. That is, show that 
L[ciy + c2y2] = cı L[y1] + coL yal, 


where y; and y» are n-times-differentiable functions and c, and c» are 
arbitrary constants. Hence, show that if y1, yo, ... , y, are solutions 
of L[y] = 0, then the linear combination c; y; +--+ + cy, is also a 
solution of L[y] — 0. 


14. Let the linear differential operator L be defined by 


L[y] = apy c aiy "7D +--+ + any, 


where ag, a4, ... , a, are real constants. 
a. Find L[t”]. 
b. Find L[e'']. 
c. Determine four solutions of the equation y? —5y"--4y = 0. 
Do you think the four solutions form a fundamental set of 
solutions? Why? 


15. In this problem we show how to generalize Theorem 3.2.7 
(Abel's theorem) to higher-order equations. We first outline the 
procedure for the third-order equation 


y" + pithy” + palt)y’ + p3(t)y — 0. 
Let y1, y2, and y; be solutions of this equation on an interval /. 
a. If W = Wl[yi. y2, y3], show that 


MI y2 y3 


W-» » » 
yy yy! yy! 

Hint: The derivative of a 3-by-3 determinant is the sum of three 
3-by-3 determinants obtained by differentiating the first, second, 
and third rows, respectively. 
b. Substitute for y", y7', and y7" from the differential equation; 
multiply the first row by p3, multiply the second row by p», and 
add these to the last row to obtain 


W = — p(t) W. 


C. Show that 


WIy1, y2, y3l(t) = cexp (- [ mina). 


It follows that W is either always zero or nowhere zero on /. 
d. Generalize this argument to the n™ order equation 


y? + piy? +--+ + pathy =0 


with solutions y1, ... , Yn. That is, establish Abel’s formula 


Wl[yi, --- + Yn](t) = cexp (- f mo) (17) 


for this case. 
In each of Problems 16 and 17, use Abel's formula (17) to find the 
Wronskian of a fundamental set of solutions of the given differential 
equation. 
16. y” +2y"—y' —3y=0 
17. ty” +2y"—y'+ty=0 
18. Let f(t) = t?°]t| and g(t) = t°. 
a. Show that the functions f(t) and g(t) are linearly dependent 
o0<r<l. 
b. Show that f(t) and g(t) are linearly dependent on 
-1<1r<0. 
€. Show that f(t) and g(t) are linearly independent on 
-l<t<l. 
d. Show that W[f, g](£) is zero for allt in —1 < t < 1. 
e. Explain why the results in c and d do not contradict 
Theorem 4.1.3. 
19. Show that if y; is a solution of 
y" + pit) y" + pa(0 y' + p3(t)y — 0, 
then the substitution y = y,(t)v(t) leads to the following second- 
order equation for v’: 


yiv” + (3yj + piyv" + (3y? + 2piy + poyi)v’ = 0. 
In each of Problems 20 and 21, use the method of reduction of order 
(Problem 19) to solve the given differential equation. 
20. (2—0y" -(2t —3)y"—ty -y 0, t<2; y) =e 


21. P(t +y" —3t(t -2)y" -6(12-0y'—6y 20, t > 0; 
yi) mE, yit mc 


42 Homogeneous Differential Equations 
with Constant Coefficients 


Consider the n'^ order linear homogeneous differential equation 


L[y] = apy + ayy") +--+ + any’ + any = 0, (1) 


where ag, a4, .. 


. , An are real constants and ag zz 0. From our knowledge of second-order 


linear equations with constant coefficients, it is natural to anticipate that y = e"' is a solution 
of equation (1) for suitable values of r. Indeed, 


Le] = e" (aor" + ay"! +-+ + anir + dn) = e" Z(r) (2) 
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for all r, where 
Z(r) = aor" -- ar"! +-+- + anr + an. (3) 


For those values of r for which Z(r) = 0, it follows that L[e’’] = O and y = e” isa solution of 
equation (1). The polynomial Z(r) is called the characteristic polynomial, and the equation 
Z(r) = 0 is the characteristic equation of the differential equation (1). Since ag 4 0, we 
know that Z(r) is a polynomial of degree n and therefore has n zeros,! say, ri, ro, .. . , Fns 
some of which may be equal and some of which may be complex-valued. Hence we can write 
the characteristic polynomial in the form 


Z(r) —ao(r — ri)(r = 72): (a — r4). (4) 


Real and Unequal Roots. If the roots of the characteristic equation are real and no two 


are equal, then we have n distinct solutions e", e", ... , e'' of equation (1). If these 


functions are linearly independent, then the general solution of the homogeneous n'" order 
linear differential equation (1) is 

y — cue"! - coe"! +++» H ee. (5) 
One way to establish the linear independence of e", e", ... , e^! is to evaluate their 


Wronskian determinant; another way is outlined in Problem 30. 


EXAMPLE 1 


Find the general solution of 


y +y" = Ty" — y by e 0. (6) 
Also find the solution that satisfies the initial conditions 
x0) =1, y(0-20, y'(0-—-2, y"(0) = -1. (7) 
Plot its graph and determine the behavior of the solution as t — oo. 


Solution: 


Assuming that y = e"', we must determine r by solving the polynomial equation 


r* xD -Tr -—r+6=0. (8) 
The roots of this equation are r; = 1, r2 = — 1, r4 = 2, and r4 = —3. Therefore, the general solution 
of differential equation (6) is 
y 2 cyel toe! + cse?! + cae 7. (9) 
The initial conditions (7) require that cj, ... , c4 satisfy the four equations 


catat c3+ c4-— d, 
cı — c2 +2c3— 3c4 = 0, 
cı +c +4c3 + 9c4 
Cy — Co + 8c3 — 27c4 


(10) 


sll 
h. s 
Pp N 


! An important question in mathematics for more than 200 years was whether every polynomial equation has at least 
one root. The affirmative answer to this question, the fundamental theorem of algebra, was given by Carl Friedrich 
Gauss (1777—1855) in his doctoral dissertation in 1799, although his proof does not meet modern standards of rigor. 
Several other proofs have been discovered since, including three by Gauss himself. Today, students often meet the 
fundamental theorem of algebra in a first course on complex variables, where it can be established as a consequence 
of some of the basic properties of complex analytic functions. 
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By solving this system of four linear algebraic equations, we find that 


y = —e' + —e e &. (11) 


The graph of the solution is shown in Figure 4.2.1. Observe that the dominant term, as t — oo, in 


. . 2 : 
the solution is "di . As a result, we conclude that the solution approaches —oo as t — oo. 


Solution of the initial value problem (6), (7): 


y? ty" —Ty" — y! +6y — 0, y(0) =1, y'(0) — 0, y"(0) = —2, 
y"(0) = 1. 


As Example 1 illustrates, the procedure for solving an n order linear differential 
equation with constant coefficients depends on finding the roots of a corresponding n'" degree 
polynomial equation. If initial conditions are prescribed, then a system of n linear algebraic 
equations must be solved to determine the proper values of the constants cj, ... , Cn. Each of 
these tasks becomes much more complicated as n increases, and we have omitted the detailed 
calculations in Example 1. Computer assistance can be very helpful in such problems. 

For third and fourth degree polynomials there are formulas,” analogous to the formula 
for quadratic equations but more complicated, that give exact expressions for the roots. 
Root-finding algorithms are readily available on calculators and computers. Sometimes 
they are included in the differential equation solver, so that the process of factoring the 
characteristic polynomial is hidden and the solution of the differential equation is produced 
automatically. 

If you are faced with the need to factor the characteristic polynomial by hand, here is one 
result that is sometimes helpful. Suppose that the polynomial 


agr" Fair" | p. pa, ara, =0 (12) 


has integer coefficients. If r = p/q is a rational root, where p and q have no common factors, 
then p must be a factor of a,, and q must be a factor of ag. For example, in equation (8) 
the factors of ag are +1 and the factors of a, are +1, +2, +3, and +6. Thus the only possible 
rational roots of this equation are +1, +2, +3, and +6. By testing these possible roots, we find 
that 1, — 1, 2, and —3 are actual roots. In this case there are no other roots, since the polynomial 


?The method for solving the cubic equation was apparently discovered by Scipione dal Ferro (1465-1526) about 
1500, although it was first published in 1545 by Girolamo Cardano (1501-1576) in his Ars Magna. This book also 
contains a method for solving quartic equations that Cardano attributes to his pupil Ludovico Ferrari (1522-1565). 
The question of whether analogous formulas exist for the roots of higher degree equations remained open for more 
than two centuries, until 1826, when Niels Abel showed that no general solution formulas can exist for polynomial 
equations of degree five or higher. A more general theory was developed by Evariste Galois (1811-1832) in 1831, 
but unfortunately it did not become widely known for several decades. 
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is of fourth degree. If some of the roots are irrational or complex, as is usually the case, then 
this process will not find them, but at least the degree of the polynomial can be reduced by 
dividing the polynomial by the factors corresponding to the rational roots. 

If the roots of the characteristic equation are real and different, we have seen that the 
general solution (5) is simply a sum of exponential functions. For large values of t the solution 
is dominated by the term corresponding to the algebraically largest root. If this root is positive, 
then solutions become exponentially unbounded, approaching +00 or —oo depending on the 
sign of the coefficient of the dominant term in the solution. If the largest root is negative, then 
solutions tend exponentially to zero. Finally, if this root is zero, then solutions approach a 
nonzero constant as t becomes large. Of course, for certain initial conditions, the coefficient 
of the otherwise dominant term may be zero; then the nature of the solution for large f is 
determined by the next largest root. 


Complex Roots. Since the coefficients ap, aj, a5, ... , a, are real numbers, if the 
characteristic equation has complex roots, they must occur in conjugate pairs, A +i i. That is, 
when r = A +i, is a root of the characteristic equation, sois 7 = A — i u. Provided that none 
of the roots is repeated, equation (5) still gives the form of the general solution of equation (1). 
However, just as for the second-order differential equation (Section 3.3), we can replace the 
complex-valued solutions e^)! and e—‘/)" by the real-valued solutions 


e" cos(ut), eMsin(yut) (13) 
obtained as the real and imaginary parts of e */?!, Thus, even though some of the roots of 


the characteristic equation are complex, it is still possible to express the general solution of 
equation (1) as a linear combination of real-valued solutions. 


EXAMPLE 2 


Find the general solution of 


y^? —y-0. (14) 
Also find the solution that satisfies the initial conditions 
T t H 5 m 
y(0) = yo? (0 =—4, y(0- o? (0) = —2 (15) 


and draw its graph. 


Solution: 


Substituting e"' for y, we find that the characteristic equation is 
r—L=(r?-1)(7? +1) 20. 
Therefore, the roots are r = 1, —1, i, and —i, and the general solution of equation (14) is 
y = cie! + coe ! - c4 cost + ca sint. 


If we impose the initial conditions (15), we obtain (see Problem 26a) 


thus the solution of the given initial value problem is 
1 
y=3e + zst- sint. (16) 


The graph of this solution is shown in Figure 4.2.2. 

Observe that the initial conditions (15) cause the coefficient cı of the exponentially growing 
term in the general solution to be zero. Therefore, this term is absent in the solution (16), which 
describes an exponential decay to a steady oscillation, as Figure 4.2.2 shows. However, if the initial 
conditions are changed slightly, then c; is likely to be nonzero, and the nature of the solution changes 
enormously. For example, if the first three initial conditions remain the same, but the value of y"(0) 
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| FIGURE 4.2.2 | Solution of the initial value 


T 
problem (14), (15): y? — y = 0, (0) = 5, 


5 
y(0) 2 —4, y” (0) = =, y” (0) = —2. 


2 
15 
is changed from —2 to —$ then the solution of the initial value problem becomes (see Problem 
26b) 
1 95 1 17 
y= ag aye 430 T sint. (17) 


The coefficients in the solution (17) differ only slightly from those in the solution (16), but the 
exponentially growing term, even with the relatively small coefficient of 32 completely dominates 


the solution by the time f is larger than about 4 or 5. This is clearly seen in Figure 4.2.3, which shows 
the graphs of the two solutions (16) and (17). 


MBH Ww e 


4 6 t 


Two solutions of the 


homogeneous differential equation (14). The 
blue curve satsifies the initial conditions (15); 
the red curve satisfies the modified problem in 
which the last initial condition is changed to 
y"(0) = —15/8. 


Repeated Roots. If the roots of the characteristic equation are not distinct—that is, if 
some of the roots are repeated —then the solution (5) is clearly not the general solution 
of equation (1). Recall that if rı is a repeated root for the second-order linear equation 
agy" + ayy’ + ay = 0, then two linearly independent solutions are e"! and te’. For an 
equation of order n, if a root of Z(r) = 0, say r = rı, has multiplicity s (where s < n), then 


gt te, Pent, o... pole (18) 


are corresponding solutions of equation (1). See Problem 31 for a proof of this statement, 
which is valid whether the repeated root is real or complex. 

Note that a complex root can be repeated only if the differential equation (1) is of 
order four or higher. If a complex root A + iu is repeated s times, the complex conjugate 
A — ips is also repeated s times. Corresponding to these 2s complex-valued solutions, we 
can find 2s real-valued solutions by noting that the real and imaginary parts of e(^*"?', 
te Dt o, tletin) are also linearly independent solutions: 


e" cos(ut), e" sin(ut), te" cos(ut), te" sin(pt), ..., 
Ple cos( ut), £e gin( nt). 


Hence the general solution of equation (1) can always be expressed as a linear combination of 
n real-valued solutions. Consider the following example. 
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EXAMPLE 3 


Find the general solution of 
yO 4 2y" c y 2 0. (19) 


Solution: 


The characteristic equation is 
r* x27 -1-(0?-D( +1) 20. 


Since r? + 1 = (r — i)(r + i), it follows that the roots of the characteristic equation are 7; = i and 
rj; = —i. Each of these roots has multiplicity 2. Thus the general solution of equation (19) is 


y = cı cost +c) sint + cat cost + cat sint. 


In determining the roots of the characteristic equation, it may be necessary to compute 
the cube roots, the fourth roots, or even higher roots of a (possibly complex) number. This 
can usually be done most conveniently by using Euler's formula e! = cost + i sint and the 
algebraic laws given in Section 3.3. This is illustrated in the following example. 


EXAMPLE 4 


Find the general solution of 
yO 4 y 2 0. (20) 
Solution: 
The characteristic equation is 
ré+1=0. 


To solve the equation, we must compute the fourth roots of —1. Now —1, thought of as a complex 
number, is —1 + Oi. It has magnitude 1 and polar angle 7: 


—l=cos7 +isinz = e". 


Moreover, because sin(x) and cos(x) both have period 27, the angle is determined only up to a 
multiple of 27: 


—1=cos(m + 2mm) -- i sin + 2mr) = eC 2mm) 


where m is zero or any positive or negative integer. Now, by the properties of exponents, 


(-1)'/4 = (ein tammy) ^ = el(a/4+mn/2) _ e(t + 7) +i Ze + 7) 


The four fourth roots of —1 are obtained by setting m = 0, 1, 2, and 3; they are 
1+i =li -1 —i 1—i 
va y2 ya; ya 
It is easy to verify that, for any other value of m, we obtain one of these four roots. For 


example, corresponding to m = 4, we obtain ( 1 +i) / 2 . The general solution of the homogeneous 
fourth-order linear differential equation (20) is 


y = e'N? Cea) +c n(5)) peti? Gea) e). 
(21) 
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In conclusion, we note that the problem of finding all the roots of a polynomial equation 
may not be entirely straightforward, even with computer assistance. In particular, it may be 
difficult to determine whether two roots are equal or merely very close together. Recall that 
the form of the general solution is different in these two cases. 

If the constants dao, a1, ... , a, in equation (1) are complex numbers, the solution of 


equation (1) is still of the form (4). In this case, however, the roots of the characteristic equation 
are, in general, complex numbers, and it is no longer true that the complex conjugate of a root 
is also a root. The corresponding solutions are complex-valued. 


Problems 


In each of Problems 1 through 4, express the given complex number 
in polar form R(cos0 + isin) = Re”? 


l. 144 
2. -14+ i 
3. -3 


4. J3-i 


In each of Problems 5 through 7, follow the procedure in Example 4 
to determine the indicated roots of the given complex number. 


5, 11/3 
6. (1-7? 
7. (2(cos(m/3) +i sin /3))) ? 


Ineach of Problems 8 through 19, find the general solution of the given 
differential equation. 


8. y" — y! — y +p =O 
9. y" —3y" -3y'- y 20 


10. y(9 — 4y" -- Ay" =0 

11. y+y=0 

12. y(9 —3y% 43y”-y=0 

13. y® —y”=0 

14. yO) —3y(09 4 3y" — 3y"  2y =0 
15. y® +8y + 16y 20 


16. y(9 -2y" y z0 

17. y" +5y” +6y’+2y =0 

Q 18. yO —7y" + 6y" + 30y’ — 36y =0 
Q 19. 12y% + 31y”+ 75y" - 37y'+ 5y =0 


In each of Problems 20 through 25, find the solution of the given initial 
value problem, and plot its graph. How does the solution behave as 
t — oo? 


Q 20. y"+y'=0; 


O 21. yX+y=0; 
y'(0 —-1, y"(0) 20 


O 22. y(9 — 4y" + Ay" = 0; 
y") 20, y"(1) 20 


O 23. 2y — y" — 9y" + Ay' c Ay = 0; 
y0) =0, y"(0) — —2, y"(0) =0 


Q 24. 4y”+y’4+5y=0; y(0-22, y(0 21, y/'(0 2-1 
© 25. 6y”+5y"+y'=0; y(0) 2 2, y(0 22, y'(0 20 


y(0) 20, y(0) —1, y'(0) 22 
y(0) 20, y'(0) =0, 


yD) =-1, y(1) =2, 


y(0) = —2, 


26. a. Verify that y(t) 23e ' + 1 cost — sint is the solution to 
7 B os 
yO -y =0, (0) = 5,00) = —4 y"(0) = 5, y" (0) = 2. 


7 
O b. Find the solution to y — y = 0, y(0) = T y'(0) = —4, 


5 15 
H" Xon mU" . 
yO exp (0) y 


Note: These are the initial value problems considered in Example 2. 
27. Show that the general solution of y(® — y = 0 can be written as 
y = cı cost + c sint + c3cosht + c4 sinh f. 


Determine the solution satisfying the initial conditions y(0) = 0, 
y (0) = 0, y"(0) = 1, y” (0) = I. Why is it convenient to use the 
solutions cosh f and sinh f rather than e' and e™*? 
28. Consider the equation y? — y = 0. 
a. Use Abel's formula (Problem 15d of Section 4.1) to find 
the Wronskian of a fundamental set of solutions of the given 
equation. 
b. Determine the Wronskian of the solutions e', e~’, cost, and 
sint. 
C. Determine the Wronskian of the solutions cosh f, sinh f, cos f, 
and sin f. 


29. Consider the spring-mass system, shown in Figure 4.2.4, 
consisting of two unit masses suspended from springs with spring 
constants 3 and 2, respectively. Assume that there is no damping in 
the system. 
a. Show that the displacements u; and u» of the masses from 
their respective equilibrium positions satisfy the equations 


ut + 5u, = 2us, uy + 2u5 = 2u,. (22) 
b. Solve the first of equations (22) for u» and substitute into the 
second equation, thereby obtaining the following fourth-order 
equation for u4: 


u\ + Tu! + 6u, = 0. (23) 
Find the general solution of equation (23). 
c. Suppose that the initial conditions are 


uj(0) =1, u4(0) —0, (0) =2, u0) =0. (24) 


Use the first of equations (22) and the initial conditions (24) to 
obtain values for u^(0) and u^ (0). Then show that the solution 
of equation (23) that satisfies the four initial conditions on u, 
is uj(f) = cost. Show that the corresponding solution uw is 
u(t) = 2cost. 


d. Now suppose that the initial conditions are 


uj(0 — —2, u4(0 —0, w(0)=1, u$(0) =0. (25) 


Proceed as in part c to show that the ae) solutions are 


u(t) = -2ew( V/6r) and u(t) = cos ver). 


e. Observe that the solutions obtained in parts c and d describe 
two distinct modes of vibration. In the first, the frequency of the 
motion is 1, and the two masses move in phase, both moving 
up or down together; the second mass moves twice as far as 
the first. The second motion has frequency /6, and the masses 
move out of phase with each other, one moving down while the 
other is moving up, and vice versa. In this mode the first mass 
moves twice as far as the second. For other initial conditions, not 
proportional to either of equations (24) or (25), the motion of the 
masses is a combination of these two modes. 


| FIGURE 4.2.4 | A two-spring, two-mass system. 


30. In this problem we outline one way to show that if rj, ... , fn 
are all real and different, then e’!’, ... , e"! are linearly independent 
on —co « t < oo. To do this, we consider the linear relation 


cie V fees + ene! =0, —ooct« oo (26) 


and show that all the constants are zero. 
a. Multiply equation (26) by e"! 
to t, thereby obtaining 


and differentiate with respect 


C(r2 — rpe”)! +-+ Cnlfn— T T)" =0. 
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—(ro—ri)t 


b. Multiply the result of part a by e and differentiate 


with respect to t to obtain 


—r3)t 
es(rs — ra) (r3  r)e 77? 
s (rn-rg)t |. 
tee + Cnn — ro), — re — 0. 
c. Continue the procedure from parts a and b, eventually 
obtaining 
Gu Ung eru deer re e P" en 


Hence c, = 0, and therefore, 
yaf r. t 
ce! +- Ce" = 0. 


d. Repeat the preceding argument to show that c,_; = 0. In 
a similar way it follows that c,_» = ::- = c, = 0. Thus the 


"1! are linearly independent. 


x ryt 
functionse!,...,e 


31. In this problem we indicate one way to show that if r = ry is 
a root of multiplicity s of the characteristic polynomial Z(r), then 
eM, te M, 2. , tle" are solutions of equation (1). This problem 
extends to n™ order equations the method for second-order equations 
given in Problem 17 of Section 3.4. We start from equation (2) in the 


text 
Lie] 2 e" Z(r) (27) 


and differentiate repeatedly with respect to r, setting r = rı after each 
differentiation. 
a. Recall that if rı is a root of multiplicity s, then 
Z(r) = (r — rj)*q(r), where q(r) is a polynomial of degree 
n — s and q(ri) Æ 0. Show that Z(ri), Z'(rj), ... , ZG ^U (rj) 
are all zero, but Z? (r1) Æ 0. 
b. By differentiating equation (27) repeatedly with respect to r, 
show that 


[7 rt] — [7 FF n rt 
57 ble" = Ls | = Lite", 


s—l 


Óürs-l 


€. Show that e", re", 
equation (27). 


Lle"'] = L[t? | e]. 


= Fit š 
L.. tle"! are solutions of 


43. The Method of Undetermined Coefficients 


A particular solution Y of the nonhomogeneous n"™ order linear differential equation with 


constant coefficients 


L[y] = apy + ayy "7? + +++ + any’ + any = g(t) (1) 


can be obtained by the method of undetermined coefficients, provided the nonhomogeneous 
term g(t) is of an appropriate form. Although the method of undetermined coefficients is not 
as general as the method of variation of parameters described in the next section, it is usually 


much easier to use when it is applicable. 
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Just as for the second-order linear differential equation, when the constant coefficient 
linear differential operator L is applied to a polynomial Aot" + At”! + ... + Am, an 
exponential function e^', a linear combination of sine and cosine functions a, cos( Gt) + 
dy sin( 6t), the result is a polynomial, an exponential function, or a linear combination of 
sine and cosine functions, respectively. Hence, if g(t) is a sum of polynomials, exponentials, 
sines, and cosines, or products of such functions, we can expect that it is possible to find Y(t) 
by choosing a suitable combination of polynomials, exponentials, and so forth, multiplied by 
a number of undetermined constants. The constants are then determined by substituting the 
assumed expression into the nonhomogeneous linear differential equation (1). 

The main difference in using this method for higher-order equations stems from the 
fact that roots of the characteristic polynomial equation may have multiplicity greater than 
2. Consequently, terms proposed for the nonhomogeneous part of the solution may need to 
be multiplied by higher powers of ¢ to make them different from terms in the solution of 
the corresponding homogeneous equation. The following examples illustrate this. In these 
examples we have omitted numerous straightforward algebraic steps, because our main goal 
is to show how to arrive at the correct form for the assumed solution. 


EXAMPLE 1 


Find the general solution of 
y" — 3y" c 3y' - y = Ae. Q) 
Solution: 
The characteristic polynomial for the homogeneous equation corresponding to equation (2) is 
r? — 3r? +3r— 1 = (r — 1), 
so the general solution of the homogeneous equation is 
yet) 2 cje! + stel + cs) el. (3) 


To find a particular solution Y(t) of equation (2), we start by assuming that Y(t) = Ae’. 
However, since e’, te’, and 1?e are all solutions of the homogeneous equation, we must multiply this 
initial choice by t°. Thus our final assumption is that Y(t) = Arte’, where A is an undetermined 
coefficient. 

To find the correct value for A, differentiate Y(t) three times, substitute for y and its derivatives 
in equation (2), and collect terms in the resulting equation. In this way we obtain 


6Ae! = 4e'. 
2 ] TE 
Thus A — 3 and the particular solution is 
2 
Y(t) = she. (4) 


The general solution of the nonhomogeneous differential equation (2) is the sum of ye(t) from 
equation (3) and Y(t) from equation (4): 


2 
y = cie 4 cote! + cte! + zie. 


EXAMPLE 2 


Find a particular solution of the equation 


y® +2y" +y = 3sint — 5cost. (5) 


4.3 The Method of Undetermined Coefficients 183 


Solution: 
The general solution of the homogeneous equation was found in Example 3 of Section 4.2; it is 
ye(t) = cy cost + co sint + cat cost + cat sint, (6) 


corresponding to the roots r = i, i, —i, and —i of the characteristic equation. Our initial assumption 
for a particular solution is Y(t) = A sint + B cos t, but we must multiply this choice by £? to make 
it different from all solutions of the homogeneous equation. Thus our final assumption is 


Y(t) = AP sint + B cost. 


Next, we differentiate Y(t) four times, substitute into the differential equation (5), and collect terms, 
obtaining finally 


—8A sint — 8B cost = 3 sint — 5cost. 


5 
Thus A = — 3° B= 3° and the particular solution of equation (4) is 


35. 55 
Y(t) niu sin? + «t COS f. (7) 


If g(t) is a sum of several terms, it may be easier in practice to compute separately 
the particular solution corresponding to each term in g(t). In the same way as for second- 
order differential equations, the particular solution of the complete problem is the sum of the 


particular solutions of the individual component problems. This is illustrated in the following 
example. 


EXAMPLE 3 


Find a particular solution of 
y" — 4y' 2 t - 3cost +e”, (8) 


Solution: 


First we solve the homogeneous equation. The characteristic equation is r? — 4r — 0, and the roots 
are r = 0, +2; hence 


y(t) = ci + coe? + cse. 


We can write a particular solution of equation (8) as the sum of particular solutions of the differential 
equations 


y" zu 4y' =t, y" = Ay’ = 3cos t, y" 4y’ = e. 


Our initial choice for a particular solution Y,(t) of the first equation is Aot + Aj, but a constant is a 
solution of the homogeneous equation, so we multiply by t. Thus 


Y\(t) = t( Aot + Aj). 
For the second equation we choose 
Y(t) = Bcost + C sint, 


and there is no need to modify this initial choice since sin? and cos: are not solutions of the 
homogeneous equation. Finally, for the third equation, since e^?' is a solution of the homogeneous 
equation, we assume that 


Y3(t) = Ete. 


The constants are determined by substituting into the individual differential equations; they are 
1 1 
Ap = Ty A120, B=0, C= T7 and E = 3" Hence a particular solution of equation (8) is 


1 3 1 
Y(t) = x À E- sint + FE (9) 
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You should keep in mind that the amount of algebra required to calculate the coefficients 
may be quite substantial for higher-order equations, especially if the nonhomogeneous term 
is even moderately complicated. A computer algebra system can be extremely helpful in 
executing these algebraic calculations. 

The method of undetermined coefficients can be used whenever it is possible to guess the 
correct form for Y(t). However, this is usually impossible for differential equations not having 
constant coefficients, or for nonhomogeneous terms other than the type described previously. 
For more complicated problems we can use the method of variation of parameters, which is 


discussed in the next section. 


Problems 


In each of Problems | through 6, determine the general solution of the 
given differential equation. 
1. y” ay" — y +y=2e" +3 
y(9 — y =3t + cost 
y" py" +y +y=e" +4t 
y® — 4y! 2 (2 +e! 
y(9 4 2y" + y 2 3 + cos 2t 
6. yO y" ct 
In each of Problems 7 through 9, find the solution of the given initial- 
value problem. Then plot a graph of the solution. 


Q 7. »"«4y-5 y(0) =y'(0) =0, "(0-21 

Q 8 y®+2y"+y=3t+4; y(0) = y'(0) =0, 

y'(0) 2 y"(0) = 1 

Q 9. yO -2y" + y" -8y' 2 12y = I2sint — e^; 

y(0) 23, y'(0) 20, y"(0) ——1, y"(0) 22 

In each of Problems 10 through 13, determine a suitable form for 


Y(t) if the method of undetermined coefficients is to be used. Do not 
evaluate the constants. 


10. y"—2y" +y =P 2e 

11. y" —y'-te^! +2cost 

12. yO — y" — y" E y' 2 1? E A p tint 

13. yO + 2y" + 2y" = 3e! + 2te + e^! sint 


14. Consider the nonhomogeneous n' order linear differential 
equation 


Ur de oq 


aoy? tay") Toca = g(t), (10) 
, à are constants. Verify that if g(t) is of the form 


e"! (bot™ qais bin), 


where ag, ... 


then the substitution y = e°‘u(t) reduces equation (10) to the form 
kyu + ku 7D +--+ — bot” +--+ bm, AD 


where kg, ... , kn are constants. Determine Ko and k, in terms of the 
a's and a. Thus the problem of determining a particular solution of 
the original equation is reduced to the simpler problem of determining 
a particular solution of an equation with constant coefficients and a 
polynomial for the nonhomogeneous term. 

Method of Annihilators. In Problems 15 through 17, we consider 
another way of arriving at the proper form of Y(t) for use in the 
method of undetermined coefficients. The procedure is based on the 


observation that exponential, polynomial, or sinusoidal terms (or sums 
and products of such terms) can be viewed as solutions of certain linear 
homogeneous differential equations with constant coefficients. It is 


d 
convenient to use the symbol D for a Then, for example, e~ is a 
solution of (D + 1) y = 0; the differential operator D + 1 is said to 
annihilate, or to be an annihilator of, e~'. In the same way, D? +4 


is an annihilator of sin 2t or cos2t, (D — 3)? = D? — 6D + 9 is an 
annihilator of e% or te*’, and so forth. 


15. Show that linear differential operators with constant coefficients 
obey the commutative law. That is, show that 


(D—a)(D-b)f =(D—b)(D—a)f 


for any twice-differentiable function f and any constants a and b. The 
result extends at once to any finite number of factors. 


16. Consider the problem of finding the form of a particular solution 
Y(t) of 
(D —2)7(D + DY —3e? — te^, (12) 
where the left-hand side of the equation is written in a form 
corresponding to the factorization of the characteristic polynomial. 
a. Show that D — 2 and ( D + 1)?, respectively, are annihilators 
of the terms on the right-hand side of equation (12), and that the 
combined operator ( D — 2)( D 4- 1)? annihilates both terms on 
the right-hand side of equation (12) simultaneously. 
b. Apply the operator (D — 2)(D + 1)? to equation (12) and 
use the result of Problem 15 to obtain 


(D —2)*(D + 1?Y 20. (13) 


Thus Y is a solution of the homogeneous equation (13). By 
solving equation (13), show that 


Y(t) = cie + eyte? +est e” + cyte + cse 
+ cgte^! + cte, (14) 


where c1, ... , c; are constants, as yet undetermined. 

C. Observe that e”, te”, te”, and e^' are solutions of the 
homogeneous equation corresponding to equation (12); hence 
these terms are not useful in solving the nonhomogeneous 
equation. Therefore, choose cy, c», C3, and cs to be zero in 
equation (14), so that 


Y(t) = c4Pe? + cgte^! + cit? et. (15) 
This is the form of the particular solution Y of equation (12). 


The values of the coefficients c4, cg, and c; can be found by 
substituting from equation (15) in the differential equation (12). 


Summary of the Method of Annihilators. Suppose that 


L(D)y = g(t), (16) 
where L( D) is a linear differential operator with constant coefficients, 
and g(t) is a sum or product of exponential, polynomial, or sinusoidal 
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which is a homogeneous equation of higher-order. 

c. Solve equation (17). 

d. Eliminate from the solution found in step c the terms that 
also appear in the solution of L(D)y = 0. The remaining 
terms constitute the correct form of a particular solution of 


terms. To find the form of a particular solution of equation (16), you 
can proceed as follows: 
a. Find a differential operator H(D) with constant coeffi- 
cients that annihilates g(t)—that is, an operator such that 
H(D)g(t) =0. 
b. Apply H( D) to equation (16), obtaining 


H(D)LCD) y — 0, (7) 


equation (16). 


not evaluate the coefficients. 


44 The Method of Variation of Parameters 


The method of variation of parameters for determining a particular solution of the 
nonhomogeneous n" order linear differential equation 


L[y] = Y? + pithy? +- + prt)’ + Palt) y = g(t) (1) 


is a direct extension of the method for the second-order differential equation (see Section 
3.6). As before, to use the method of variation of parameters, it is first necessary to solve the 
corresponding homogeneous differential equation. In general, this may be difficult unless the 
coefficients are constants. However, the method of variation of parameters is still more general 
than the method of undetermined coefficients in that it leads to an expression for the particular 
solution for any continuous function g, whereas the method of undetermined coefficients is 
restricted in practice to a limited class of functions g. 

Suppose then that we know a fundamental set of solutions yj, Y2, ... , y, of the 
homogeneous equation. Then the general solution of the homogeneous equation is 


ye(t) = cy y(t) + eoyx(t) +++) + cnyp(t). (2) 


The method of variation of parameters for determining a particular solution of equation (1) 
rests on the possibility of determining n functions uj, u2, ... , Un such that Y(t) is of the form 


Y(t) = u(t) y(t) + u(t) yo(t) +++ + Unt) Yat). (3) 


Since we have n functions to determine, we will have to specify n conditions. One of 
these is clearly that Y satisfy equation (1). The other n — 1 conditions are chosen so as to 
make the calculations as simple as possible. Since we can hardly expect a simplification in 
determining Y if we must solve high order differential equations for u1, ... , Un, itis natural 


to impose conditions to suppress the terms that lead to higher derivatives of uj, ... , Un. From 
equation (3) we obtain 
Y' = (uy, + Uy, b Unya) + Qn yy H uya + +++ sys), (4) 


where we have omitted the independent variable t on which each function in equation (4) 
depends. Thus the first condition that we impose is that 


uyi + Uy d d us ys = 0. (5) 

It follows that the expression (4) for Y’ reduces to 
Y' = uyy, +uoy, t: + uny,. (6) 
We continue this process by calculating the successive derivatives Y", ... , Y "^U, After 
each differentiation we set equal to zero the sum of terms involving derivatives of u1, ... , Up. 


In this way we obtain n — 2 further conditions similar to equation (5); that is, 
(m) (m) 


uyy busy, +e uy =0, m=1,2,...,n— 2. (7) 
As a result of these conditions, it follows that the expressions for Y", ... , Y (^-D reduce to 
YO — uy Huygh ua, m—2,8,...,n-1. (8) 


17. Use the method of annihilators to find the form of a particular 
solution Y(t) for each of the equations in Problems 10 through 13. Do 
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Finally, we need to impose the condition that Y must be a solution of equation (1). By 
differentiating Y "^U from equation (8), we obtain 


(n) (n—1) 


YO = (uyi cus) + uy over uy). (9) 


To satisfy the differential equation we substitute for Y and its derivatives in equation (1) 
from equations (3), (6), (8), and (9). Then we group the terms involving each of the 
functions yi, ... , y, and their derivatives. It then follows that most of the terms in the 
equation drop out because each of yi, ... , y, is a solution of equation (1) and therefore 
L[y;] 20,4 = 1,2, ... , n. The remaining terms yield the relation 


=] 1 
P P H HUIT” =g. (10) 


Equation (10), equation (5), and the n — 2 equations (7) provide n simultaneous linear 


/ 


nonhomogeneous algebraic equations for u^, Uh, ... , uh: 


yit + you, +-+- + Ynt’, = 0, 
yy + Yu t b yu, = O, 


Mest 


yiu + yiu, +--+ + yu), = 0, (11) 


1 
yn "ul diee aa 1) ul = g. 


The system (11) is a linear algebraic system for the unknown quantities u,, ... ,u’,. 


By solving this system and then integrating the resulting expressions, you can obtain the 
coefficients u1, ... , Un. A sufficient condition for the existence of a solution of the system of 
equations (11) is that the determinant of coefficients is nonzero for each value of t. However, 
the determinant of coefficients is precisely W[y1, Y2, ... , Yn], and it is nowhere zero since 
Yr «e. , Yn is a fundamental set of solutions of the homogeneous equation. Hence it is possible 
to determine u^, ... , u,. Using Cramer’s” rule, we can write the solution of the system of 
equations (11) in the form 

uo SO m —1,2, ...,n. (12) 
Here W(t) = Wlyi, yo. ... , Yn](t), and W,, is the determinant obtained from W by replacing 
the m column by the column (0, 0, ... , 0, 1)7. With this notation a particular solution of 
equation (1) is given by 


8G) Was) 3 
Y = m 5, 13 
(oe d x) e (3) 


m=1 


where tọ is arbitrary. 


EXAMPLE 1 


Given that y;(7) = e', y2(t) = te’, and y3(t) = e™ are solutions of the homogeneous equation 
corresponding to 


Ih 


y"—y'—y-ttyz-g(). (14) 


determine a particular solution of equation (14) in terms of an integral. 


3Cramer’s rule is credited to the Swiss mathematician Gabriel Cramer (1704-1752), professor at the Académie de 
Calvin in Geneva, who published it in a general form (but without proof) in 1750. For small systems the result had 
been known earlier. 
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Solution: 

We use equation (13). First, we have 

el 

W(t) = W[e',te',e'Y1) = |e! (t4 De! —e^|. 
e! (t + 2)e! eT 


Factoring e' from each of the first two columns and e™ from the third column, we obtain 


1 t 1 
Wi) -e1 t+1 -I1|. 
1 t+2 1 


Then, by subtracting the first row from the second and third rows, we have 


1 t 1 
W(t) = e0 1 —2). 
0 2 0 


Finally, evaluating the latter determinant by minors associated with the first column, we find that 
W(t) = 4e. 
Next, 
0 te et 
W(t) = JO (t--De! —e*|. 
1 (t+2)e' eT 


Using minors associated with the first column, we obtain 


t —t 


Wi(t) i i 2t—1 
1) — Eee. 
: (t--De! —e7 
In a similar way, 
e! 0 et ; : 
e e 
W(t) = le —e™| = — =2 
e —e™ 
e! 1 ert 
and 
el te! ; : 
t t g te 2t 
W3(t) = |e (t+ l)e 0| = : d^ 
a "m i e (t 4- 1)e 


Substituting these results in equation (13), we have 


‘ -1-2 i 2 : ^ 
Y(t) za &GX Passie f ECs) lasse f iC LP 
to 4es to 4es io 4es 


1 


= J (e^*(-1 +2t—s)) + e U^? )g(s)ds. (15) 
to 


Depending on the specific function g(t), it may or may not be possible to evaluate the integrals in 
equation (15) in terms of elementary functions. 


Although the procedure is straightforward, the algebraic computations involved in 
determining Y(t) from equation (13) become more and more complicated as n increases. 
In some cases the calculations may be simplified to some extent by using Abel’s identity 
(Problem 15d of Section 4.1), 


W(t) = Wlyi, .-- > yal(t) = cexp(— f poa). 


The constant c can be determined by evaluating W at some convenient point. 
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Problems 


In each of Problems | through 4, use the method of variation of 
parameters to determine the general solution of the given differential 


equation. 


ddp 
2 2 


1. y" -y'c-tant, 

2. y" _ y =t 

3. y" = 2y" _ y! + 2y = e? 

4. y" —y"-py' —y —esint 
In each of Problems 5 and 6, find the general solution of the given 
differential equation. Leave your answer in terms of one or more 
integrals. 


T T 
5. y" — y" p y! — y «sect, =a Sts — 


2 
6. y"—y'ccsct, O<t<T 

In each of Problems 7 and 8, find the solution of the given initial-value 
problem. Then plot a graph of the solution. 

Q 7. y"-y"+y'-y=sect; y(0)=2, y(0) =-1, 
y") =1 


[G] 8. y ecl — tant; (5) = 2, »() ze] 
T 

v(1) Se 
4 


References 


Coddington, E. A., An Introduction to Ordinary Differential 
Equations (Englewood Cliffs, NJ: Prentice-Hall, 1961; New 
York: Dover, 1989). 

Coddington, E. A. and Carlson, R., Linear Ordinary Differential 
Equations (Philadelphia, PA: Society for Industrial and 
Applied Mathematics, 1997). 


9. Given that x, x?, and 1/x are solutions of the homogeneous 


equation corresponding to 
x3y" 4 x?y" -2xy! +2y 22x*, x » 0, 


determine a particular solution. 


10. Findaformula involving integrals for a particular solution of the 
differential equation 


y"—y'cy-yzcg(). 


11. Finda formula involving integrals for a particular solution of the 
differential equation 


y? -y = g(t). 


Hint: The functions sin t, cost, sinh t, and cosht form a fundamental 
set of solutions of the homogeneous equation. 


12. Finda formula involving integrals for a particular solution of the 
differential equation 


y" — 3y" +3y' — y = g(t). 


If g(t) = t?e!, determine Y(t). 


Ince, E. L., Ordinary Differential Equations (London: Longmans, 
Green, 1927; New York: Dover, 1956). 


CHAPTERS 


Series Solutions of 
Second-Order Linear 
Equations 


Finding the general solution of a linear differential equation depends on determining a 
fundamental set of solutions of the homogeneous equation. So far, we have given a systematic 
procedure for constructing fundamental solutions only when the equation has constant 
coefficients. To deal with the much larger class of equations that have variable coefficients, 
it is necessary to extend our search for solutions beyond the familiar elementary functions of 
calculus. The principal tool that we need is the representation of a given function by a power 
series. The basic idea is similar to that in the method of undetermined coefficients: we assume 
that the solutions of a given differential equation have power series expansions, and then we 
attempt to determine the coefficients so as to satisfy the differential equation. 


51 Review of Power Series 


In this chapter we discuss the use of power series to construct fundamental sets of solutions of 
second-order linear differential equations whose coefficients are functions of the independent 
variable. We begin by summarizing very briefly the pertinent results about power series that 
we need. Readers who are familiar with power series may go on to Section 5.2. Those who 
need more details than are presented here should consult a book on calculus. 


oo 
1. A power series $5 a,(x — xo)" is said to converge at a point x if 


n=0 
m 


lim X an( xX — xo)” 
m—> oo 
n=0 
exists for that x. The series certainly converges for x = xo; it may converge for all x, or 


it may converge for some values of x and not for others. 
oo 


2. The power series $5 a,(x — xo)" is said to converge absolutely at a point x if the 
n=0 
associated power series 


oo oo 
3 las — x9"| = M lanl lx — xol" 
n=0 


n=0 
converges. It can be shown that if the power series converges absolutely, then the power 
series also converges; however, the converse is not necessarily true. 


3. One of the most useful tests for the absolute convergence of a power series is the ratio 
test: If a, ZZ 0, and if, for a fixed value of x, 


1 
[Angi (x — xo)" * An+1 
Big | ee nt 


n—oo 


= |x — xo| lim = |x — xolL, 


noo 


n(x = Xo)” an 
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then the power series converges absolutely at that value of x if |x — x9|L < 1 and 
diverges if |x — xo|L > 1. If |x — xo|L = 1, the ratio test is inconclusive. 


EXAMPLE 1 
For which values of x does the power series 


cona —2)" 2(x 2) —2(x - 2? -3(x - 2)? — ... 


n-l 


converge? 


Solution: 


We first test for absolute convergence using the ratio test. We have 


fim [SDF DG 2)" =p) ia = 
n>œ| — (—1)"*!n(x — 2)" 7 fac E s 


According to statement 3, the series converges absolutely for |x — 2| < 1, that is, for 1 < x < 3, 
and diverges for |x — 2| > 1. The values of x corresponding to |x — 2| = 1 are x = 1 and x = 3. 
The series diverges for each of these values of x since the nth term of the series does not approach 
zero as n — oo. This power series converges (absolutely) for 1 < x < 3 and diverges for x < 1 
and for x > 3. 


oo 
4. If the power series X` a,(x — xo)" converges at x = x), it converges absolutely for 
n=0 
|x — xo| € |x, — xol; and if it diverges at x = x, it diverges for |x — xo| > |x, — xol. 
5. For a typical power series, such as the one in Example 1, there is a positive number 
oo 
p, called the radius of convergence, such that $5 a,(x — xo)" converges absolutely 
n=0 
for |x — xo| < p and diverges for |x — xo| > p. The interval |x — xo| « p is called 
the interval of convergence; it is indicated by the hatched lines in Figure 5.1.1. The 
series may either converge or diverge when |x — xo| = p. Many important power series 
converge for all values of x. In this case it is customary to say that p is infinite and the 
interval of convergence is the entire real line. It is also possible for a power series to 
converge only at xo. For such a series we say that p = 0 and the series has no interval of 
convergence. When these exceptional cases are taken into account, every power series 
has a nonnegative radius of convergence p, andif p > 0, then there is a (finite or infinite) 
interval of convergence centered at xq. 


Series 


ae dt converges aii ee 
iverges absolutely iverges 
Xo- p Xo Xotp i 


NC Series may va 


converge or diverge 


| FIGURE 5.1.1 | The interval of convergence of a power series. 


EXAMPLE 2 


Determine the radius of convergence of the power series 


oo 


(x 4- 1)" 
DE 


n=1 


5.1 Review of Power Series 


Solution: 
We apply the ratio test: 
(x +1)"*! n2" Ix - 1| ,. n |x + 1| 
im = lim = f 
noo] (n + 1)2"41 (x +1)” 2 nsontl 2 


Thus the series converges absolutely for |x + 1| < 2, that is, for —3 < x < 1, and diverges for 
|x + 1| > 2. The radius of convergence of the power series is p = 2. Finally, we check the end- 
points of the interval of convergence. At x = 1 the series becomes the harmonic series 


oo 


which diverges. At x — —3 we have 


oo 


(-34+1" «x(-D" 
2 n2" =) n ` 


n=1 n=1 


Recognizing this as the alternating harmonic series, we recall that it converges but does not converge 
absolutely. The power series is said to converge conditionally at x = —3. To summarize, the given 
power series converges for —3 < x < 1 and diverges otherwise. It converges absolutely for 
—3 < x < 1 and has a radius of convergence of 2. 


oo oo 
Suppose that X` a,(x — xo)” and Y b,(x — xo)" converge to f(x) and g(x), respec- 
n=0 n=0 
tively, for |x — xol < p, p > 0. 


6. The two series can be added or subtracted termwise, and 


f(x) £ g(x) = M (a, £ bn) (x — x)": 


n=0 


the resulting series converges at least for |x — xo| < p. 
7. The two series can be formally multiplied, and 


oo oo oo 
F8 = | 3 ane = x0)" | [So b x9" | = So en(x — x0)", 
n=0 n=0 n=0 
where c, = aob, + a1b, 4 +--+ + Gnbo. The resulting series converges at least for 


|x — xol < p. 
Further, if by Z 0, then g(xo) # 0, and the series for f(x) can be formally divided 
by the series for g( x), and 


fx) _> , 
gx) 2. dlm t 


In most cases the coefficients d, can be most easily obtained by equating coefficients in 
the equivalent relation 


oo oo oo 
o an(x — xo)” = x dd — xo)" * buts — Xo)" 
n=0 n=0 n=0 


n 


= M | XO dibar | (x — xo)". 


n=0 X k=0 


In the case of division, the radius of convergence of the resulting power series may be 
less than p. 
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8. The function f is continuous and has derivatives of all orders for |x —x9| < p. Moreover, 
f^. f", ... can be computed by differentiating the series termwise; that is, 


f(x) = a, + 2ay(x — xo) +++ + na,(x — x9 TM 


oo 


= So nay (x — x0)", 


n=1 


f" (x) = 2a, + 6a3(x — xo) +- + n(n — lay (x apy t pus. 


= $ n(n Dax — xo)", 


n=2 


and so forth, and each of the series converges absolutely for |x — xo| < p. 
9. The value of a, is given by 


f? (xo) 


an = xi 


The series is called the Taylor! series for the function f about x = xo. 


oo oo 
10. If 35 a&(x — xo)" = YO ba(x — xo)" for each x in some open interval with center xo, 
=0 =0 
n n PA 
then a, = b, forn = 0, 1,2,3, ... . In particular, if X a, (x — xo)" = 0 for each such 
n=0 
x, thendg — a; =---=a,=:-::=0. 


A function f that has a Taylor series expansion about x = xo 


o (n) 
o= a (x — xo)”, 


n=0 


with a radius of convergence p > 0, is said to be analytic at x = xo. All of the familiar 
functions of calculus are analytic except perhaps at certain easily recognized points. For 
example, sin x and e* are analytic everywhere, 1/x is analytic except at x = 0, and tan x 
is analytic except at odd multiples of 7/2. According to statements 6 and 7, if f and g are 
analytic at xo, then f + g, f - g, and f/g (provided that g(xo) + 0) are also analytic at 
x = x. In many respects the natural context for the use of power series is the complex plane. 
The methods and results of this chapter nearly always can be directly extended to differential 
equations in which the independent and dependent variables are complex-valued. 


Shift of Index of Summation. The index of summation in an infinite series is a dummy 
parameter just as the integration variable in a definite integral is a dummy variable. Thus it is 
immaterial which letter is used for the index of summation. For example, 


Y 2x" 2! x! 
n! j! 
n=0 j= J 


Just as we make changes of the variable of integration in a definite integral, we find it 
convenient to make changes of summation indices in calculating series solutions of differential 
equations. We illustrate by several examples how to shift the summation index. 


!Brook Taylor (1685-1731), English mathematician, received his education at Cambridge University. His book 
Methodus incrementorum directa et inversa, published in 1715, includes a general statement of the expansion theorem 
that is named for him. This is a basic result in all branches of analysis, but its fundamental importance was not 
recognized until 1772 (by Lagrange). Taylor was also the first to use integration by parts, was one of the founders 
of the calculus of finite differences, and was the first to recognize the existence of singular solutions of differential 
equations. 


5.1 Review of Power Series 


EXAMPLE 3 


oo 
Write ys anx” as a series whose first term corresponds to n = 0 rather than n = 2. 
n=2 


Solution: 


Let m = n — 2; then n = m + 2, and n = 2 corresponds to m = 0. Hence 
oo oo 
X anx” = X naak T, (1) 
n=2 m=0 


By writing out the first few terms of each of these series, you can verify that they contain precisely the 
same terms. Finally, in the series on the right-hand side of equation (1), we can replace the dummy 
index m by n, obtaining 


oo oo 
n n+2 
1 anx = ) An+2x (2) 


n=2 n=0 


In effect, we have shifted the index upward by 2 and have compensated by starting to count at a level 
2 lower than originally. 


EXAMPLE 4 
Write the series 


3 n 20 + Danla — xo)" (3) 


n=2 
as a series whose generic term involves (x — xo)" rather than (x — xo)". 
Solution: 


Again, we shift the index by 2 so that n is replaced by n + 2 and start counting 2 lower. We obtain 


Y e o 3)a (à = xo". (4) 


n=0 


You can readily verify that the terms in the series (3) and (4) are exactly the same. 


EXAMPLE 5 


Write the expression 


oo 


x? p» (r+n)a,x't"! (5) 


n=0 


as a series whose generic term involves x"*". 


Solution: 


First, take the x? inside the summation, obtaining 


5 (r 4 n)a,x "t, (6) 


n=0 
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Next, shift the index down by 1 and start counting 1 higher. Thus 


oo oo 


5 (r 4 na,x ^" *! = 5 (r &n- Da, ax" "^. (7) 


n=0 n=1 


Again, you can easily verify that the two series in equation (7) are identical and that both are exactly 
the same as the expression (5). 


EXAMPLE 6 


Assume that 


oo oo 
5 nax”! = * anx” (8) 


for all x, and determine what this implies about the coefficients an. 


Solution: 


We want to use statement 10 to equate corresponding coefficients in the two series. In order to do 
this, we must first rewrite equation (8) so that the series display the same power of x in their generic 
terms. For instance, in the series on the left-hand side of equation (8), we can replace n by n + 1 and 
start counting 1 lower. Thus equation (8) becomes 


oo 


oo 
5 (n+ agyix” = 5 anx”. (9) 
n=0 


n=0 
According to statement 10, we conclude that 
(n+ Tar = ai n= 0,1,2, 3; ee 


or 


a 


Ong = IT n=0,1,2,3,.... (10) 


Hence, choosing successive values of n in equation (10), we have 


a do a 4% 
d, = do, n=; a? a3 3 3r 
and so forth. In general, 
0 
an= —, n=1,2,3,.... (11) 
n! 


Thus the relation (8) determines all the following coefficients in terms of ag. Finally, using the 
coefficients given by equation (11), we obtain 


oo oo oo n 
n do n x F 
j Any = j X = ag j = ae , 
n! n! 
n=0 n=0 n=0 
oo x" 
where we have followed the usual convention that 0! = 1, and recalled that e* = J T for all 
n! 
n=0 


values of x. (See Problem 8.) 
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Problems 


In each of Problems 1 through 6, determine the radius of convergence 


oo 
== n 
of the given power series. 15. Lety — 2 ünX . 


n=0 
oo 
1. 3ix-3" a. Compute y' and y" and write out the first four terms of each 
n=0 series, as well as the coefficient of x” in the general term. 
oO n b. Show that if y" = y, then the coefficients aj and a, are 
2. b» P dd arbitrary, and determine a» and a3 in terms of dp and ay. 
n=0 C. Show thatan, = — £ n =0,1,2,3,.... 
oo x (n +2)(n + 1) 
Ew 
LE In each of Problems 16 and 17, verify the given equation. 
4 ~ nyn 5 = pat > — q7 
e 22x 16. So a,(x-—D = J a(x — 1) 
n=0 n=0 n=1 


5. 3 (x — xo)” 


oo oo oo 
17. X ay 44x* + X ayx*H =a; + X (aky + ay_1)x* 


n 


n-l k=0 k=0 k=1 
œ (—1)"n?(x +2)" In each of Problems 18 through 22, rewrite the given expression as a 
6. > 739 single power series whose generic term involves x". 


n=1 
co 


In each of Problems 7 through 13, determine the Taylor series about 18. X n(n — Dax"? 
the point xo for the given function. Also determine the radius of n-2 


convergence of the series. oo oo 
19. x. nasx" | + X ayx* 
k=0 


Ts sin x, XQ = 0 n=1 
8. e, =0 eo oo 
í "9 20. So m(m- Dagx"? +x So kax"! 
9. x, xo9-1 m=2 k=1 
oo oo 
10. x? x9--1 21. Y na,x" x Y aux" 
11. Inx, x=1 n=l ia 
oo oo 
12 1 x =0 22. x X n(n — Da,x"? + 5 a,x" 
2 1—x 077 n-2 n=0 
1 23. Determine the a, so that the equation 
13. 1 , Xo = 2 
—x oo oo 
od 5 nax”! +2 5 a,x" = 0 
14. Lety 2 X} nx". n=l n=0 
n=0 


a. Compute y' and write out the first four terms of the series. is satisfied. Try to identify the function represented by the series 


oo 
b. Compute y" and write out the first four terms of the series. ax". 
n=0 


5.2 Series Solutions Near an Ordinary 
Point, Part | 


In Chapter 3 we described methods of solving second-order linear differential equations with 
constant coefficients. We now consider methods of solving second-order linear equations 
when the coefficients are functions of the independent variable. In this chapter we will denote 
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the independent variable by x. It is sufficient to consider the homogeneous equation 


dy dy 
PO) 72 t 4) tRy =0, (1) 


since the procedure for the corresponding nonhomogeneous equation is similar. 
Many problems in mathematical physics lead to equations of the form (1) having 
polynomial coefficients; examples include the Bessel equation 


Hg 


xy" + xy! + (x? —v)y 2 0, 


where v is a constant, and the Legendre equation 
(1 —x°)y" — 2xy' tala + 1)y =0, 


where a. is a constant. For this reason, as well as to simplify the algebraic computations, we 
primarily consider the case in which the functions P, Q, and R are polynomials. However, as 
we will see, the method of solution is also applicable when P, Q, and R are general analytic 
functions. 

For the present, then, suppose that P, Q, and R are polynomials and that there is no factor 
(x—c) that is common to all three of them. If there is such a common factor (x —c), then divide 
it out before proceeding. Suppose also that we wish to solve equation (1) in the neighborhood 
of a point xo. The solution of equation (1) in an interval containing xo is closely associated 
with the behavior of P in that interval. 

A point xo such that P(xo) + 0 is called an ordinary point. Since P is continuous, it 
follows that there is an open interval containing xo in which P(x) is never zero. In that interval, 
which we will call J, we can divide equation (1) by P(x) to obtain 


y" + p(x)y' - q(x)y — 0, Q) 


where p(x) = Q(x)/ P(x) and q(x) = R(x)/ P(x) are continuous functions on J. Hence, 
according to the existence and uniqueness theorem, Theorem 3.2.1, there exists a unique 
solution of equation (1) in the interval 7 that also satisfies the initial conditions y(xo) = yo 
and y'(xo) = yg for arbitrary values of yo and y,. In this and the following section, we discuss 
the solution of equation (1) in the neighborhood of an ordinary point. 

On the other hand, if P(xo) = 0, then xo is called a singular point of equation (1). In 
this case, because (x — xo) is not a factor of P, Q, and R, at least one of Q(xo) and R(xo) 
is not zero. Consequently, at least one of the coefficients p and q in equation (2) becomes 
unbounded as x — Xo, and therefore Theorem 3.2.1 does not apply in this case. Sections 
5.4 through 5.7 deal with finding solutions of equation (1) in the neighborhood of a singular 
point. 

We now take up the problem of solving equation (1) in the neighborhood of an ordinary 
point xo. We look for solutions of the form 


oo 
y = ay + ai(x — x) av (x — x9) m Y Lax — xp" 3) 
n=0 


and assume that the series converges in the interval |x — xo| « p for some p > 0. 

While at first sight it may appear unattractive to seek a solution in the form of a power 
series, this is actually a convenient and useful form for a solution. Within their intervals of 
convergence, power series behave very much like polynomials and are easy to manipulate both 
analytically and numerically. Indeed, even if we can obtain a solution in terms of elementary 
functions, such as exponential or trigonometric functions, we are likely to need a power series 
or some equivalent expression if we want to evaluate the solution numerically or to plot its 
graph. 

The most practical way to determine the coefficients a, is to substitute the series (3) and 
its derivatives for y, y’, and y" in equation (1). The following examples illustrate this process. 
The operations, such as differentiation, that are involved in the procedure are justified so long 
as we stay within the interval of convergence. The differential equations in these examples are 
also of considerable importance in their own right. 
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EXAMPLE 1 


Find a series solution of the equation 


y"+y=0, —oo«x «oo. (4) 


Solution: 


As we know, sin x and cos x form a fundamental set of solutions of this equation, so series methods 
are not needed to solve it. However, this example illustrates the use of power series in a relatively 
simple case. For equation (4), P(x) = 1, Q(x) = 0, and R(x) = 1; hence every point is an ordinary 
point. 
We look for a solution in the form of a power series about xo = 0 
oo 
y = ay ax - a3x? ax? 4 b aaxl $= Yaw (5) 
n=0 


and assume that the series converges in some interval |x| « p. Differentiating equation (5) term by 
term, we obtain 


oo 
y = ay 4-2ayx -3agx? 4 E narx”! +- = * nau (6) 
n-l 
and 
oo 
y" = Day  3-2agx ++: HAC — Day"? $= V n(n — Daun. (7) 
n=2 


Substituting the series (5) and (7) for y and y” in equation (4) gives 


oo oo 
Y nin = Da"? + Y anx" =0. 


n=2 n=0 


To combine the two series, we need to rewrite at least one of them so that both series display the 
same generic term. (See Problem 22 in Section 5.1.) Thus, in the first sum, we shift the index of 
summation by replacing n by n + 2 and starting the sum at 0 rather than 2. We obtain 


oo oo 
5 (n +2)(n + Da, ax" + » a zd 
=0 =0 


or 
oo 


5 ((n + 2)(n + Vanya + an) x" =0. 

n=0 
For this equation to be satisfied for all x, the coefficient of each power of x must be zero; hence we 
conclude that 


(n+2)(n+ 1)an42 +a =0, n=0,1,2,3,.... (8) 
Equation (8) is referred to as a recurrence relation. The successive coefficients can be 
evaluated one by one by writing the recurrence relation first for n = 0, then for n = 1, and so 
forth. In this example equation (8) relates each coefficient to the second one before it. Thus the even- 
numbered coefficients (ao, 42, a4, . . . ) and the odd-numbered ones (a, a5, as, . . . ) are determined 
separately. For the even-numbered coefficients we have 
do do az do a4 ao 
Sig we unc up degunt 
These results suggest that in general, if n = 2k, then 
eg 
an = ay = ———ag, k= 1,2,3,.... 9 
n 2k (2k) ! 0 ( ) 


We can prove equation (9) by mathematical induction. First, observe that it is true for k — 1. Next, 
assume that it is true for an arbitrary value of k and consider the case k + 1. We have 
Ax (=1)* eit 
= ao = ao. 
Qk-2)Qk-1 — Qk-2Qk4DQO!?^ Qk-2!? 


o2 = 
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Hence equation (9) is also true for k + 1, and consequently it is true for all positive integers k. 
Similarly, for the odd-numbered coefficients 
ai ai a3 ai a5 ai 


and in general, ifn = 2k + 1, then? 
(-D* 
Gre pr 


Substituting these coefficients into equation (5), we have 


An = A241 = k= 1, 2, 3, TTE (10) 


a a0 4 a s 


_ 02 01.3 
PE S a a a 


(-D^ao 5, (-D'ai 4 
(2n)! (2n 4 0)! 


2 4 —1)” 
-a| 5 Artis Porgi -a 


17 4 (2n)! 
x3 x? (-1)" 5: 
— — — * ae C — +1 eee 
ta f atat teen" t 
oo oo 
(-D" 2n (-D" 2n+1 
- QE nel, 11 
«2. (2n)! " taraen (1) 
n= n=l 


We identify two series solutions of equation (4): 


= C=)" 2n 


Ly 
x^ and y(x) = E ) ntl 


Qn DU 


y (x) = 


n=0 n= 


Using the ratio test, we can show that the series for y,(x) and y2(x) converge for all x, and this 
justifies retroactively all of the steps used in obtaining these solutions. Indeed, the series for y,(x) is 
exactly the Taylor series for cos x about x = 0 and the series for y2(x) is the corresponding Taylor 
series for sinx. Thus, as we anticipated in equation (11) we have obtained the general solution of 
equation (4) in the form y = ag cos x + a, sin x. 

Notice that no conditions are imposed on a, and a,; hence they are arbitrary. From equations (5) 
and (6) we see that y and y' evaluated at x = 0 are ag and aj, respectively. Since the initial conditions 
y(0) and y'(0) can be chosen arbitrarily, it follows that ay and a, should be arbitrary until specific 
initial conditions are stated. 

Figures 5.2.1 and 5.2.2 show how the partial sums of the series solutions y;(x) and y2(x) 
approximate cos x and sin x, respectively. As the number of terms increases, the interval over which 


2l n-4 n-8 n=12 n-16 n-20 


n-2 n-6 n=10 n-14 n=18 


Polynomial approximations to y — cosx. The 
value of n is the degree of the approximating polynomial. 


?The result given in equation (10) and other similar formulas in this chapter can be proved by an induction 
argument resembling the one just given for equation (9). We assume that the results are plausible and omit the 
inductive argument hereafter. (See Problem 16.) 


5.2 Series Solutions Near an Ordinary Point, Part | 


the approximation is satisfactory becomes longer, and for each x in this interval the accuracy of 
the approximation improves. However, you should always remember that a truncated power series 
provides only a local approximation of the solution in a neighborhood of the initial point x = 0; it 
cannot adequately represent the solution for large |x]. 


n=5 n=9 n=13 n=17 n=21 


n=3 n-7 nall n=15 n=19 


Polynomial approximations to y= sinx. The 
value of n is the degree of the approximating polynomial. 


In Example 1 we knew from the start that sinx and cosx form a fundamental set 
of solutions of equation (4). However, if we had not known this and had simply solved 
equation (4) using series methods, we would still have obtained the solution (11). In 
recognition of the fact that the differential equation (4) often occurs in applications, we might 
decide to give the two solutions of equation (11) special names, perhaps 


oo 


eo —]1)” —]1)” 
C(x) 2Y7 CD six) => re" (12) 
i n=0 ! 


Then we might ask what properties these functions have. For instance, can we be sure that C( x) 
and S(x) form a fundamental set of solutions? It follows at once from the series expansions 
that C(0) = 1 and S(0) = 0. By differentiating the series for C(x) and S(x) term by term, 
we find that 


S'(x) = C(x), C(x) = —S(x). (13) 


Thus at x = 0, we have S’(0) = 1 and C'(0) = 0. Consequently, the Wronskian of C and S 
at x = Ois 


WIC, S](0) = A. =1 14 
, ~ 10 il 3 (14) 


so these functions do indeed form a fundamental set of solutions. By substituting —x for x 
in each of equations (12), we obtain C(—x) = C(x) and S(—x) = —S(x). Moreover, 
by calculating with the infinite series? we can show that the functions C(x) and S(x) 
have all the usual analytical and algebraic properties of the cosine and sine functions, 
respectively. 

Although you probably first saw the sine and cosine functions defined in a more 
elementary manner in terms of right triangles, it is interesting that these functions can be 
defined as solutions of a certain simple second-order linear differential equation. To be precise, 
the function sin x can be defined as the unique solution of the initial-value problem y" -- y = 0, 
y(0) = 0, y/(0) = 1; similarly, cos x can be defined as the unique solution of the initial-value 


3Such an analysis is given in Section 24 of Knopp (see the References at the end of this chapter). 
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problem y" + y = 0, y(0) = 1, y(0) = 0. Many other functions that are important in 
mathematical physics are also defined as solutions of certain initial-value problems. For most 
of these functions there is no simpler or more elementary way to approach them. 


EXAMPLE 2 


Find a series solution in powers of x of Airy’s* equation 


/ 


y'—xy-20, -—oo«x«oo. (15) 


Solution: 


For this equation P(x) = 1, Q(x) = 0, and R(x) = —x; hence every point is an ordinary point. 
We assume that 


oo 
ye agr (16) 


1—0 


and that the series converges in some interval |x| « p. The series for y" is given by equation (7); as 
explained in the preceding example, we can rewrite it as 


oo 
y" =X n+2(n+ Da, 42x". (17) 
n=0 


Substituting the series (16) and (17) for y and y” into the left-hand side of equation (15), we obtain 


oo oo oo oo 
v» (n 4 2)(n + Da,4ax" — x 374 = »»c + D(a Darp" — b d (18) 
n=0 n=0 n=0 n=0 


Next, we shift the index of summation in the second series on the right-hand side of equation (18) by 
replacing n by n — 1 and starting the summation at 1 rather than zero. Thus we write equation (15) 
as 


oo 


oo 
2+ lay + Soin +2)(n+1)dn42x" — E t =0. 


n=1 n=1 


Again, for this equation to be satisfied for all x in some interval, the coefficients of like powers of x 
must be zero; hence a» = 0, and we obtain the recurrence relation 


(n +2)(n + 1)an+2 — an-ı =0 for n=1,2,3,.... (19) 
Since a, is given in terms of a„—1, the a’s are determined in steps of three. Thus a determines 
a3, which in turn determines ag, ... ; a, determines a4, which in turn determines a7, ... ; and a 
determines as, which in turn determines ag, ... . Since a) = 0, we immediately conclude that 
45 = dg = 4,1 =---=0. 
For the sequence dp, a3, 46, dg, ... We set n = 1,4, 7, 10, ... in the recurrence relation: 
ao a3 ao a6 do 
d3 = ——i 66S EU SS dg = = TETE 
2.3 5.6 2.3.5.6 8.9  2.3.5.6.8.9 


These results suggest the general formula 


ao 
= , nz 
2.3.5.6... (3n — D) (3n) 


3n 


^Sir George Biddell Airy (1801—1892), an English astronomer and mathematician, was director of the Greenwich 
Observatory from 1835 to 1881. He studied the equation named for him in an 1838 paper on optics. One reason 
why Airy's equation is of interest is that for x negative the solutions are similar to trigonometric functions, and for 
x positive they are similar to hyperbolic functions. Can you explain why it is reasonable to expect such behavior? 
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For the sequence a4, a4, 47, ajo, ... , we set n = 2, 5,8, 11, ... in the recurrence relation: 
a1 a4 a1 a7 a, 
a4 = > a= = > ao= = pores 
3-4 6-7 3.4.6.7 9.10 3-4-6-7-9-10 
In general, we have 
ay, d 
"HT S Agere iy 9 
Thus the general solution of Airy's equation is 
(x) pe MU NUNT: i + 
x)=a —— —— ae lee 
$ M" eus "puis 2-3:--(3n — DOR) 
bags uL dus — + 
a Tun l———— n n 
Eme avd- dideBcT 3-4---(3n)(3n+ 1) 
= agyi(x) + a1 y2(x) (20) 


where y,(x) and y2(x) are the first and second bracketed expressions in equation (20). 

Having obtained these two series solutions, we can now investigate their convergence. Because 
of the rapid growth of the denominators of the terms in the series for y,(x) and for y2(x), we might 
expect these series to have a large radius of convergence. Indeed, it is easy to use the ratio test to 
show that both of these series converge for all x; see Problem 17. 

Assume for the moment that the series for y; and y; do converge for all x. Then, by choosing 
first ay = 1, a, = 0 and then aj = 0,a, = 1, it follows that y; and y are individually solutions 
of equation (15). Notice that y, satisfies the initial conditions y,(0) = 1, y; = 0 and that y2 
satisfies the initial conditions y2(0) = 0, y5(0) = 1. Thus W[y;, y2](0) = 1 Æ 0, and consequently 
yı and y» are a fundamental set of solutions. Hence the general solution of Airy's equation is 


y =aoyı(x) +aıy2(x) —oocx«oo. 


In Figures 5.2.3 and 5.2.4, respectively, we show the graphs of the solutions yı and y; of Airy's 
equation as well as graphs of several partial sums of the two series in equation (20). Again, the 
partial sums provide local approximations to the solutions in a neighborhood of the origin. Although 
the quality of the approximation improves as the number of terms increases, no polynomial can 
adequately represent y; and y» for large |x|. A practical way to estimate the interval in which a 
given partial sum is reasonably accurate is to compare the graphs of that partial sum and the next 
one, obtained by including one more term. As soon as the graphs begin to separate noticeably, we 
can be confident that the original partial sum is no longer accurate. For example, in Figure 5.2.3 the 
graphs for n = 24 and n = 27 begin to separate at about x = —9/2. Thus, beyond this point, the 
partial sum of degree 24 is worthless as an approximation to the solution. 


y 
2 
/n -3 
n-48| Be |24 \12 id 
42 |30 \ 18\ |6 
y = (x) 
| b Kf M / i NE 
-10 -8 L6 -4 2 2 x 
39| [7| |15 
n=45| |s3] [21 9| 3 -2 


Polynomial approximations to the solution y = y;(x) of Airy's 
equation. The value of n is the degree of the approximating polynomial. 
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y 
n24 
n-46| |a4| |22 10 2 
40| |28 | 16 4 
y = yox) 
L fi \i M fe NV | 
-10 -8 B - E 2 x 
n-49| |3 25| |13 
43 [311 [19 7 -2 


Polynomial approximations to the solution y = y2(x) of Airy's 
equation. The value of n is the degree of the approximating polynomial. 


Observe that both y, and y; are monotone for x > 0 and oscillatory for x < 0. You can also 
see from the figures that the oscillations are not uniform but, rather, decay in amplitude and increase 
in frequency as the distance from the origin increases. In contrast to Example 1, the solutions y, and 
y» of Airy’s equation are not elementary functions that you have already encountered in calculus. 
However, because of their importance in some physical applications, these functions have been 
extensively studied, and their properties are well known to applied mathematicians and scientists. 


EXAMPLE 3 


Find a solution of Airy's equation in powers of x — 1. 


Solution: 


The point x — 1 is an ordinary point of equation (15), and thus we look for a solution of the form 


oo 


y= ama- D", 


n=0 


where we assume that the series converges in some interval |x — 1| < p. Then 


oo 


y 2 M na - D = MG + Dana- 0", 


n-l n=0 


and 
oo oo 
y' 2 M nin - Dax = D"? = M "(n 2) + Dana — D". 
n=2 n=0 
Substituting for y and y” in equation (15), we obtain 
oo oo 
2G Diausix- 1)” =x a(x - 1)”. (21) 
n=0 n=0 


Now to equate the coefficients of like powers of (x — 1), we must express x, the coefficient of y in 
equation (15), in powers of x — 1; that is, we write x = 1 + (x — 1). Note that this is precisely the 
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Taylor series for x about x = 1. (See Problem 9 in Section 5.1.) Then equation (21) takes the form 


NOn +D + Danaa — 1)" = OG — 1) So ax - D" 


n=0 n=0 
oo oo 
= Sax za Sax — n, 
n=0 n=0 


Shifting the index of summation in the second series on the right gives 


oo oo oo 
3 n2 Dax - 0" 2 ax - 0" - aia - D". 
n=0 n=0 n=1 


Equating coefficients of like powers of x — 1, we obtain 
2a5 = ag, 
(3-2)as = a, +a, 
(4:3)a, = a3 +a, 
(5 -4)as = a3 +a, 


The general recurrence relation is 
(n+2)(n+ ldyi2 =an +a- for n21. (22) 


Solving for the first few coefficients a, in terms of ag and ay, we find that 


ao aj ao a2 a, ao ai a3 a» ao a, 
Baa BEE 0967 tus 754 i 057 39 * 39 ^ 307 120 
Hence 
x-D? (x-D? (x-1) (x-1»? 
zd "» 
y a| + 2 T 6 T 24 T 30 T 
(x-D? (x-D* (x- 1) 
ta E REVO c a S Q3) 


In general, when the recurrence relation has more than two terms, as in equation (22), the 
determination of a formula for a, in terms ao and a, will be fairly complicated, if not impossible. 
In this example such a formula is not readily apparent. Lacking such a formula, we cannot test the 
two series in equation (23) for convergence by direct methods such as the ratio test. However, we 
shall see in Section 5.3 that even without knowing the formula for a,, it is possible to establish that 
the two series in equation (23) converge for all x. Further, they define functions y3 and y4 that are a 
fundamental set of solutions of the Airy equation (15). Thus 


y = agya(x) + a1 y4(x) 


is the general solution of Airy's equation for —oo < x < oo. 


While Airy's equation is not particularly complicated, Example 3 shows some of the 
complications encountered when looking for a power series solution expressed in powers of 
x — xo With xo Æ 0. There is an alternative. We can make the change of variable x — xo = t, 
obtaining a new differential equation for y as a function of f, and then look for solutions of 

oo 


this new equation of the form X` a,t". When we have finished the calculations, we replace t 
n=0 
by x — xo (see Problem 15). 

In Examples 2 and 3 we have found two sets of solutions of Airy's equation. The 
functions y, and y defined by the series in equation (20) are a fundamental set of solutions of 
equation (15) for all x, and this is also true for the functions y3 and y4 defined by the series in 
equation (23). According to the general theory of second-order linear equations, each of the 
first two functions can be expressed as a linear combination of the latter two functions, and 
vice versa— a result that is certainly not obvious from an examination of the series alone. 
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Finally, we emphasize that it is not particularly important if, as in Example 3, we are 
unable to determine the general coefficient a, in terms of dg and a,. What is essential is that 
we can determine as many coefficients as we want. Thus we can find as many terms in the 
two series solutions as we want, even if we cannot determine the general term. While the 
task of calculating several coefficients in a power series solution is not difficult, it can be 
tedious. A symbolic manipulation package can be very helpful here; some are able to find a 
specified number of terms in a power series solution in response to a single command. With a 
suitable graphics package we can also produce plots such as those shown in the figures in this 


section. 


Problems 


In each of Problems 1 through 11: 


Ne Qo Se oe Du Eos 


= 
= 


11. 


a. Seek power series solutions of the given differential equation 
about the given point xo; find the recurrence relation that the 
coefficients must satisfy. 

b. Find the first four nonzero terms in each of two solutions y; 
and y» (unless the series terminates sooner). 

c. By evaluating the Wronskian W[y;, yo](xo). show that y, 
and y» form a fundamental set of solutions. 

d. If possible, find the general term in each solution. 


y'—y20, x=0 
y'4-3y 20, x9 =0 
y"—xy’—y=0, x =0 
y"—xy'-y=0, x=1 


y" +k?x?y =0, x9 =0, kaconstant 


(l—-x)y"+y=0, x =0 
y’+xy+2y=0, x =0 
xy’ +y +xy=0, xw=1 
(3—x?)y” -3xy/ - y 20, x 20 
2y"+xy'+3y=0, x =0 
2y" F(x-Dy -3y 20, x =2 


In each of Problems 12 through 14: 


12. 
13. 
14. 
15. 


16. 


a. Find the first five nonzero terms in the solution of the given 
initial-value problem. 

Q b. Plot the four-term and the five-term approximations to the 
solution on the same axes. 

C. From the plot in part b, estimate the interval in which the 
four-term approximation is reasonably accurate. 


y"—xy’—y=0, y(0)=2, y'(0) =l; 
y"+xy'+2y=0, (0) =4, y(0) = —L 
(l-x)y"+xy'—y=0, y(0) =—3, y(0) =2 

a. By making the change of variable x — 1 = t and assuming 


that y has a Taylor series in powers of f, find two series solutions 
of 


see Problem 3 


see Problem 7 


y' +(x — D?y + (x?-1y 20 
in powers of x — 1. 
b. Show that you obtain the same result by assuming that y 
has a Taylor series in powers of x — 1 and also expressing the 
coefficient x? — 1 in powers of x — 1. 


Prove equation (10). 


17. Show directly, using the ratio test, that the two series solutions 
of Airy's equation about x — 0 converge for all x; see equation (20) 
of the text. 


18. The Hermite Equation. The equation 


y” —2xy’+Ay =0, —oo«x« oo, 


where A is a constant, is known as the Hermite? equation. It is an 
important equation in mathematical physics. 
a. Find the first four nonzero terms in each of two solutions 
about x = 0 and show that they form a fundamental set of 
solutions. 
b. Observe that if \ is a nonnegative even integer, then one 
or the other of the series solutions terminates and becomes a 
polynomial. Find the polynomial solutions for \ = 0, 2, 4, 6, 
8, and 10. Note that each polynomial is determined only up to a 
multiplicative constant. 
C. The Hermite polynomial H,(x) is defined as the polynomial 
solution of the Hermite equation with A = 2n for which the 
coefficient of x" is 2". Find Ho(x), H4(x), ..., Hs(x). 


19. Consider the initial-value problem y' = 4/1 — y?, y(0) = 0. 
a. Show that y = sinx is the solution of this initial-value 
problem. 

b. Look for a solution of the initial-value problem in the form of 
a power series about x = 0. Find the coefficients up to the term 
in x? in this series. 

In each of Problems 20 through 23, plot several partial sums in a 

series solution of the given initial-value problem about x = 0, 

thereby obtaining graphs analogous to those in Figures 5.2.1 through 

5.2.4 (except that we do not know an explicit formula for the actual 

solution). 


Q 20. y’+xy'+2y 20, y(0) 20, y'(0) = 1; see Problem 7 
O 21. (4-x5y"42y20, y(00-20 y(02-21 

O 22. y"+x?y=0, y(00) 21, y(0) 20; see Problem 5 
Q 23. (1—x)y"+xy'-2y =0, y(0)=0, y(0 =1 


5Charles Hermite (1822-1901) was an influential French analyst and 
algebraist. An inspiring teacher, he was professor at the École Polytechnique 
and the Sorbonne. He introduced the Hermite functions in 1864 and showed in 
1873 that e is a transcendental number (that is, e is not a root of any polynomial 
equation with rational coefficients). His name is also associated with Hermitian 
matrices (see Section 7.3), some of whose properties he discovered. 


5.3 Series Solutions Near an Ordinary Point, Part II 


5.3 Series Solutions Near an Ordinary 
Point, Part Il 


In the preceding section we considered the problem of finding solutions of 
P(x)y" + QOO y+ RG)y =0, (1) 


where P, Q, and R are polynomials, in the neighborhood of an ordinary point xy. Assuming 
that equation (1) does have a solution y = @(x) and that @ has a Taylor series 


oo 


p(x) = X` a(x — xo)" (2) 


n=0 


that converges for |x — xo] < p, where p > 0, we found that the a, can be determined by 
directly substituting the series (2) for y in equation (1). 

Let us now consider how we might justify the statement that if xo is an ordinary point 
of equation (1), then there exist solutions of the form (2). We also consider the question of 
the radius of convergence of such a series. In doing this, we are led to a generalization of the 
definition of an ordinary point. 

Suppose, then, that there is a solution of equation (1) of the form (2). By differentiating 
equation (2) m times and setting x equal to xo, we obtain 


mldm = QU? (xo). (3) 


Hence, to compute a, in the series (2), we must show that we can determine à? (xg) for 
n —0,1,2, ... from the differential equation (1). 

Suppose that y = (x) is a solution of equation (1) satisfying the initial conditions 
y(xo) = yo. y'(xo) = yy. Then ag = yo and a, = yj. If we are solely interested in finding 
a solution of equation (1) without specifying any initial conditions, then ao and a, remain 
arbitrary. To determine QU) (xo) and the corresponding a, for n = 2,3, ... , we turn to 
equation (1) with the goal of finding a formula for 9" (x), $"(x), ... . Since ¢ is a solution 
of equation (1), we have 


P(x)ó"(x) + Q(x) 9G) + R(X) G(x) = 0. 
For the interval about xo for which P is nonzero, we can write this equation in the form 
b"(x) = —pQ)o'(x) — q(x) (x), (4) 
where p(x) = Q(x)/ P(x) and q(x) = R(x) / P(x). Observe that, at x = xo, the right-hand 
side of equation (4) is known, thus allowing us to compute $"(xo): Setting x equal to x9 in 
equation (4) gives 
$" (xo) = —p(xo) (xo) — q(xo) (x0) = —p(xo)ai — q(xo) o. 
Hence, using equation (3) with m = 2, we find that a2 is given by 
21a) = $" (xo) = —p(xo)ai — q(xo) ao. (5 
To determine a3, we differentiate equation (4) and then set x equal to xo, obtaining 
Blas = ó" (xo) = P) 4006 Q)) |a, 


= —2!p(xo)a» — Cp'(xo) + q(xo))a1 — q'(xo)ao. (6) 


Substituting for a» from equation (5) gives a3 in terms of a, and ao. 
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Since P, Q, and R are polynomials and P(xo) + 0, all the derivatives of p and q exist 
at xy. Hence we can continue to differentiate equation (4) indefinitely, determining after each 
differentiation the successive coefficients a4, as, ... by setting x equal to xo. 


EXAMPLE 1 


Let y = ó (x) be a solution of the initial value problem (1 + x?) y" + 2xy' + 4x?y = 0, y(0) = 0, 
y'(0) = 1. Determine $"(0), à" (0), and 60). 


Solution: 


To find $"(0), simply evaluate the differential equation when x = 0: 
(14-0) 9"(0) +2-0-4'(0) - 4-0? - 6(0) — 0, 


so $"(0) = 0. 
To find $"(0), differentiate the differential equation with respect to x: 


(1 4- x7) 9" (x) + 2xó" (x) + 2x" (x) 4- 26'(x) + Ax^ (x) + 8xó (x) = 0. (7) 
Then evaluate the resulting equation (7) at x = 0: 
$"(0) +26'(0) — 0. 


Thus $"(0) = —29'(0) = —2 (because $'(0) = 1). 
Finally, to find ¢ (0), first differentiate equation (7) with respect to x: 


(1 +x) G(x) + 2x6" (x) + Axó" (x) *- 46" (x) + (2-- 4?) 6" X) + 8x0") 
+ 8x$'(x) 4-86 (x) — 0. 
Evaluating this equation at x — 0 we find 
69 (0) +64"(0) + 86(0) = 0. 
Finally, using $ (0) = 0 and $"(0) = 0, we conclude that 9 (9 (0) = 0. 


Notice that the important property that we used in determining the a, was that we could 
compute infinitely many derivatives of the functions p and q. It might seem reasonable to 
relax our assumption that the functions p and q are ratios of polynomials and simply require 
that they be infinitely differentiable in the neighborhood of xo. Unfortunately, this condition 
is too weak to ensure that we can prove the convergence of the resulting series expansion for 
y = (x). What is needed is to assume that the functions p and q are analytic at xo; that 
is, they have Taylor series expansions that converge to them in some interval about the point 


Xo: 
oo 
P(x) = po + px — Xo) b pO xem px O 
n=0 
oo 
q(x) = qo + dY(x — xo) qax — x9)" + M au(x = x0)". 9) 
n=0 


With this idea in mind, we can generalize the definitions of an ordinary point and a singular 
point of equation (1) as follows: if the functions p(x) = Q(x) /P(x) and q(x) = R(x) /P(x) 
are analytic at xo, then the point xo is said to be an ordinary point of the differential equation 
(1); otherwise, it is a singular point. 

Now let us turn to the question of the interval of convergence of the series solution. 
One possibility is actually to compute the series solution for each problem and then to apply 
one of the tests for convergence of an infinite series to determine its radius of convergence. 
Unfortunately, these tests require us to obtain an expression for the general coefficient a, as 
a function of n, and this task is often quite difficult, if not impossible; recall Example 3 in 
Section 5.2. However, the question can be answered at once for a wide class of problems by 
the following theorem. 
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Theorem 5.3.1 


If xo is an ordinary point of the differential equation (1) 
P(x)y" + O(x)y’ + R(x)y =0, 


that is, if p(x) = Q(x) / P(x) and q(x) = R(x) / P(x) are analytic at xo, then the general solution 
of equation (1) is 

oo 

y= y» — xo)" = agyi(x) + a1y2(x), 

n=0 
where ay and a, are arbitrary, and y, and y» are two power series solutions that are analytic at x. 
The solutions y, and y? form a fundamental set of solutions. Further, the radius of convergence for 
each of the series solutions y; and y» is at least as large as the minimum of the radii of convergence 
of the series for p and q. 


To see that y, and y» are a fundamental set of solutions, note that they have the form 
yi(x) = 1--by(x — xg)? +--+ and yo(x) = (x — xo) --ex( x —xg)? + +++, where bp +c) = a5. 
Hence y, satisfies the initial conditions yi(xo) = 1, yi (xo) = 0, and y» satisfies the initial 
conditions y2(x9) = 0, y5(xo) = 1. Thus W[y1, y2](x0) = 1. 

Also note that although calculating the coefficients by successively differentiating the 
differential equation is excellent in theory, it is usually not a practical computational procedure. 
Rather, you should substitute the series (2) for y in the differential equation (1) and determine 
the coefficients so that the differential equation is satisfied, as in the examples in the preceding 
section. 

We will not prove this theorem, which in a slightly more general form was established by 
Fuchs. What is important for our purposes is that there is a series solution of the form (2) and 
that the radius of convergence of the series solution cannot be less than the smaller of the radii 
of convergence of the series for p and q; hence we need only determine these. 

This can be done in either of two ways. Again, one possibility is simply to compute the 
power series for p and q and then to determine the radii of convergence by using one of the 
convergence tests for infinite series. However, there is an easier way when P(x), Q(x), and 
R(x) are polynomials. It is shown in the theory of functions of a complex variable that the 
ratio of two polynomials, say, Q(x) / P(x), has a convergent power series expansion about a 
point x = xo if P(xo) + 0. Further, if we assume that any factors common to Q(x) and P(x) 
have been canceled, then the radius of convergence of the power series for Q( x) / P(x) about 
the point xo is precisely the distance from x to the nearest zero of P(x). In determining this 
distance, we must remember that P(x) — 0 may have complex roots, and these must also be 
considered. 


EXAMPLE 2 


What is the radius of convergence of the Taylor series for (1 + x?) ^! about x = 0? 


Solution: 


One way to proceed is to find the Taylor series in question, namely, 
1 
1+ x? 


=| x24 x4 xO fee + (1) x eee, 


Then it can be verified by the ratio test that o = 1. Another approach is to note that the zeros of 
1 +x? are x = +i. Since the distance in the complex plane from 0 to i or to —i is 1, the radius of 
convergence of the power series about x = 0 is 1. 


6Lazarus Immanuel Fuchs (1833-1902), a German mathematician, was a student and later a professor at the University 
of Berlin. He proved the result of Theorem 5.3.1 in 1866. His most important research was on singular points of linear 
differential equations. He recognized the significance of regular singular points (Section 5.4), and equations whose 
only singularities, including the point at infinity, are regular singular points are known as Fuchsian equations. 
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EXAMPLE 3 


What is the radius of convergence of the Taylor series for (x? — 2x + 2)~! about x = 0? about 
x=1? 
Solution: 
First notice that 
x? —2x+2=0 

has solutions x = 1 +i. The distance in the complex plane from x = 0 to either x = 1 + i or 
x = 1—-iis 2; hence the radius of convergence of the Taylor series expansion » anx” about 
x =Ois y. tm 

The distance in the complex plane from x — 1 to either x =1+io0rx = 1 -— iis 1; hence the 


radius of convergence of the Taylor series expansion $5 b,(x — 1)" about x = 1 is 1. 
n=0 


According to Theorem 5.3.1, the series solutions of the Airy equation in Examples 2 and 
3 of the preceding section converge for all values of x and x — 1, respectively, since in each 
problem P(x) = 1 and hence is never zero. 

A series solution may converge for a wider range of x than indicated by Theorem 5.3.1, 
so the theorem actually gives only a lower bound on the radius of convergence of the series 
solution. This is illustrated by the Legendre polynomial solution of the Legendre equation 
given in the next example. 


EXAMPLE 4 


Determine a lower bound for the radius of convergence of series solutions about x = O for the 
Legendre equation 


(1 — x?) y" — 2xy' c a(a + 1)y — 0, 


where o is a constant. 


Solution: 


Note that P(x) = 1 — x’, Q(x) = —2x, and R(x) = a(a +1) are polynomials, and that the zeros 


oo 
of P, namely, x = +1, are a distance 1 from x = 0. Hence a series solution of the form 5 ax" 

n=0 
converges at least for |x| < 1, and possibly for larger values of x. Indeed, it can be shown that if 
a is a positive integer, one of the series solutions terminates after a finite number of terms, that is, 
one solution is a polynomial, and hence converges not just for |x| < 1 but for all x. For example, if 
a = 1, the polynomial solution is y = x. See Problems 17 through 23 at the end of this section for 
a further discussion of the Legendre equation. 


EXAMPLE 5 


Determine a lower bound for the radius of convergence of series solutions of the differential equation 


(1 4- x2) y" + 2xy! + 4x7y =0 (10) 


about the point x — 0; about the point x — — i. 


Solution: 


Again P, Q, and R are polynomials, and P has zeros at x = +i. The distance in the complex plane 


E jl. D 5 E: 

from 0 to +i is 1, and from — P] to ci is4/14 7 = Ex Hence in the first case the series 5 a, x" 
"- 1 " n=0 

converges at least for |x| < 1, and in the second case the series 5 bn (: + >) converges at least 


n=0 


5.3 Series Solutions Near an Ordinary Point, Part II 


An interesting observation that we can make about equation (10) follows from Theorems 3.2.1 
and 5.3.1. Suppose that initial conditions y(0) = yo and y'(0) = y, are given. Since 1 + x? Æ 0 for 
all x, we know from Theorem 3.2.1 that there exists a unique solution of the initial-value problem 
on —oo « x « oo. On the other hand, Theorem 5.3.1 only guarantees a series solution of the 

oo 


form X` anx” (with ay = yo, a) = yo) for -1 < x < 1. The unique solution on the interval 
n=0 
—oo < x < oo may not have a power series about x = 0 that converges for all x. 


EXAMPLE 6 


Can we determine a series solution about x = 0 for the differential equation 
y" +(sinx)y (14: x?)y =0, 


and if so, what is the radius of convergence? 


Solution: 


For this differential equation, p(x) = sin x and g(x) = 1 + x?. Recall from calculus that sin x has 

a Taylor series expansion about x = 0 that converges for all x. Further, q also has a Taylor series 

expansion about x = 0, namely, g(x) = 1 + x?, that converges for all x. Thus there is a series 
oo 
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n=0 


Problems 


In each of Problems | through 3, determine $"(xo), $"(xg), and 
o” (xo) for the given point x if y = ¢ (x) is a solution of the given 
initial-value problem. 

l. y’+xy'+y=0; y(0)=1, y(0) =0 

2. xy" + +x)y +3nx)y=0; y(D =2, y(1) 20 

3. y'-cx?y' c£ (sinx)y 20; y(0) =a, y (0) =a 
In each of Problems 4 through 6, determine a lower bound for the 
radius of convergence of series solutions about each given point xo 
for the given differential equation. 

4. y"+4y'’+6xy=0; x =0, x =4 

5. (x2 -2x—3)y"+-xy'+4y=0; x =4, x9— —4, x9 =0 

6. (L4+x5)y"+4xy’+y=0; x9 =0, x =2 

7. Determine a lower bound for the radius of convergence of series 


solutions about the given xo for each of the differential equations in 
Problems | through 11 of Section 5.2. 


8. The Chebyshev Equation. The Chebyshev’ differential 
equation is 


(1 — x?) y" — xy'  o?y — 0, 
where a is a constant. 


a. Determine two solutions in powers of x for |x| « 1, and 
show that they form a fundamental set of solutions. 


7Pafnuty L. Chebyshev (1821-1894), the most influential nineteenth-century 
Russian mathematician, was for 35 years professor at the University of St. 
Petersburg, which produced a long line of distinguished mathematicians. His 
study of Chebyshev polynomials began in about 1854 as part of an investigation 
of the approximation of functions by polynomials. Chebyshev is also known 
for his work in number theory and probability. 


solution of the form y = Y a,x" with ag and a, arbitrary, and the series converges for all x. 


b. Show that if o is a nonnegative integer n, then there is 
a polynomial solution of degree n. These polynomials, when 
properly normalized, are called the Chebyshev polynomials. 
They are very useful in problems that require a polynomial 
approximation to a function defined on —1 < x < 1. 
c. Find a polynomial solution for each of the cases a = n = 0, 
1,:2..3: 
For each of the differential equations in Problems 9 through 11, find 
the first four nonzero terms in each of two power series solutions about 
the origin. Show that they form a fundamental set of solutions. What 
do you expect the radius of convergence to be for each solution? 


9. y"-r(sinx)y =0 
10. ey” +xy=0 
11. (cosx)y”+xy'’-2y 20 
12. Let y = x and y = x? be solutions of a differential equation 
P(x)y" + Q(x) y' + R(x)y = 0. Can you say whether the point 
x = Q is an ordinary point or a singular point? Prove your answer. 
First-Order Equations. The series methods discussed in this section 
are directly applicable to the first-order linear differential equation 
P(x)y' + Q(x)y = 0 at a point xo, if the function p = Q/P has 
a Taylor series expansion about that point. Such a point is called an 
ordinary point, and further, the radius of convergence of the series 

oo 


y = Yo an(x — xo)" is at least as large as the radius of convergence 
n=0 

of the series for Q/ P. In each of Problems 13 through 16, solve the 
given differential equation by a series in powers of x and verify that 
do is arbitrary in each case. Problem 17 involves a nonhomogeneous 
differential equation to which series methods can be easily extended. 
Where possible, compare the series solution with the solution obtained 
by using the methods of Chapter 2. 
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13. y'- y 20 
14. y'-xy 20 
15. (l-x)wW=y 
16. y - yx? 


The Legendre Equation. Problems 17 through 23 deal with the 
Legendre? equation 


(1 — x2) y" 2 2xy' +a(a+ Dy 2 0. 


As indicated in Example 4, the point x = 0 is an ordinary point of 
this equation, and the distance from the origin to the nearest zero 
of P(x) = 1 — x? is 1. Hence the radius of convergence of series 
solutions about x = 0 is at least 1. Also notice that we need to 
consider only a > —1 because if a < —1, then the substitution 
a = —(1 + y), where y > 0, leads to the Legendre equation 
(1 — x?) y" — 2xy' + yCy + Dy =0. 

17. Show that two solutions of the Legendre equation for |x| « 1 
are 


a (a + 1) E a (a —2)(a + D(a +3) 4 
2! 4! 


yi(x) = 1 


25% ppm iar = 2m Dla + D(a 2m - D oy 


(2m)! 
m=3 
(a) =e GED 
Pa — D(o —3)(a+2)(a +A) s 


5! 


tcp" 


m=3 


fo =D (a = 2m + Dla +2)---(a 42m) oni 


(2m + 1)! 


18. Show that if o is zero or a positive even integer 2n, the series 
solution y; reduces to a polynomial of degree 27 containing only even 
powers of x. Find the polynomials corresponding to a = 0, 2, and 4. 
Show that if o is a positive odd integer 2n + 1, the series solution y2 
reduces to a polynomial of degree 2n + 1 containing only odd powers 
of x. Find the polynomials corresponding to a = 1,3, and 5. 


19. The Legendre polynomial P,(x) is defined as the polynomial 
solution of the Legendre equation with a = n that also satisfies the 
condition P,(1) = 1. 

a. Using the results of Problem 18, find the Legendre 


polynomials Po(x), ... , Ps(x). 
Q b. Plot the graphs of Po(x), ... , Ps(x) for—1 <x < 1. 
Q c. Find the zeros of Po(x), ... , Ps(x). 


8 Adrien-Marie Legendre (1752-1833) held various positions in the French 
Académie des Sciences from 1783 onward. His primary work was in the fields 
of elliptic functions and number theory. The Legendre functions, solutions of 
Legendre’s equation, first appeared in 1784 in his study of the attraction of 
spheroids. 


20. The Legendre polynomials play an important role in 
mathematical physics. For example, in solving Laplace’s equation 
(the potential equation) in spherical coordinates, we encounter the 
equation 


dF 
PED) boot (p) 
dy? do 


tn(n+1)F(y) =0, 0«wgoc«m, 


where n is a positive integer. Show that the change of variable 
x = cos y leads to the Legendre equation with a = n for 
y = f(x) = F(arccosx). 


21. Show that for n = 0, 1, 2, 3, the corresponding Legendre 
polynomial is given by 


n 


2"n! dx" 


P(x) = (x? = Dy. 


This formula, known as Rodrigues's formula,’ is true for all positive 
integers n. 


22. Show that the Legendre equation can also be written as 
(1 -xy 2 -o(a 4 Dy. 
Then it follows that 


(A =x?) PI) = -a(n + 1) PG) 


and 


(Q — x?) P(x)’ = —m(m + 1) P4). 


By multiplying the first equation by P,,(x) and the second equation 
by P(x), integrating by parts, and then subtracting one equation from 
the other, show that 


1 
i: P(X) Pn(x)dx = 0 ifn + m. 


1 


This property of the Legendre polynomials is known as the 
orthogonality property. If m = n, it can be shown that the value of 
the preceding integral is 2/(2n 4- 1). 


23. Given a polynomial f of degree n, it is possible to express f as 
a linear combination of Po, Pj, Po, ... , Pr: 


n 


fo) = 3a PO. 


k=0 
Using the result of Problem 22, show that 


| 2k+1 


ak 2 


1 
1 f(x) PG dx. 
ERI 


?Benjamin Olinde Rodrigues (1795-1851) published this result as part of his 
doctoral thesis from the University of Paris in 1815. He then became a banker 
and social reformer but retained an interest in mathematics. Unfortunately, his 
later papers were not appreciated until the late twentieth century. 
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54 Euler Equations; Regular Singular Points 


In this section we will begin to consider how to solve equations of the form 
P(x)y" + QG)y' + R(x) y =0 (1) 


in the neighborhood of a singular point xo. Recall that if the functions P, Q, and R are 
polynomials having no factors common to all three of them, then the singular points of 
equation (1) are the points for which P(x) = 0. 


Euler Equations. A relatively simple differential equation that has a singular point is the 
Euler equation!’ 


L[y] = x^y" + axy' + By =0, (2) 
where a and 8 are real constants. Then P(x) = x”, Q(x) = ax, and R(x) = 8. If B 40, 
then P(x), Q(x), and R(x) have no common factors, so the only singular point of equation (2) 
is x = 0; all other points are ordinary points. For convenience we first consider the interval 
x > 0; later we extend our results to the interval x < 0. 
Observe that (x")’ = rx’~! and (x")” = r(r — 1)x'~?. Hence, if we assume that 
equation (2) has a solution of the form 
yox, (3) 
then we obtain 
L[x^] =Z x(x" t+ax(x')’ + Bx" 
= x?^r(r — Dx"? +.ax(rx"!) + Bx" 
=x'(r(r-1)+ar+8). (4) 
If r is a root of the quadratic equation 
F(r) 2r(r — 1) t ar * 8 —0, (5) 


then L[x"] is zero, and y = x" is a solution of equation (2). The roots of equation (5) are 


—(q—1Dzc va — 1)? — 48 
boe (o — 1) (o — 1) 38 6 
2 
and the quadratic polynomial F(r) defined in equation (5) can also be written as F(r) = 
(r — ri)(r — r2). Mirroring the treatment of second-order linear differential equations with 
constant coefficients, we consider separately the cases in which the roots are real and different, 
real but equal, and complex conjugates. Indeed, the entire discussion of Euler equations is 
similar to the treatment of second-order linear equations with constant coefficients in Chapter 


3, with e"* replaced by x”. 


Real, Distinct Roots. If F(r) = 0 has real roots rı and rz, with rı Z r», then yj(x) = x” 
and y2(x) = x” are solutions of equation (2). Since 


Wie", x^] = (r2 — ri) ntn 
is nonzero for rı 4 r2 and x > 0, it follows that the general solution of equation (2) is 


yocox"-ctox^, x90. (7) 


Note that if r is not a rational number, then x" is defined by x” = e" In 


EXAMPLE 1 


Solve 


2x?y" p 3xy'- y 20, x>0. (8) 


V'This equation is sometimes called the Cauchy — Euler equation or the equidimensional equation. Euler studied it in 
about 1740, but its solution was known to Johann Bernoulli before 1700. 
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Solution: 


Substituting y = x" in equation (8) gives 


x'(2r(r — 1) -3r— 1) 2 x'(27? +r 21) 2 x'(2r — D(r + 1) 20. 


1 
Hence r; = 5 and rz = —1, so the general solution of equation (8) is 


y2cx? exl, x 0. (9) 


Equal Roots. Ifthe roots rı and rz are equal, then we obtain only one solution y,(x) = x” 
of the assumed form. A second solution can be obtained by the method of reduction of order, 
but for the purpose of our future discussion we consider an alternative method. Since r; = r2, it 
follows that F(r) = (r —r;)?. Thus in this case, not only does F(r,) = O but also F'(rj) = 0. 
This suggests differentiating equation (4) with respect to r and then setting r equal to rı. By 
differentiating equation (4) with respect to r, we obtain 


ð T]. a r — ð r 2 
5b) = 5-2 FOI = uen 


= (r —r)?x" lnx + 2(r —11)x". (10) 


However, by interchanging differentiation with respect to x and with respect to r, we also 

obtain 

o o 

—L[x']- L|—x 
lx] Š 


8r | = Lix In x]. 


The right-hand side of equation (10) is zero for r = rı; consequently, L[x" In x] = 0 also. 
Therefore, a second solution of equation (2) is 


y(x) =x" lnx, x»90. (11) 
By evaluating the Wronskian of yand y», we find that 
Wx", x^ Inx] 2 x21, 


Hence x” and x" In x are a fundamental set of solutions for x > 0, and the general solution 
of equation (2) is 


y=(cy teolnx)x", x90. (12) 
EXAMPLE 2 
Solve 
xy" + 5xy'+4y 20, x > 0. (13) 
Solution: 


Substituting y = x" in equation (13) gives 
x'(r(r 2 1) 5r +4) =x" (P +4r +4) =0. 
Hencer, = r2 = —2, and 


y=x7(cy tox), x»0 (14) 


is the general solution of equation (13). 


Complex Roots. Finally, suppose that the roots r; and r2 of equation (5) are complex 
conjugates, say, r} = A + ip andr, = A — ip, with jj z 0. We must now explain what 
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is meant by x^ when r is complex. Remembering that 
x = ef nx (15) 


when x > 0 andr is real, we can use this equation to define x" when r is complex. Then, 
using Euler's formula for ei” "*, we obtain 


x^ tin (A+ip) lnx ed nx gin Inx A jip Inx 


=e =x e 


= x^ (cos(uInx) --isin(uInx)), x > 0. (16) 


With this definition of x" for complex values of r, it can be verified that the usual laws 
of algebra and differential calculus hold, and hence x" and x” are indeed solutions of 
equation (2). The general solution of equation (2) is 


y = cux" peux? cin, (17) 


The disadvantage of this expression is that the functions x^*" and x^-!" are complex-valued. 
Recall that we had a similar situation for the second-order differential equation with constant 
coefficients when the roots of the characteristic equation were complex. Just as we did then, 
we can use Theorem 3.2.6 to obtain real-valued solutions of equation (2) by taking the real 
and imaginary parts of x^*'" , namely, 


x^ cos( u lnx) and x^ sin( u In x). (18) 
A straightforward calculation shows (see Problem 29) that 
W [x* cos( u In x), x^ sin( u In x)] = a 


Hence these solutions form a fundamental set of solutions for x > 0, and the general solution 
of the Euler equation (2) is 


y= cux? cos( u In x) + eux sin(u lnx), x > 0. (19) 
EXAMPLE 3 
Solve 
x2y" 4 xy! t y — 0. (20) 
Solution: 


Substituting y — x" in equation (20) gives 
x'(r(r - 1) +r +1) 2 x'(r? +1) «0. 
Hence r = +i, and the general solution is 
y = c cos(lnx) + c2 sin(Inx), x> 0. (21) 


The factor x^ does not appear explicitly in equation (21) because in this example \ = 0 and x^ = 1. 


Now let us consider the qualitative behavior of the solutions of equation (2) near the 
singular point x = 0. This depends entirely on the values of the exponents rı and r3. First, if r 
is real and positive, then x” — 0 as x tends to zero through positive values. On the other hand, 
if r is real and negative, then x” becomes unbounded. Finally, if r = 0, then x” = 1. Figure 
5.4.1 shows these possibilities for various values of r. If r is complex, then a typical solution 
is x^ cos( u In x). This function becomes unbounded or approaches zero if A is negative or 
positive, respectively, and also oscillates more and more rapidly as x — 0. This behavior is 
shown in Figures 5.4.2 and 5.4.3 for selected values of \ and js. If A = 0, the oscillation is of 
constant amplitude. Finally, if there are repeated roots, then one solution is of the form x" In x, 
which tends to zero ifr > 0 and becomes unbounded if r < 0. An example of each case is 
shown in Figure 5.4.4. 
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ya 
10 y 2 x 2 cos(5 In x) 
| L> 
0.125 0.25 -0:375 5 x 
-10. 5 
Solution of an Euler equation; Solution of an Euler equation; 
complex roots with negative real part. complex roots with positive real part. 


The two typical second 


solutions of an Euler equation with equal roots: 
r > O(red), r < 0 (blue). 


The extension of the solutions of equation (2) into the interval x < 0 can be carried out 
in a relatively straightforward manner. The difficulty lies in understanding what is meant by 
x" when x is negative and r is not an integer; similarly, In x has not been defined for x < 0. 
The solutions of the Euler equation that we have given for x > 0 can be shown to be valid 
for x < 0, but in general they are complex-valued. Thus in Example 1 the solution x!/? is 
imaginary for x « 0. 

Itis always possible to obtain real-valued solutions of the Euler equation (2) in the interval 
x < 0 by making the following change of variable. Let x = —£, where € > 0, and let 
y —u(£). Then we have 


dy dud _ du qu d du dE du 
dx d&dx dë? dx? dë\ d&£/Jdx dê? 


(22) 
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Thus, for x < 0, equation (2) takes the form 


d^ d 
OF tage + fu =0, E> 0. (23) 


But except for names of the variables, this is exactly the same as equation (2); from 
equations (7), (12), and (19), we have 


cié” + eé” ifr; and rz are real-valued and different 


(ci + c5 In £) £^! ifr, and r are real-valued with r4 = rz 


u(g) = 


c1£? cos(p In £) + c5£? sin( u In £) ifr). = A cx ip are complex-valued 


(u #0), 
(24) 


depending on the nature of the zeros of F(r) = r(r — 1) +ar+ 8 = 0. To obtain u in terms 
of x, we replace € by —x in equations (24). 

We can combine the results for x > 0 and x < 0 by recalling that |x| = x when x > 0 
and that |x| = —x when x « 0. Thus we need only replace x by |x| in equations (7), (12), and 
(19) to obtain real-valued solutions valid in any interval not containing the origin. 

Hence the general solution of the Euler equation (2) 


n 


xy" +axy' + By =0 


in any interval not containing the origin is determined by the roots rı and r» of the equation 
F(r) 2r(r - 1) c ar - 8 -0 
as follows. If the roots rı and rz are real and different, r12 = À + i u, then 
y = cilx[^ + ealxp^. Q5) 
If the roots are real and equal, then 


y = Cc eon |x|) |x|". (26) 


If the roots are complex conjugates, rji» = A + ip, then 
y = Ixl^ (ei cosC y In |x|) + c» sin(yu In [x ) ). (27) 
The solutions of an Euler equation of the form 
(x — x9?y" + a(x — xo) y + By =0 (28) 


are similar. If we look for solutions of the form y = (x — xo)", then the general solution is 
given by equation (25), equation (26), or equation (27) with x replaced by x —xo. Alternatively, 
we can reduce equation (28) to the form of equation (2) by making the change of independent 
variable t = x — Xo. 


Regular Singular Points. We now return to a consideration of the general equation (1) 
P(x)y" + QG) y + R(x)y =0, 


where x is a singular point. This means that P(xo) = O0 and that at least one of Q and R is 
not zero at x. 

Unfortunately, if we attempt to use the methods of the preceding two sections to solve 
equation (1) in the neighborhood of a singular point xo, we find that these methods fail. This 
is because the solution of equation (1) is often not analytic at xy and consequently cannot be 
represented by a Taylor series in powers of x — xo. Examples 1, 2, and 3 illustrate this fact; in 
each of these examples, the solution fails to have a power series expansion about the singular 
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point x = 0. Therefore, to have any chance of solving equation (1) in the neighborhood of a 
singular point we must use a more general type of series expansion. 

Since the singular points of a differential equation are usually few in number, we might 
ask whether we can simply ignore them, especially since we already know how to construct 
solutions about ordinary points. However, this is not feasible. The singular points determine 
the principal features of the solution to a much larger extent than you might at first suspect. 
In the neighborhood of a singular point the solution often becomes large in magnitude or 
experiences rapid changes in magnitude. For example, the solutions found in Examples 1, 
2, and 3 are illustrations of this fact. Thus the behavior of a physical system modeled by a 
differential equation frequently is most interesting in the neighborhood of a singular point. 
Often geometric singularities in a physical problem, such as corners or sharp edges, lead to 
singular points in the corresponding differential equation. Thus, although at first we might 
want to avoid the few points where a differential equation is singular, it is precisely at these 
points that it is necessary to study the solution most carefully. 

As an alternative to analytical methods, we can consider the use of numerical methods, 
which are discussed in Chapter 8. However, these methods are ill suited for the study of 
solutions near a singular point. Thus, even if we adopt a numerical approach, it is advantageous 
to combine it with the analytical methods of this chapter in order to examine the behavior of 
solutions near singular points. 

Without any additional information about the behavior of Q/P and R/P in the 
neighborhood of the singular point, it is impossible to describe the behavior of the solutions 
of equation (1) near x — xo. It may be that there are two distinct solutions of equation (1) 
that remain bounded as x — xo (as in Example 3); or there may be only one, with the other 
becoming unbounded as x — xo (as in Example 1); or they may both become unbounded 
as x — xo (as in Example 2). If equation (1) has solutions that become unbounded as 
X — xo, it is often important to determine how these solutions behave as x — xo. For 
example, does y — oo in the same way as (x — xo) ! or |x — xo|~!/?, or in some other 
manner? 

Our goal is to extend the method already developed for solving equation (1) near an 
ordinary point so that it also applies to the neighborhood of a singular point x9. To do this 
in a reasonably simple manner, it is necessary to restrict ourselves to cases in which the 
singularities in the functions Q/ P and R/ P at x = x are not too severe—that is, to what 
we might call “weak singularities.” At this stage it is not clear exactly what is an acceptable 
singularity. However, as we develop the method of solution, you will see that the appropriate 
conditions (see also Section 5.6, Problem 16) to distinguish “weak singularities” are 


n XJ! oL nus 
erii — Xo) vm is finite (29) 


and 


lim (x — x9)? is finite. (30) 


X— Xo P(x 


This means that the singularity in Q / P can be no worse than (x — xo) ! and the singularity 
in R/ P can be no worse than (x — xo) ?. Such a point is called a regular singular point of 
equation (1). For equations with more general coefficients than polynomials, xo is a regular 
singular point of equation (1) if it is a singular point and if both!! 


Q(x) and (x sa 
P(x) P(x) 


(x — xo) (31) 


have convergent Taylor series about xo — that is, if the functions in equation (31) are analytic 
at x — xo. Equations (29) and (30) imply that this will be the case when P, Q, and R are 
polynomials. Any singular point of equation (1) that is not a regular singular point is called an 
irregular singular point of equation (1). 


11The functions given in equation (31) may not be defined at xy, in which case their values at x9 are to be assigned 
as their limits as x — xo. 
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Observe that the conditions in equations (29) and (30) are satisfied by the Euler equation 
(28). Thus the singularity in an Euler equation is a regular singular point. Indeed, we will see 
that all equations of the form (1) behave very much like Euler equations near a regular singular 
point. That is, solutions near a regular singular point may include powers of x with negative 
or nonintegral exponents, logarithms, or sines or cosines of logarithmic arguments. 

In the following sections we discuss how to solve equation (1) in the neighborhood 
of a regular singular point. A discussion of the solutions of differential equations in the 
neighborhood of irregular singular points is more complicated and may be found in more 
advanced books. 


EXAMPLE 4 


Determine the singular points of the Legendre equation 
(1 — x*)y” —2xy’+a(a+ly=0 (32) 
and determine whether they are regular or irregular. 


Solution: 

In this case P(x) = 1 — x7, so the singular points are x = 1 and x = —1. Observe that when we 
divide equation (32) by 1 — x?, the coefficients of y' and y are —2x/(1—x?) anda(a+1)/(1—x’), 
respectively. We consider the point x — 1 first. Thus, from equations (29) and (30), we calculate 


; —2x . (x —1)(—2x) . 2x 
lim(x — 1) — lim — lim =1 
xl 1-3? xoi (1 — x)C(1 4 x) yor l+x 
and 
1 -p? 1 
jute iia _ agg eee) 
x1 l=% x1 (1—x)(1+ x) 
. (x — DC-o)(o +1) 
— lim = Q0. 
x1 1 +x 
Since these limits are finite, the point x = 1 is a regular singular point. 
It can be shown in a similar manner that x = —1 is also a regular singular point. 


EXAMPLE 5 


Determine the singular points of the differential equation 


2x(x — 2)?y" +3xy +(x —2)y 20 
and classify them as regular or irregular. 


Solution: 


Dividing the differential equation by 2x(x — 2)?, we have 


1 


E = 0, 
2x 2)? "2x(x-2)) 


y" 4 


gan E CN 
59 PV TUB)  O2x-—2)0 49. — BG) —2x(x-2) 


and x — 2. Consider x — 0. We have 


. The singular points are x — 0 


3 
lim xp(x) = lim x ————— = 0, 
x0 x0 2(x = 2)? 
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and 


lim x?q(x) = lim x 0. 


x0 x0 


2 -Á —— 4 
2x(x — 2) 
Since these limits are finite, x = 0 is a regular singular point. 
For x — 2 we have 


3 
= lim ; 
(x-2)? x2 2(x — 2) 


lim(x — 2) p(x) = lim(x 25 


x2 x2 


so the limit does not exist; hence x — 2 is an irregular singular point. 


EXAMPLE 6 


Determine the singular points of 


2 
T n , * 
x— a y +(cosx)y + (sinx)y =0 
and classify them as regular or irregular. 


Solution: 


T 
The only singular point is x — 3 To study it, we consider the functions 


T i T \ Q(x) | cosx 
Peg pcm EE eae 


and 


Starting from the Taylor series for cos x about x — = we find that 


cosx L 4 © 27/2? Gant 
x—7/2 3! 5! 


which converges for all x. Similarly, sin x is analytic at x — > Therefore, we conclude that = isa 


regular singular point for this equation. 


Problems 


In each of Problems 1 through 8, determine the general solution of the 
given differential equation that is valid in any interval not including 
the singular point. 
1. x?y" +4xy'+2y=0 
(x + D?y" -3(x --1)y' -0.75y 20 
x2y" — 3xy! - Ay 20 
x2y" — xy! +y —0 
xy” + 6xy' - y «0 
2x? y" — Axy’ +6y 20 
x2y" — 5xy! -9y «0 
(x — 2)?y" + 5(x —2)y' + 8y 20 


SO eS 


In each of Problems 9 through 11, find the solution of the given initial- 
value problem. Plot the graph of the solution and describe how the 
solution behaves as x — 0. 

Q 9. 2x?y"+xy' -3y =0, 
Q 10. 4x2,7-8xy/-17y 20, y(D22 y(D)2-3 

O 11. x2y7"—-3xy 24y 20, y(-D 22, y(-D =3 

In each of Problems 12 through 23, find all singular points of the given 
equation and determine whether each one is regular or irregular. 

12. xy"+(—-x)y'+xy =0 

13. x?(1- x)?y" + 2xy’ +4y = 0 


yi) =1, y(1) 24 


14. x?^(1—x)y" +(x —2)y’ — 3xy 20 
15. x*(1—x?)y"+ BED 
x 


16. (1—x2)?y" e x(1 x)y' --(1--x)y 20 
17. xy"4xy +(x? — v?)y=0 
18. (x +2)?(x — Dy” +3(% — Dy’ -2(x +-2)y = 0 
19. x(3—x)y”+(x4+1)y’-2y =0 

20. xy” +e*y’+ (3cosx)y =0 

21. y”+ (n|x|)y’ + 3xy =0 

22. (sinx)y”+xy’+4y=0 

23. (xsinx)y”+3y’+xy =0 

24. Find all values of o for which all solutions of 


(Bessel equation) 


5 
"Laxy + = 0 approach zero as x > 0. 


xy 
25. Find all values of 3 for which all solutions of 
x2y" + By = 0 approach zero as x — 0. 


26. Find y so that the solution of the initial-value problem 

x?y" — 2y = 0, y(1) = 1, y'(1) = y is bounded as x — 0. 

27. Consider the Euler equation x?y" + axy’ + By = 0. Find 
conditions on a and 5 so that: 

All solutions approach zero as x — 0. 

All solutions are bounded as x — 0. 

All solutions approach zero as x — oo. 

All solutions are bounded as x — oo. 

All solutions are bounded both as x — 0 and as x — oo. 


propp 


28. Using the method of reduction of order, show that if rı is a 
repeated root of 

r(ir-l)+ar+6=0, 
then x"! and x"! In x are solutions of x?y" Fa xy'-- dy = 0forx > 0. 
29. Verify that W[x^ cos( u In x), x^ sin( u Inx)] = ux? -. 


In each of Problems 30 and 31, show that the point x = O isa 
regular singular point. In each problem try to find solutions of the 
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oo 
form 5 anx”. Show that (except for constant multiples) there is only 
n=0 
one nonzero solution of this form in Problem 30 and that there are 
no nonzero solutions of this form in Problem 31. Thus in neither case 
can the general solution be found in this manner. This is typical of 


equations with singular points. 

30. 2xy’+3y’+xy =0 

31. 2x?y" 4 3xy’—-(14+-x)y =0 

32. Singularities at Infinity. The definitions of an ordinary point 
and a regular singular point given in the preceding sections apply 
only if the point xp is finite. In more advanced work in differential 
equations, it is often necessary to consider the point at infinity. This 


is done by making the change of variable £ — 1/x and studying the 
resulting equation at € = 0. Show that, for the differential equation 


P(x)y" + Q(x) y' + R(x)y =0, 


the point at infinity is an ordinary point if 


l (oe e RO/£) 
and 
P(1/£) t e DIOS 

have Taylor series expansions about £ — 0. Show also that the point at 


infinity is a regular singular point if at least one of the above functions 
does not have a Taylor series expansion, but both 


$ 2P(1/€)  Q(1/6) "T 
P(1/&) $ E 


do have such expansions. 


RU/§) 
EDS 


In each of Problems 33 through 37, use the results of Problem 32 to 
determine whether the point at infinity is an ordinary point, a regular 
singular point, or an irregular singular point of the given differential 
equation. 


33. y" y-0 
34. x?y" p xy C 4y 20 

35. (1—x2)y" — 2xy' c a(a +1)y=0 
36. y”’—2xy’+ Ay =0 
37. y’-xy=0 


(Legendre equation) 
(Hermite equation) 


(Airy equation) 


55 Series Solutions Near a Regular 


Singular Point, Part | 


We now consider the question of solving the general second-order linear differential equation 


P(x)y" + O(x)y’ + R(x)y =0 


(1) 


in the neighborhood of a regular singular point x = x. For convenience we assume that 
Xo = 0. If xo 4 0, the equation can be transformed into one for which the regular singular 


point is at the origin by letting x — x9 equal f. 
The assumption that x = 


O is a regular singular point of equation (1) means that 


xQ(x)/ P(x) = xp(x) and x? R(x)/ P(x) = x?q(x) have finite limits as x — 0 and are 
analytic at x = 0. Thus they have convergent power series expansions of the form 


oo 
xp(x) = x Pax”, 


n=0 


(Vulg = Y ge Q) 
n=0 
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on some interval |x| « p about the origin, where p > 0. To make the quantities xp(x) and 
x?q(x) appear in equation (1), it is convenient to divide equation (1) by P(x) and then to 
multiply by x?, obtaining 


x?y" + x(xp(x)) y' + (x4(2)) y =0, (3) 
or 


x?y" + x( po + pix c pax" y 
+ (qo dix tes qax" o )y =0. e 


Notice that the first terms of xp( x) and of x^g(x) are 


X X 
= lim —— d = lim ————. 5 
Pos mp) 2M 40 = OP) d 


If all other coefficients p, and qn for n > 1 in equation (2) are zero, then equation (4) reduces 
to the Euler equation 


x?y" + poxy’ + qoy = 0, (6) 


which was discussed in the preceding section. 

In general, of course, some of the coefficients p, and qn, n > 1, are not zero. However, 
the essential character of solutions of equation (4) in the neighborhood of the singular point is 
identical to that of solutions of the Euler equation (6). The presence of the terms pix 4- --- + 
Pox" +--+ and qix +-+- qux" e merely complicates the calculations. 

We restrict our discussion primarily to the interval x > 0. The interval x < 0 can be 
treated, just as for the Euler equation, by making the change of variable x = —€ and then 
solving the resulting equation for € > 0. 

The coefficients in equation (4) can be viewed as “Euler coefficients" times power series. 
To see this, you can write the coefficient of y' in equation (4) as 

por (14 Pace Pate sce Prata, 
po po Po 
and similarly for the coefficient of y. Thus it may seem natural to seek solutions of equation (4) 
in the form of “Euler solutions" times power series. Hence we assume that 
oo oo 
y= x" (ao Fajk anx” sees ) =x" 5 anx” = 5 aa, (7) 
n=0 n=0 


where ao + 0. In other words, r is the exponent of the first nonzero term in the series, and ag 
is its coefficient. As part of the solution, we have to determine: 


1. The values of r for which equation (1) has a solution of the form (7) 


2. The recurrence relation for the coefficients a, 
oo 

3. The radius of convergence of the series > a, x" 
n=0 


The general theory was constructed by Frobenius! and is fairly complicated. Rather than 
trying to present this theory, we simply assume, in this and the next two sections, that there 
does exist a solution of the stated form. In particular, we assume that any power series in an 
expression for a solution has a nonzero radius of convergence and concentrate on showing 
how to determine the coefficients in such a series. To illustrate the method of Frobenius, we 
first consider an example. 


Ferdinand Georg Frobenius (1849-1917) grew up in the suburbs of Berlin, received his doctorate in 1870 from the 
University of Berlin, and returned as professor in 1892. For most of the intervening years he was professor at the 
Eidgenóssische Polytechnikum at Zürich. He showed how to construct series solutions about regular singular points 
in 1874. His most distinguished work, however, was in algebra, where he was one of the foremost early developers 
of group theory. 


5.5 Series Solutions Near a Regular Singular Point, Part | 


EXAMPLE 1 


Solve the differential equation 


2x2y" — xy! + (14+ x)y =0. (8) 


Solution: 


It is easy to show that x = 0 is a regular singular point of equation (8). Further, xp(x) = —1/2 and 
x?q(x) = (1 + x) /2. Thus po = —1/2, qo = 1/2, qı = 1/2, and all other p,'s and q,'s are zero. 
Then, from equation (6), the Euler equation corresponding to equation (8) is 


n 


2x?y" — xy! +y =0. (9) 


To solve equation (8), we assume that there is a solution of the form (7). Then y’ and y" are 
given by 


oo 
y= Soar t n)x t”! (10) 
n=0 
and 
oo 
y= So alr +n +n- Dann. (11) 


n=0 


By substituting the expressions for y, y’, and y” in equation (8), we obtain 


oo 
2x?y" — xy! + (1 +x)y = >D 2ay(r +n)(r +n — Dx?" 
n=0 


oo oo oo 
— 5 ay(r +n)x" t" + 5 ana 5 anx hth, (12) 
n=0 


n=0 n=0 


oo 
The last term in equation (12) can be written as > a, 1x t" 


n=1 


, so by combining the terms in 
equation (12), we obtain 


2x?y" — xy! + (1 +x)y = ag2r(r — 1) — r + 1x’ 


4 (Qe +n)(r +n—1) —(r n) + Da, + a, 4)x"*" =0. (13) 
n=l 


If equation (13) is to be satisfied for all x, the coefficient of each power of x in equation (13) must 
be zero. From the coefficient of x” we obtain, since ay Æ 0, 


2r(r 1) —r +1 =2r? 23r c1 2 (r - D) (2r - 1) 20. (14) 


Equation (14) is called the indicial equation for equation (8). Note that it is exactly the polynomial 
equation we would obtain for the Euler equation (9) associated with equation (8). The roots of the 
indicial equation are 
1 
n=l, n—-»y (15) 
These values of r are called the exponents at the singularity for the regular singular point x = 0. 
They determine the qualitative behavior of the solution (7) in the neighborhood of the singular point. 


Now we return to equation (13) and set the coefficient of x"*" equal to zero. This gives the 
relation 
(Qr Fn)rc-n-l)-—(r-n-clbDa +a =0, nzl, (16) 
or 
ü,—| 
an = 


Wr+n)2—3(r+n)4+1 


an-ı 
7 ((r +n) — 1)(2(r +n) — 1)’ nzil. (17) 
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For each root rı and rz of the indicial equation, we use the recurrence relation (17) to determine a 


set of coefficients a4, a5, ... . For r = rı = 1, equation (17) becomes 
Gn-1 
a, = ——————, nol. 
(2n 4- 1)n 
Thus 
ao 
p — ae 
3.1 
ay ao 
d} = 


5-2 (3-5)(1-2)° 


and 
a» ao 


ü3 = = E 
7.3 (3:5. 7)(1.2.3) 


In general, we have 
( —1)” 
an = ao 
(3-5-7---(2n+1))n! 
If we multiply both the numerator and denominator of the right-hand side of equation (18) by 
2-4-6- 2n = 2"n!, we can rewrite dy, as 
( ee 1)" 2n 
—Á— > 
eee A 


, nz4. (18) 


Hence, if we omit the constant multiplier ag, one solution of equation (8) is 


eo (—1)"2" 
y(x) =x 203 od , x0. (19) 
n-l 


To determine the radius of convergence of the series in equation (19), we use the ratio test: 

a, 41x"*! 2|x| 

— | = lim a 
noo (2n + 2)(2n + 3) 


for all x. Thus the series converges for all x. 


lim 


noo 


anx" 


1 
Corresponding to the second root r = r} = y we proceed similarly. From equation (17) we 


have 
an = Shed = ied , n>. 
2n (n — 3) n(2n — 1) 
Hence 
ao 
a, = E 
ai ao 
MO Pee Eee Ee) 
ay ao 
a= = 


3e» (1-2-3)(1-3-5)’ 


and, in general, 


- (—1)" 
m= ise S n>4. (20) 


Just as in the case of the first root rı, we multiply the numerator and denominator by 
2-4-6. 2n = 2"n!. Then we have 
(—1)"2" 


a Car Y n> 1. 


Again omitting the constant multiplier ao, we obtain the second solution 


oo 

(—1)"2" » 

yo(x) =x"? 1-27 “Dat p s99 (21) 
n=1 


5.5 Series Solutions Near a Regular Singular Point, Part! 223 


As before, we can show that the series in equation (21) converges for all x. Since y; and y? behave like 
x and x!/?, respectively, near x = 0, they are linearly independent and so they form a fundamental 
set of solutions. Hence the general solution of equation (8) is 


y= cyx) +c2y2(x), x > 0. 


The preceding example illustrates that if x = 0 is a regular singular point, then sometimes 
there are two solutions of the form (7) in the neighborhood of this point. Similarly, if there is 
a regular singular point at x = xo, then there may be two solutions of the form 


y = (x — xo)” V an(x — xo)" Q2) 


n=0 


that are valid near x = xo. However, just as an Euler equation may not have two solutions of 
the form y = x”, so a more general equation with a regular singular point may not have two 
solutions of the form (7) or (22). In particular, we show in the next section that if the roots 
rı and r of the indicial equation are equal or differ by an integer, then the second solution 
normally has a more complicated structure. In all cases, though, it is possible to find at least 
one solution of the form (7) or (22); if rı and rz differ by an integer, this solution corresponds 
to the larger value of r. If there is only one such solution, then the second solution involves a 
logarithmic term, just as for the Euler equation when the roots of the characteristic equation are 
equal. The method of reduction of order or some other procedure can be invoked to determine 
the second solution in such cases. This is discussed in Sections 5.6 and 5.7. 

If the roots of the indicial equation are complex, then they cannot be equal or differ by an 
integer, so there are always two solutions of the form (7) or (22). Of course, these solutions 
are complex-valued functions of x. However, as for the Euler equation, it is possible to obtain 
real-valued solutions by taking the real and imaginary parts of the complex solutions. 

Finally, we mention a practical point. If P, Q, and R are polynomials, it is often much 
better to work directly with equation (1) than with equation (3). This avoids the necessity 
of expressing x Q(x) / P(x) and x? R(x) / P(x) as power series. For example, it is more 
convenient to consider the equation 


x(14- x)y" -2y' - xy 20 


than to write it in the form 


2, / x 
0, 
xy + EF yr TE y 
. ; "E E* 
which would entail expanding ——— and in power series. 
lx 1+ 


Problems 


In each of Problems 1 through 6: 
a. Show that the given differential equation has a regular 
singular point at x = 0. 
b. Determine the indicial equation, the recurrence relation, and 
the roots of the indicial equation. 
C. Find the series solution (x > 0) corresponding to the larger 
root. 
d. If the roots are unequal and do not differ by an integer, find 
the series solution corresponding to the smaller root also. 


2xy" + y'- xy 20 
x2y" 4 xy! + (2- ps 


xy" p y-0 

xy" Ly —y=0 

xy" E xy! (x -2)y 20 
xy” +(1—x)y’-y=0 


Dep P H 
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7. The Legendre equation of order a is 


(1 — x2) y" —2xy’+a(a+ly=0. 


The solution of this equation near the ordinary point x — 0 was 
discussed in Problems 17 and 18 of Section 5.3. In Example 4 of 
Section 5.4, it was shown that x = +1 are regular singular points. 
a. Determine the indicial equation and its roots for the point 
=r; 
b. Find a series solution in powers of x — 1 for x — 1 > 0. 
Hint: Write 1 + x = 2 + (x — 1) and x = 1 + (x — 1). 
Alternatively, make the change of variable x — 1 = ¢ and 
determine a series solution in powers of f. 


8. The Chebyshev equation is 


(1—x 2) y^ —xy' ta 2y = = 0, 

where o is a constant; see Problem 8 of Section 5.3. 
a. Show that x = 1 and x = —1 are regular singular points, and 
find the exponents at each of these singularities. 
b. Find two solutions about x = 1. 


9. The Laguerre?? differential equation is 


xy" +(1—x)y’+Ay =0. 


a. Show that x = 0 is a regular singular point. 

b. Determine the indicial equation, its roots, and the recurrence 
relation. 

c. Find one solution (for x > 0). Show that if A =m, a 
positive integer, this solution reduces to a polynomial. When 
properly normalized, this polynomial is known as the Laguerre 
polynomial, L,,(x). 


10. The Bessel equation of order zero is 


x?y" + xy! +x7y 2 0. 


Edmond Nicolas Laguerre (1834-1886), a French geometer and analyst, 
studied the polynomials named for him about 1879. He is also known for an 
algorithm for calculating roots of polynomial equations. 


a. Show that x = 0 is a regular singular point. 
b. Show that the roots of the indicial equation are r4 = r = 0. 
C. Show that one solution for x > 0 is 


(-1)"x 2n 


Jo(x) = 1 D 221(n)2 ' 


The function Jo is known as the Bessel function of the first kind 
of order zero. 
d. Show that the series for Jo(x) converges for all x. 


11. Referring to Problem 10, use the method of reduction of order 
to show that the second solution of the Bessel equation of order zero 
contains a logarithmic term. 

Hint: If yo(x) = Jo(x) v(x), then 


(x) = Jo( f i 
Y2 X) = JolX = 
x(Jo(x))” 


Find the first term in the series expansion of = 
x (Jo(x) ) 


12. The Bessel equation of order one is 
x2y" + xy! +(x? 2 ly =0. 


a. Show that x = 0 is a regular singular point. 

b. Show that the roots of the indicial equation are rı 
N= 1. 

C. Show that one solution for x > 0 is 


= | and 


= Dx 2n 


< ( 
Ji(x) = 22 G ena" 


The function J; is known as the Bessel function of the first kind 
of order one. 

d. Show that the series for Jı (x) converges for all x. 

e. Show that it is impossible to determine a second solution of 
the form 


oo 
x! J bax", x90 


n=0 
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Now let us consider the general problem of determining a solution of the equation 


Lp] = x?" + x(xp(x))y' + (i409) = 0, D 
where 
xp(x) 2 3 pax", xq) = Y qx", 2) 
n=0 n=0 


and both series converge in an interval |x| < p for some p > 0. The point x = 0 is a regular 
singular point, and the corresponding Euler equation is 


x?y" + poxy' + qoy = 0. (3) 


5.6 Series Solutions Near a Regular Singular Point, Part II 


We seek a solution of equation (1) for x > 0 and assume that it has the form 


oo oo 


y =¢(r,x) =x" ax =) aa (4) 


n=0 n=0 


where ao + 0, and we have written y = ¢ (r, x) to emphasize that ¢ depends on r as well as 
x. It follows that 


oo 


oo 
y= Sort maartm, y"= XO Otn tn = hga m (5) 
n=0 n=0 


Then, substituting from equations (2), (4), and (5) in equation (1) gives 
Lid lr. x) = aọr(r — 1)x" patr + Drx' t Ear mir +n = 1x H 
+ (Po pix +--+ pax” +++) (ars ar Dx sse ad(r 4 n)x"" H ) 


+ (qo + aix qux" e ) (aox day sees aya t pe ) 
— 0. 
Multiplying the infinite series together and then collecting terms, we obtain 
LII, x) = agF(r)x" + [a Fr +1) + alpi + 40]x^** 
+ [Fr +2) + ag( por +q) + (pir +1) +41) Ja"? 
vocc [anF(r +n) + aol Par + qi) + ai (puC + D + quai) 
ecran +n D qi) 0 
or, in a more compact form, 


L[ó] = aoF(r)x* 


oo n—l 
ML FO masc aC Dp qi) x7" = 0, (6) 
n-l k=0 
where 
F(r) =r(r — 1) + por + qo. (7) 


For equation (6) to be satisfied for all x > 0, the coefficient of each power of x must be zero. 

Since dy + 0, the term involving x" yields the equation F(r) = 0. This equation is 
called the indicial equation; note that it is exactly the equation we would obtain in looking for 
solutions y — x" of the Euler equation (3). Let us denote the roots of the indicial equation by 
rı andr with r; > r» if the roots are real. If the roots are complex, the designation of the roots 
is immaterial. Only for these values of r can we expect to find solutions of equation (1) of the 
form (4). The roots rı and rz are called the exponents at the singularity; they determine the 
qualitative nature of the solution in the neighborhood of the singular point. 

Setting the coefficient of x" *" in equation (6) equal to zero gives the recurrence relation 


n—1 


F(r  n)a, +X a(r +k) prr + dax) =0, nz 1. (8) 
k=0 


Equation (8) shows that, in general, a, depends on the value of r and all the 


preceding coefficients ao, 41, ... ,@,_1. It also shows that we can successively compute 
di, 02, ... , dg, ... in terms of dg and the coefficients in the series for xp(x) and x?q(x), 
provided that F(r +1), F(r +2), ..., F(r +n), ... are not zero. The only values of r for 


which F (r) = 0 are r = r; andr = r3; since r; > m, it follows that rı + n is not equal to rı 
or r forn > 1. Consequently, F(r; +n) 4 0 forn > 1. Hence we can always determine one 
solution of equation (1) in the form (4), namely, 


oo 
»G =x"(14+ $0 a(r)x"|, x> o. (9) 
n-l 
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Here we have introduced the notation a,(ri) to indicate that a, has been determined from 
equation (8) with r = rı. The solution involves an arbitrary constant; the solution in equation 
(9) is obtained by assigning aj the value 1. 

If rz is not equal to rı, and rı — rz is not a positive integer, then r2 + n is not equal to rı 
for any value of n > 1; hence F(r2 +n) z 0, and we can also obtain a second solution 


yo(x) =x” | 1+ 5 anl(r)x” |, x>0. (10) 


n=1 


Just as for the series solutions about ordinary points discussed in Section 5.3, the series in 
equations (9) and (10) converge at least in the interval |x| « p where the series for both xp(x) 
[n9] 


and x?q(x) converge. Within their radii of convergence, the power series 1 + $5 a,(ri) x" 
oo n=1 

and 1 + $7 a,(r2)x" define functions that are analytic at x = 0. Thus the singular behavior, 
n=1 

if there is any, of the solutions y; and y» is due to the factors x" and x”? that multiply these 

two analytic functions. 

Next, to obtain real-valued solutions for x < 0, we can make the substitution x = —& 
with € > 0. As we might expect from our discussion of the Euler equation, it turns out that we 
need only replace x” in equation (9) and x’? in equation (10) by |x|"! and |x|”, respectively. 

Finally, note that if r; and ry are complex numbers, then they are necessarily complex 
conjugates and r2 Z rı + N for any positive integer N. Thus, in this case we can always find 
two series solutions of the form (4); however, they are complex-valued functions of x. Real- 
valued solutions can be obtained by taking the real and imaginary parts of the complex-valued 
solutions. 

The exceptional cases in which r; = r2 or rı — r2 = N, where N is a positive integer, 
require more discussion and will be considered later in this section. 

It is important to realize that rı and rz, the exponents at the singular point, are easy to find 
and that they determine the qualitative behavior of the solutions. To calculate rı and rg, it is 
only necessary to solve the quadratic indicial equation 


r(r — 1) + por 4- qo = 0, (11) 
whose coefficients are given by 
po = lim xp(x), go = lim x^g(x). (12) 
x0 x0 


Note that these are exactly the limits that must be evaluated in order to classify the singularity 
as a regular singular point; thus they have usually been determined at an earlier stage of the 
investigation. 

Further, if x = 0 is a regular singular point of the equation 


P(x)y" + QG)y' + R(x)y = 0, (13) 
where the functions P, Q, and R are polynomials, then xp(x) = x Q(x) / P(x) and x?q(x) = 
x?R(x) / P(x). Thus 
Q(x) ,R(x) 


Po = lim x , go=limx : 
x0 P(x) x0 P(x) 


Finally, the radii of convergence for the series in equations (9) and (10) are at least equal to the 
distance from the origin to the nearest zero of P other than the regular singular point x = 0 
itself. 


(14) 


EXAMPLE 1 


Discuss the nature of the solutions of the equation 


2x(1+x)y"+(34+x)y - xy 20 


near the singular points. 
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Solution: 
This equation is of the form (13) with P(x) = 2x(1+ x), Q(x) = 3 + x, and R(x) = —x. The 
points x = 0 and x = —1 are the only singular points. The point x = 0 is a regular singular point, 
since 
Q(x) : 34x 3 
lim x = lim x —————— = -,, 
yoo P(X) 449 2x(1 +x) 2 
R = 
lim x? (0) mail—— =0 
zo PO) so ED 


3 3 
Further, from equation (14), py = 2 and qo = 0. Thus the indicial equation is r(r — 1) + ge 0, 


and the roots are rj = 0, r; = — Y Since these roots are not equal and do not differ by an integer, 


there are two solutions of the form 


oo oo 
x6) = 1+ Y Ta (005^ and ya) = 7? {1+ Y a (=3) 2" 


n=1 n=1 


for < |x| < p. A lower bound for the radius of convergence of each series is 1, the distance from 


x = 0 tox = —1, the other zero of P(x). Note that the solution y, is bounded as x — 0, indeed is 
analytic there, and that the second solution y» is unbounded as x — 0. 
The point x = —1 is also a regular singular point, since 
1)(3 
imtir Bee ig Og. 
x—-1 P(x) x—-1 2x(1 + x) 


R(x) _ Œ +D?) " 


lim (x + 1)? — lim 
rol P(x) oa A +x) 
In this case pọ = —1, go = 0, so the indicial equation is r(r — 1) — r = 0. The roots of the indicial 


equation are rj = 2 and r2 = 0. Corresponding to the larger root there is a solution of the form 


»G) 2G D^| 14S) a (2G + 1" 


n-l 


The series converges at least for |x + 1| < 1, and y; is an analytic function there. Since the two roots 
differ by a positive integer, there may or may not be a second solution of the form 


y) =1+ M ^a (0)(x +1)". 


n-l 


We cannot say more without further analysis. 

Observe that no complicated calculations were required to discover the information about the 
solutions presented in this example. All that was needed was to evaluate a few limits and solve two 
quadratic equations. 


We now consider the cases in which the roots of the indicial equation are equal or differ 
by a positive integer, rı — r2 = N. As we have shown earlier, there is always one solution of 
the form (9) corresponding to the larger root rı of the indicial equation. By analogy with the 
Euler equation, we might expect that if rj = r2, then the second solution contains a logarithmic 
term. This may also be true if the roots differ by an integer. 


Equal Roots. The method of finding the second solution is essentially the same as the one 
we used in finding the second solution of the Euler equation (see Section 5.4) when the roots 
of the indicial equation were equal. We consider r to be a continuous variable and determine 
an as a function of r by solving the recurrence relation (8). For this choice of a(r) for n > 1, 
the terms in equation (6) involving x^*!, x’+?, x"+3, ... all have coefficients equal to zero. 
Therefore, since r; is a repeated root of F(r), equation (6) reduces to 


L[ó](r, x) = agF(r)x' = a(r — r1)°x". (15) 
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Setting r = rı in equation (15), we find that L[@](71, x) = 0; hence, as we already know, 
y1(x) given by equation (9) is one solution of equation (1). But more important, it also follows 
from equation (15), just as for the Euler equation, that 


L P4 (rj,x) = ao xc — n») | 


r-— 


= a (( = r5 mx e 20 - r0") — 0. (16) 


r=r, 
Hence, a second solution of equation (1) is 
Oó(r, x) 0 


y(x) = ES uS =; x” ar aa 


r=r, r=r, 


oo oo 
= (x" Inx) | ao + X a(r)" +x” 9 al(ry)x" 


n=1 n=1 


oo 
= y(x) Inx + x^! Xar", x » 0, (17) 


n=1 


dan 


where aj (r1) denotes evaluated at r = ri. 


Although equation (17) provides an explicit expression for a second solution y2(x), it may 
turn out that it is difficult to determine a,(r) as a function of r from the recurrence relation 
(8) and then to differentiate the resulting expression with respect to r. An alternative is simply 
to assume that y has the form of equation (17). That is, assume that 

oo 
y = yi (x) In x + x"! 3 bax", x0, (18) 


n=1 


where y;(x) has already been found. The coefficients b, are calculated, as usual, by 
substituting into the differential equation, collecting terms, and setting the coefficient of each 
power of x equal to zero. A third possibility is to use the method of reduction of order to find 
y»(x) once yı(x) is known. 


Roots r, and r; Differing by an Integer N. For this case the derivation of the second 
solution is considerably more complicated and will not be given here. The form of this solution 
is stated in equation (24) in the following theorem. The coefficients c,(72) in equation (24) 
are given by 


d 
€n(r2) = zC —12)a,(r)] , n= 142; ESTE. (19) 


rr 


where a,,(r) is determined from the recurrence relation (8) with ao = 1. Further, the coefficient 
a in equation (24) is 

a= lim(r — r;3)ay(r). (20) 
If ay(72) is finite, then a = 0 and there is no logarithmic term in y2. A full derivation of 
formulas (19) and (20) may be found in Coddington (Chapter 4). 

In practice, the best way to determine whether a is zero in the second solution is simply to 
try to compute the a, corresponding to the root rz and to see whether it is possible to determine 
ay (12). If so, there is no further problem. If not, we must use the form (24) with a + 0. 

When r; — r2 = N, there are again three ways to find a second solution. First, we can 
calculate a and c, (r2?) directly by substituting the expression (24) for y in equation (1). Second, 
we can calculate c,(r5) and a of equation (24) using the formulas (19) and (20). If this is the 
planned procedure, then in calculating the solution corresponding to r = rı, be sure to obtain 
the general formula for a, (r) rather than just a; (r;). The third alternative is to use the method 
of reduction of order. 

The following theorem summarizes the results that we have obtained in this section. 
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Theorem 5.6.1 


Consider the differential equation (1) 
x*y" + x(xpQxo) y + (x7q(x))y = 0, 


where x = 0 is a regular singular point. Then xp(x) and x?q(x) are analytic at x = 0 with convergent 
power series expansions 


oo oo 
xp(x) = So pax, xt qi) = M oai 
n=0 n=0 
for |x| < p, where p > 0 is the minimum of the radii of convergence of the power series for xp(x) 
and x?q(x). Let rı and r; be the roots of the indicial equation 
F(r) — r(r — 1) + por 4 qo = 0, 


with rı > r ifr; and r3 are real. Then in either the interval —p < x < Oorthe interval 0 < x < p, 
there exists a solution of the form 


yx) = i^ | 1+ 3 a0" |. QU 


n=1 


where the a„(r1) are given by the recurrence relation (8) with ay = 1 andr = r4. 


CASE 1 If rj — r is not zero or a positive integer, then in either the interval —p < x < O or the 
interval 0 < x < p, there exists a second solution of the form 


yo(x) = I? | 1$ Laos" J. Q2) 


n-l 


The a, (r2) are also determined by the recurrence relation (8) with ag = 1 and r = r2. The 
power series in equations (21) and (22) converge at least for |x| < p. 


CASE2 Ifr, = r, then the second solution is 


oo 
yo(x) = y) In fal + xl! S T bur). (23) 
n=1 
CASE3 Ifr; — r = N, a positive integer, then 
oo 


yo(x) = ay Q0 In xl + Il? {1+ Y e 02a" |. Q4) 


n=1 


The coefficients a,(ri), b,(ri), and c,(r;) and the constant a can be determined by 
substituting the form of the series solutions for y in equation (1). The constant a may turn 
out to be zero, in which case there is no logarithmic term in the solution (24). Each of the 
series in equations (23) and (24) converges at least for |x| < p and defines a function that is 
analytic in some neighborhood of x — 0. 

In all three cases, the two solutions y;(x) and y2(x) form a fundamental set of solutions 
of the given differential equation. 


Problems 


In each of Problems 1 through 8: 6. x1—x)y" -(14-x)y' -2xy 20 
a. Find all the regular singular points of the given differential 7. (x —2)?*(x -2)y" + 2xy’ -3(x -2)y 20 
equation. 8. (4—x?)y" + 2xy' -3y 20 
b. Determine the indicial equation and the exponents at the Ip each of Problems 9 through 12: 
singularity for each regular singular point. a. Show that x = 0 is a regular singular point of the given 
l. xy" + 2xy'+6e*y =0 differential equation. 
2. xy" -x(24 x)y'- (24 x2))y 20 b. Find the exponents at the singular point x = 0. 
3. y"”+4xy’ +6y 20 c. Find the first three nonzero terms in each of two solutions 
4. 2x(x -2)y'- y -xy —0 (not multiples of each other) about x = 0. 
E xy" + 3G sin)y +y =0 So Ey opes 


10. xy’ +2xy’+6e*y =0 (see Problem 1) 
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11. xy”+y=0 
12. x?y"”+(sinx)y’ —(cosx)y = 0 
13. a. Show that 


1 
(In x) y" + DAE =0 


has a regular singular point at x = 1. 
b. Determine the roots of the indicial equation at x = 1. 


c. Determine the first three nonzero terms in the series 
oo 


SS an(x — 1)" corresponding to the larger root. 

n=0 

You can assume x — 1 > 0. 

d. What would you expect the radius of convergence of the 
series to be? 


14. In several problems in mathematical physics, it is necessary to 
study the differential equation 


x(L—x)y"+(y-(1+a+)x)y’-aBy=0, Q5 


where a, (3, and y are constants. This equation is known as the 
hypergeometric equation. 
a. Show that x — O is a regular singular point and that the roots 
of the indicial equation are 0 and 1 — y. 
b. Show that x = 1 is a regular singular point and that the roots 
of the indicial equation are 0 and y — a — f. 
C. Assuming that 1 — y is not a positive integer, show that, in 
the neighborhood of x — 0, one solution of equation (25) is 


ap ala +D T1) 5 


y(x) = 1 + ——x+ Xo fee, 

: 4-1! y(y +1)2! 
What would you expect the radius of convergence of this series 
to be? 


d. Assuming that 1 — y is not an integer or zero, show that a 
second solution for 0 < x < Lis 


(a-y+tD(8-y+)) 


yo(x) =x! ( + 


(2-91! 
(a =7 + D(a -y F208 3 *D(8-y*2 2., 
2-3)8-32! l 


e. Show that the point at infinity is a regular singular point and 
that the roots of the indicial equation are a and 5. See Problem 
32 of Section 5.4. 


15. Consider the differential equation 


LA 


xy"  oxy' + By =0, 
where a and ĝ are real constants and a Z 0. 

a. Show that x — O is an irregular singular point. 

oo 
b. By attempting to determine a solution of the form ` a,x" *", 
n=0 

show that the indicial equation for r is linear and that, 
consequently, there is only one formal solution of the assumed 
form. 
€. Show that if G/a = —1,0, 1,2, ... , then the formal series 
solution terminates and therefore is an actual solution. For other 
values of 3/a, show that the formal series solution has a zero 
radius of convergence and so does not represent an actual solution 
in any interval. 


16. Consider the differential equation 


H Q ri B 
ytoyt+sy=0, (26) 
x” X 


where a Æ 0 and 8 zz 0 are real numbers, and s and ¢ are positive 
integers that for the moment are arbitrary. 
a. Show that if s > lor: > 2, then the point x = O is an 
irregular singular point. 
b. Try to find a solution of equation (26) of the form 
oo 
p= mas mu (27) 
n=0 


Show that if s = 2 and t = 2, then there is only one possible 
value of r for which there is a formal solution of equation (26) of 
the form (27). 

€. Show that ifs = 1 and t = 3, then there are no solutions of 
equation (26) of the form (27). 

d. Show that the maximum values of s and t for which the 
indicial equation is quadratic in r [and hence we can hope to find 
two solutions of the form (27)] are s = 1 and t = 2. These are 
precisely the conditions that distinguish a “weak singularity,” or 
a regular singular point, from an irregular singular point, as we 
defined them in Section 5.4. 


As a note of caution, we point out that although it is sometimes 
possible to obtain a formal series solution of the form (27) at an 
irregular singular point, the series may not have a positive radius of 
convergence. See Problem 15 for an example. 


5.7 Bessel’s Equation 


In this section we illustrate the discussion in Section 5.6 by considering three special cases of 
equation, 


Bessel’s!* 


x*y' + xy +(x? —v’°)y =0, (1) 


V Friedrich Wilhelm Bessel (1784-1846) left school at the age of 14 to embark on a career in the import-export 
business but soon became interested in astronomy and mathematics. He was appointed director of the observatory 
at Kónigsberg in 1810 and held this position until his death. His study of planetary perturbations led him in 1824 to 
make the first systematic analysis of the solutions, known as Bessel functions, of equation (1). He is also famous for 
making, in 1838, the first accurate determination of the distance from the earth to a star. 


5.7 Bessel’s Equation 


where v is a constant. It is easy to show that x = 0 is a regular singular point of equation (1). 
We have 


1 Q0 
po = lim x = x-=1, 
x0 P(x) x0 X 
2 
qo — lim x? (x) = 1 xm a ep 
x>0 P(x) x—>0 7 


Thus the indicial equation is 


F(r) =r(r-l)+prt+q=err—-Dtr—-v=r—-v’?=0, 
1 
with the roots r = +v. We will consider the three cases v = 0, v = y and v = 1 for the 
interval x > 0. Bessel functions will reappear in Sections 11.4 and 11.5. 


Bessel Equation of Order Zero. In this case v = 0, so differential equation (1) reduces to 
L[y] = x?y" + xy’  x^y =0, (2) 


and the roots of the indicial equation are equal: rj = r2 = 0. Substituting 


oo 
y = ó(r,x) = aox" + M ax" (3) 
n=l 
in equation (2), we obtain 
oo oo 
Lio Mr, x) = So an((r t n)(r n — D 9 Gn) +S a 
n=0 n=0 


= a(r(r —1) tnx’ ta((rt+Drt+t(r4))x't! 


oo 
M (aHan DOn) cta)?" 20. A 
n=2 
As we have already noted, the roots of the indicial equation F(r) = r(r — 1) +r = 0 are 
rı = 0 and r; = 0. The recurrence relation is 
an—2( r) an—2(r) 


(rtn(rt+n—-D+(r+n) (r +n)?’ nz2 2 


as(r) = 


To determine y;(x), we set r equal to 0. Then, from equation (4), it follows that for the 
coefficient of x"+! to be zero we must choose a, = 0. Hence, from equation (5), a4 = a5 = 


a, =---=(0. Further, 
n—2(0 
MOPE HAO BER, 
n 
or, letting n = 2m, we obtain 
a2, —2(0) 
m(0) = —————, zl2 33.12 
a», (0) Qm? m 
Thus 
ao ao do 
a0 — —. a0) — x a0 — xU 
and, in general, 
( —1)" ag 
aos (0) = 22n(m Y)?" m= 123; T (6) 
Hence 
oo 
( — 1) t xm 
yi(x) = ao M 5 x > 0. (7) 
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The function in brackets is known as the Bessel function of the first kind of order zero and 
is denoted by Jo(x). It follows from Theorem 5.6.1 that the series converges for all x and that 
Jo is analytic at x = 0. Some of the important properties of Jy are discussed in the problems. 
Figure 5.7.1 shows the graphs of y = Jo(x) and some of the partial sums of the series (7). 


ya 
2— 


n=4 n-8 n-12 n=16 n-20 


Polynomial approximations to Jo(x), the Bessel 
function of the first kind of order zero. The value of n is the degree 
of the approximating polynomial. 


To determine y2(x) we will use equation (17) in Section 5.6. This requires that we 
calculate? a, (0). First we note from the coefficient of x^*! in differential equation (4) that 
(r + 12a, (r) 20. Thus a;(r) =0 for all r near r — 0. So not only does a,(0) =0 but also 
a (0) = 0. From the recurrence relation (5) it follows that 


a (0) =e) = erem 0) = +- =0; 
hence we need only compute à», (0), m —1,2,3, ... . From equation (5) we have 
ag -2(r) 
m =. AL Ex 
CI oup Tn 


By solving this recurrence relation, we obtain 


(r)2-—29 (r) = st 
OED T oo gar 
and, in general, 
=] m 
a(r) = = m > 3. (8) 


(r +2)2---(r + 2m)?’ 
The computation of a5, (r) can be carried out most conveniently by noting that if 


f(x) = (2 = af = a3) — as) (x — ey), 


and if x is not equal to à 1, &2, ... , Œn, then 
/ 
X n 
Fœ By B» — B 
f(x) X-Q, x-a? X—Qy 


Applying this result to a5,,(r) from equation (8), we find that 


au) L duci pode 
Gr) r+2 r4 r+2m )’ 
and setting r equal to 0, we obtain 


1 1 1 
a5, (0) = (; F 4 se Se 5; eso. 


Problem 9 outlines an alternative procedure, in which we simply substitute the form (23) of Section 5.6 in 
equation (2) and then determine the b,,. 


Substituting for a2,,(0) from equation (6), and letting 


1 1 1 
Hm =1 ^ p ES _> 9 
+ 2 F 3 deese eR (9) 
we obtain, finally, 
i ( a 1)"ao 
Ay,(0) = Hn may? m = 1,253 5 06s 
The second solution of the Bessel equation of order zero is found by setting ag = 1 and 


substituting for yı(x) and a5, (0) = b5,4,(0) in equation (23) of Section 5.6. We obtain 


oo 
CDH Hn on 
ya(x) = Jo(x) Inx + 5 Em a 


m=1 


, x>O0. (10) 


Instead of y2, the second solution is usually taken to be a certain linear combination of Jo 
and y2. It is known as the Bessel function of the second kind of order zero and is denoted 
by Yo. Following Copson (Chapter 12), we define!® 


2 
Yo(x) = —Dalx) + Cy = In 2) Jo(22]. (11) 
Here y is a constant known as the Euler—Mascheroni!’ constant; it is defined by the equation 
y = lim (A, — Inn) = 0.5772. (12) 
noo 


Substituting for y2(x) in equation (11), we obtain 


2 x oo (—D?*!H, s 
Yo(x) = = E + In 5 ) ste) px Q"(my ~ x X 0. (13) 


The general solution of the Bessel equation of order zero for x > 0 is 
y = ei Jo(x) + exYo(x). 


Note that Jo(x) — 1 as x — 0 and that Yo(x) has a logarithmic singularity at x = 0; 
that is, Yo(x) behaves as (2/7) lnx when x — 0 through positive values. Thus, if we are 
interested in solutions of Bessel's equation of order zero that are finite at the origin, which 
is often the case, we must discard Yo. The graphs of the functions Jọ and Yo are shown in 
Figure 5.7.2. 


The Bessel functions of order zero: 
y = Jo(x) (blue) and y = Yo(x) (red). 


lóOther authors use other definitions for Yo. The present choice for Yo is also known as the Weber function, after 
Heinrich Weber (1842-1913), who taught at several German universities. 

'TThe Euler- Máscheroni constant first appeared in 1734 in a paper by Euler. Lorenzo Máscheroni (1750-1800) was 
an Italian priest and professor at the University of Pavia. He correctly calculated the first 19 decimal places of y in 
1790. 


5.7 Bessel’s Equation 
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It is interesting to note from Figure 5.7.2 that for x large, both Jo(x) and Yo(x) are 
oscillatory. Such a behavior might be anticipated from the original equation; indeed it is 
true for the solutions of the Bessel equation of order v. If we divide equation (1) by x”, we 
obtain 


For x very large, it is reasonable to conjecture that the terms (1/x) y' and (v?/x7)y are 
small and hence can be neglected. If this is true, then the Bessel equation of order v can 
be approximated by 


y"+y=0. 


The solutions of this equation are sin x and cos x; thus we might anticipate that the solutions 
of Bessel’s equation for large x are similar to linear combinations of sin x and cos x. This is 
correct insofar as the Bessel functions are oscillatory; however, it is only partly correct. For 
x large the functions Jo and Yo also decay as x increases; thus the equation y" + y = 0 does 
not provide an adequate approximation to the Bessel equation for large x, and a more delicate 
analysis is required. In fact, it is possible to show that 


xd 1/2 T 
Jo(x) = | — cos| x — 4 as x — oo (14) 
TX 
«(Z) w(-1) 
Yo(x) = | — sin| x — — as x — oo. (15) 
TX 4 


These asymptotic approximations, as x — ov, are actually very good. For example, 
Figure 5.7.3 shows that the asymptotic approximation (14) to Jo(x) is reasonably accurate 
for all x > 1. Thus to approximate Jo(x) over the entire range from zero to infinity, you can 
use two or three terms of the series (7) for x < 1 and the asymptotic approximation (14) for 
xzl. 


and that 


AUAA Asymptotic approximation to Jo(x). 


Bessel Equation of Order One-Half. This case illustrates the situation in which the roots of 
the indicial equation differ by a positive integer but there is no logarithmic term in the second 
solution. Setting v = in equation (1) gives 


1 
L[y] = xy" + xy' + (2- i-o (16) 


When we substitute the series (3) for y = @(r, x), we obtain 


[o] 


L[ó]Cr. x) = »» +n)(r +n — 1) +(r+n) = Pje” D antt 


4 
n=0 n=0 


1 1 
= (e — zor + (o Bde rra 


oo 1 : 
e (eem iac) <0. (17) 


n=2 


2 


1 1 1 
The roots of the indicial equation r^ — 1 —O0arer; = 5 and r; = — y hence the roots differ 


by an integer. The recurrence relation is 


1 
(o +n)? — Bp = —a, 5» n2. (18) 


1 
Corresponding to the larger root r; = =, we find, from the coefficient of x^*! in equation (17), 


that a; = 0. Hence, from equation (18), a4 = a5 = +--+ = a5,,4 = ` +- = 0. Further, for r = 5 
ee: e Ae a, 
n(n + 1) 


or, letting n = 2m, we obtain 


A2m—2 


m= =s Par a eet 243 
e 2mm D' ” 
By solving this recurrence relation, we find that 
00 _ a 
a» = E 4 — LT 
and, in general, 
—1)” 
dali a ET E E us 
(2m + 1)! 
Hence, taking aọ = 1, we obtain 
oo oo 
—1)"x2" (—1)nyx2m*l 
2x21 CU age E > 0. 19 
odes toan B 2, Qmapr' * VN 


The second power series in equation (19) is precisely the Taylor series for sin x; hence one 
solution of the Bessel equation of order one-half is x !/? sin x. The Bessel function of the 
first kind of order one-half, J; /», is defined as (2/7) /?y;. Thus 


2 \ 1/2 
Jip(x) = (=) sinx, x >0. (20) 
TX 
Corresponding to theroot r; = — y itis possible that we may have difficulty in computing 
1 
a, since N = rı — r; = 1. However, from equation (17) for r = at the coefficients of x" 


and x'*! are both zero regardless of the choice of ay and a,. Hence ay and a, can be chosen 
arbitrarily. From the recurrence relation (18), we obtain a set of even-numbered coefficients 
corresponding to ao and a set of odd-numbered coefficients corresponding to a,. Thus no 


logarithmic term is needed to obtain a second solution in this case. It is left as an exercise to 
show that, for r = — 2 
(—1)"ao (-D'a 


> n+1 = mx LI12,:: 
Cnr e mni. ” 


an = 
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Hence 


(—1)"x 2n 1)"x 2n+1 


y(x) = x7! “(ar or (an)! +a Oe Qn 4 DI 


COS X sinx 
= a—, +a,—;, x>0. (21) 
X 


1/2 11/2’ 
The constant a, simply introduces a multiple of y;(x). The second solution of the Bessel 
equation of order one-half is usually taken to be the solution for which ay = (2/7) 1/2 and 
a, = 0. It is denoted by J_1/2. Then 


2 \ 1/2 
J-ija(x) = (=) cosx, x > 0. (22) 


The general solution of equation (16) is y = c1J1/5(x) + c2J-1/5(x). 

By comparing equations (20) and (22) with equations (14) and (15), we see that, except 
for a phase shift of 7/4, the functions J_, /2 and J;;; resemble Jo and Yo, respectively, for 
large x. The graphs of Jj/2 and J—1/2 are shown in Figure 5.7.4. 


idieti:2c The Bessel functions of order one-half: 
y = Jyj5(x) (blue) and y = J—1/2(x) (red). 


Bessel Equation of Order One. This case illustrates the situation in which the roots of the 
indicial equation differ by a positive integer and the second solution involves a logarithmic 
term. Setting v — 1 in equation (1) gives 


L[y] 2 x2y" + xy' + (x? — y 2 0. (23) 
If we substitute the series (3) for y = ¢ (r, x) and collect terms as in the preceding cases, we 
obtain 
LIAK x) = ako? = Dax" a (Gr D? = 1)! 
oo 
+ (om - 1)a, + apa)” =0. (24) 
n=2 
The roots of the indicial equation r? —] —0arer, = 1 andr) = —1. The recurrence relation 
is 
(+m)? -1)a( = -anar 0 22. 25) 


Corresponding to the larger root r = 1, the recurrence relation becomes 


An-—2 
ay, = ————, n=2,3,4,.... 
(n+2)n 


We also find, from the coefficient of x’+! in equation (24), that a, = 0; hence, from the 
recurrence relation, a4 = a5 = --- = 0. For even values of n, we can write n = 2m, where m 


is a positive integer; then 


2g —2 a2m—2 
Am = = n m i 2, 3; 
(2m + 2) (2m) 22(m + 1)m 
By solving this recurrence relation, we obtain 
—~1)"a 
am = a = 1:2; 3) 282 « (26) 
22^ (m + 1)!m! 


The Bessel function of the first kind of order one, denoted by J), is obtained by choosing 
dy = 1/2. Hence 


x oo (=1)" 4" 
J = 3 27 

(2) 2 2s nm + Dm on 
The series converges absolutely for all x, so the function J; is analytic everywhere. 

In determining a second solution of Bessel’s equation of order one, we illustrate the 
method of direct substitution. The calculation of the general term in equation (28) below 
is rather complicated, but the first few coefficients can be found fairly easily. According to 
Theorem 5.6.1, we assume that 


oo 
y(x) =aJı(x) lnx 4 x^! 1 cna" , x90. (28) 


n=1 
Computing y5(x) and yj(x), substituting in equation (23), and making use of the fact that J; 
is a solution of equation (23), we obtain 
oo oo 
2ax J{(x) F 5 ((n E 1)(n zx 2)Cn + (n A 1)cn a Ge F 3 gam = 0, (29) 
n=0 n=0 
where co = 1. Substituting for J;( x) from equation (27), shifting the indices of summation in 


the two series, and carrying out several steps of algebra, we arrive at 


oo 
—c + (0: c2 + co)x + >> (o? - Deng + e, on 
n=2 


oo 
—pn(2 1 2m+1 
- 245 5 )"(2m + Dx 


= 30 

EE 22" (m 4- 1)! m! e» 

From equation (30) we observe first that c; = 0, and a = —co = —1. Further, since there are 
only odd powers of x on the right, the coefficient of each even power of x on the left must be 
zero. Thus, since c; = 0, we have c3 = c5 = --- = 0. Corresponding to the odd powers of x, 


writing n = 2m +1 on the left-hand side of equation (30), we obtain the following recurrence 
relation: 


(-D"(2m + 1) 


Matin “see MD 


((2m +1)? — Leama + en = 


When we set m = | in equation (31), we obtain 


2 _ C903 
(3 1)c4 + c2 = 22.2V 
Notice that c; can be selected arbitrarily, and then this equation determines c4. Also notice 
that in the equation for the coefficient of x, c; appeared multiplied by 0, and that equation was 
used to determine a. That c3 is arbitrary is not surprising, since c is the coefficient of x in the 


oo 

expression x ! | 1 + 5 c,x" |. Consequently, c; simply generates a multiple of Jj, and y; 
n=1 

is determined only up to an additive multiple of Jı. In accordance with the usual practice, we 
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choose c; = 1/2”. Then we obtain 


E a er ee ee 
“4 = 3-242 = 243] 2 


EC 
-94.91 


(H + Hi). 


It is possible to show that the solution of the recurrence relation (31) is 


(-D"*(H, sr Hy) 
Cog — , m-L583 |. 
22nm (m — 1)! 


with the understanding that Hp = 0. Thus 


1 < (=1)"( Ain + Hai) 
yx) = —A(x)Inx - (1 5 Aae 


m=1 


x") ox > 0. (32) 


The calculation of y2(x) using the alternative procedure (see equations (19) and (20) 
of Section 5.6) in which we determine the c,(72) is slightly easier. In particular, the latter 
procedure yields the general formula for Cam without the necessity of solving a recurrence 
relation of the form (31) (see Problem 10). In this regard, you may also wish to compare 
the calculations of the second solution of Bessel’s equation of order zero in the text and in 
Problem 9. 

The second solution of equation (23), the Bessel function of the second kind of order one, 
Y,, is usually taken to be a certain linear combination of J, and y2. Following Copson (Chapter 
12), Y; is defined as 


2 
Yı(x) = =(=») + (y —1n2) Ji(x)). (33) 
where y is defined in equation (12). The general solution of equation (23) for x > 0 is 
y = c Ji(x) + eYi(x). 


Notice that although J, is analytic at x = 0, the second solution Y; becomes unbounded in the 
same manner as 1/x as x — 0. The graphs of J, and Y; are shown in Figure 5.7.5. 


JA: The Bessel functions of order one: 


y = J,(x) (blue) and y = Y;(x) (red). 


Problems 


In each of Problems 1 through 3, show that the given differential 
equation has a regular singular point at x = 0, and determine two 
solutions for x > 0. 


1. x?y"”4+2xy’+xy =0 

2. x?y"4+3xy'+(14+x)y =0 

3. x?y"”+xy'+2xy =0 

4. Find two solutions (not multiples of each other) of the Bessel 


3 
equation of order 2 


2 9 
xy" E xy + ( x? E y=0, x>0. 


5. Show that the Bessel equation of order one-half 
2." 1 
x y" + xy + dou y-0, x>0 


can be reduced to the equation 

vy" py 20 
-V?y(x). From this, 
-V?sinx are 


by the change of dependent variable y — x 
conclude that yj(x) = x-l/?cosx and y(x) =x 
solutions of the Bessel equation of order one-half. 


6. Show directly that the series for Jọ(x), equation (7), converges 

absolutely for all x. 
7. Show directly that the series for Jj (x), equation (27), converges 
absolutely for all x and that Jj(x) = —J,(x). 
8. Consider the Bessel equation of order v 
xy" +xy +(x? -—v?)y=0, x0, 


where v is real and positive. 
a. Show that x = 0 is a regular singular point and that the roots 
of the indicial equation are v and —v. 
b. Corresponding to the larger root v, show that one solution is 


2 4 
_ yy 1 ] i + : 
y(x) =x (i+v)\2 2Y(14-v)(2-v) \2 


x: (—1)" x 2m 
à zs EE oS UA 2 


C. If2v is not an integer, show that a second solution is 


— j 1 X 7 1 X T 
yo(x) =x W1—v)\2 * bi vv) 2 


oo (-1)” x 2m 
i 2 Gee Gat 2 


Note that y;(x) — 0 as x — 0, and that yo(x) is unbounded as 
x — 0. 
d. Verify by direct methods that the power series in the 
expressions for y;(x) and y2(x) converge absolutely for all x. 
Also verify that y» is a solution, provided only that v is not an 
integer. 
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9. Inthis section we showed that one solution of Bessel's equation 
of order zero 


L[y] = x2y" t xy’ o x?y 20 
is Jo, where Jo( x) is given by equation (7) with ag = 1. According to 
Theorem 5.6.1, a second solution has the form (x > 0) 
oo 
yo(x) = Jo(x) lnx + 5 bax”. 
n=1 


a. Show that 


oo oo 
L[yo](x) = 5 n(n — 1)byx" + So nbn x" 


n=2 n-l 


oo 
ba + 2x Q(x). (34) 


n-l 


b. Substituting the series representation for Jo(x) in equation 
(34), show that 


oo 
bix + 27 bax? + 3 (i? b, T by—2) x" 


25 | oh 
22^(n!)? 


n=l 


(35) 


c. Note that only even powers of x appear on the right-hand 


side of equation (35). Show that bj = b} = bs = --- = 0, 
1 
b, = ————— , and that 
? Pe 
2 ( EDn /— 
(2n) by, + boa 2 22n (n1) > MN PE A ES 


Deduce that 


The general solution of the recurrence relation is 
(— DH H, 
bo, = E Substituting for b, in the expression for 


y2(x), we obtain the solution given in equation (10). 


10. Find a second solution of Bessel's equation of order one by 
computing the c, (r5) and a of equation (24) of Section 5.6 according 
to the formulas (19) and (20) of that section. Some guidelines along 
the way of this calculation are the following. First, use equation (24) 
of this section to show that a,( —1) and aj (— 1) are 0. Then show that 


ci( —1) = 0 and, from the recurrence relation, that c,( —1) = 0 for 
n = 3,5, ... . Finally, use equation (25) to show that 
ao 
ar) = ————————, 
209 — ^E DU 3) 
ao 
a(r) = > 
G c DG 3G + 3)(r +5) 
and that 
(—1)" ag 
agr) = ,m > 3. 
(r+1)---(r+2m —1)(r +3) --- (7 +2m +1) 
Then show that 
zx m+1 H, +H 
me (Hn + Hn) m>1. 


2 mmYm-—1) —^' = 
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11. By a suitable change of variables it is sometimes possible to 
transform another differential equation into a Bessel equation. For 
example, show that a solution of 


TE 
xy" + (sine m e)s =0, x>0 


is given by y = x!/? f (ax^), where f(€) is a solution of the Bessel 
equation of order v. 


12. Using the result of Problem 11, show that the general solution 
of the Airy equation 


y'—xy-20, x>0 


2 2 
is y = x!? | c, fi jen) ten(Z0?)), where fi(£) and 


fo(€) are a fundamental set of solutions of the Bessel equation of order 
one-third. 


13. It can be shown that Jọ has infinitely many zeros for x > 0. 
In particular, the first three zeros are approximately 2.405, 5.520, and 
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Jo(Ajx) = T N ae 


Verify that y = Jo( A jx) satisfies the differential equation 
n 1 / 2 
y +-y t+ Ajy =0, x0. 
x 


Hence show that 
1 
| XxJo(Aix) Jo(Ajx)dx =0 if A; Z Aj. 
0 


This important property of Jo(A;x), which is known as the 
orthogonality property, is useful in solving boundary value 
problems. 
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CHAPTER6 


The Laplace Transtorm 


Many practical engineering problems involve mechanical or electrical systems acted on 
by discontinuous or impulsive forcing terms. For such problems the methods described in 
Chapter 3 are often rather awkward to use. Another method that is especially well suited 
to these problems, although useful much more generally, is based on the Laplace transform. 
In this chapter we describe how this important method works, emphasizing problems typical 
of those that arise in engineering applications. 


61 Definition of the Laplace Transform 


Improper Integrals. Since the Laplace transform involves an integral from zero to infinity, 
a knowledge of improper integrals of this type is necessary to appreciate the subsequent 
development of the properties of the transform. We provide a brief review of such improper 
integrals here. If you are already familiar with improper integrals, you may wish to skip over 
this review. On the other hand, if improper integrals are new to you, then you should probably 
consult a calculus book, where you will find many more details and examples. 

An improper integral over an unbounded interval is defined as a limit of integrals over 
finite intervals; thus 


oo A 
f f(t)dt = lim f(t)dt, (1) 
a A>o Ja 


where A is a positive real number. If the definite integral from a to A exists for each A > a, 
and if the limit of these values as A — oo exists, then the improper integral is said to converge 
to that limiting value. Otherwise the integral is said to diverge, or to fail to exist. The following 
examples illustrate both possibilities. 


EXAMPLE 1 


oo 
: ; dt- .. 
Does the improper integral J F3 diverge or converge? 
1 


Solution: 


From equation (1) we have 


oo t A t 
/ af — lim I ai = lim hA. 
1 t A>oo J1 t A— o0 


Since lim In A = œ, the improper integral diverges. 
A—o00 
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EXAMPLE 2 


oo 
Evaluate the improper integral f edt. For what values of c does this improper integral converge? 
0 


Solution: 


Suppose c is a real nonzero constant. Then 


oo A ct |^ 
e 
ct : ct H 
] cam] nmm E 
0 A>oo Jo A—oo € |o 
= lim -(e^ — 1) 
A>oo € 


It follows that the improper integral converges to the value —1/c if c < 0 and diverges if c > 0. If 
c = 0, the integrand e“ is the constant function with value 1. In this case 


A 
im f ldt = lim (A — 0) = oo, 
0 


A> A—oo 


so the integral again diverges. 


EXAMPLE 3 


oo 
Find all real numbers p for which the improper integral / t Pdt converges. For what values of p 
1 


does it diverge? 


Solution: 


Suppose that p is a real constant and p + 1; the case p = 1 was considered in Example 1. Then 


oo A 1 
f t Pdt = lim J t^"dt = lim ——(A!? — 1). 
1 A>oo J1 A> o0 1— p 


oo 
As A > oo, Al“? — 0 if p > 1, but AIP > oo if p « 1. Hence f t "dt converges to the 
1 


value 1/( p — 1) for p > 1 but (incorporating the result of Example 1) diverges for p < 1. These 
oo 


results are analogous to those for the infinite series X` n 7. 
n-l 


Before discussing the possible existence of L f (t)dt,itis helpful to define certain terms. 
A function f is said to be piecewise continuous on an interval a < t < 0 if the interval! can 
be partitioned by a finite number of points a = tọ < tı < +++ < t, = D so that 


1. f is continuous on each open subinterval tj. < t < ti. 


2. f approaches a finite limit as the endpoints of each subinterval are approached from within 
the subinterval. 


In other words, f is piecewise continuous on a < t < ĝ if it is continuous there except for a 
finite number of jump discontinuities. If f is piecewise continuous on a < t < f for every 
D > a, then f is said to be piecewise continuous on t > a. An example of a piecewise 
continuous function is shown in Figure 6.1.1. 

The integral of a piecewise continuous function on a finite interval is just the sum of the 
integrals on the subintervals created by the partition points. For instance, for the function f(t) 


! It is not essential that the interval be closed; the same definition applies if the interval is open at one or both ends. 
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| FIGURE 6.1.1 | A piecewise continuous function y = f(t). 


shown in Figure 6.1.1, we have 
B 


B ti t; 
roa = f roa s | f(ndt- fdt. (2) 
a a ti ty 
For the function shown in Figure 6.1.1, we have assigned values to the function at the endpoints 
o and B and at the partition points t; and t2. However, as far as the integrals in equation (2) are 
concerned, it does not matter whether f(t) is defined at these points, or what values may be 
assigned to f(t) atthem. The values of the integrals in equation (2) remain the same regardless. 

Thus, if f is piecewise continuous on the interval a < t < A, then få f(t)dt exists. 
Hence, if f is piecewise continuous for t > a, then få f(t)dt exists for each A > a. 
However, piecewise continuity is not enough to ensure convergence of the improper integral 
pn f (0 dt, as the preceding examples show. 

If f cannot be integrated easily in terms of elementary functions, the definition of 
convergence of jJ f(t)dt may be difficult to apply. Frequently, the most convenient way 
to test the convergence or divergence of an improper integral is by the following comparison 
theorem, which is analogous to a similar theorem for infinite series. 


Theorem 6.1.1 


If f is piecewise continuous for t > a, if | f(t)| < g(t) when t > M for some positive constant M, 
and if [s g(t)dt converges, then i f (t)dt also converges. 


On the other hand, if f(r) > g(t) > 0 fort > M, and if f% g(t)dt diverges, then f f(t)dt 
also diverges. 


The proof of these results from calculus will not be given here. They are made plausible, 
however, by comparing the areas represented by f Ns g(t)dt and f Ma | f(t) |dt. The functions 
most useful for comparison purposes are e“ and t^^, which we considered in Examples 1, 2, 
and 3. 


The Laplace Transform. Among the tools that are very useful for solving linear differential 
equations are integral transforms. An integral transform is a relation of the form 


B 
F(s) =) K(s, t) f(t) dt, (3) 


where K(s, t) is a given function, called the kernel of the transformation, and the limits of 
integration œ and ĝ are also given. It is possible that a = —oo, or 8 = co, or both. The 
relation (3) transforms the function f into another function F, which is called the transform 
of f. 

There are several integral transforms that are useful in applied mathematics, but in this 
chapter we consider only the Laplace? transform. This transform is defined in the following 


?The Laplace transform is named for the eminent French mathematician P. S. Laplace, who studied the relation (3) in 
1782. However, the techniques described in this chapter were not developed until a century or more later. We owe them 
mainly to Oliver Heaviside (1850—1925), an innovative self-taught English electrical engineer, who made significant 
contributions to the development and application of electromagnetic theory. He was also one of the developers of 
vector calculus. 
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way. Let f(t) be given for t > 0, and suppose that f satisfies certain conditions to be stated 
a little later. Then the Laplace transform of f, which we will denote by L{ f(t)} or by F(s), 
is defined by the equation 


L{ f(t)} = FG) =i e" f(r)dt, (4) 


whenever this improper integral converges. The Laplace transform makes use of the kernel 
K(s,t) = e^*.. Since the solutions of linear differential equations with constant coefficients 
are based on the exponential function, the Laplace transform is particularly useful for such 
equations. The general idea in using the Laplace transform to solve a differential equation is 
as follows: 


1. Use the relation (4) to transform an initial value problem for an unknown function f in the 
t-domain into a simpler problem (indeed, an algebraic problem) for F in the s-domain. 

2. Solve this algebraic problem to find F. 

3. Recover the desired function f from its transform F. This last step is known as “inverting 
the transform." 


In general, the parameter s may be complex, and the full power of the Laplace transform 
becomes available only when we regard F(s) as a function of a complex variable. However, 
for the problems discussed here, it is sufficient to consider only real values of s. 

The Laplace transform F of a function f exists if f satisfies certain conditions, such as 
those stated in the following theorem. 


Theorem 6.1.2 


Suppose that 


(i) f is piecewise continuous on the interval 0 < 1 < A for any positive A and 


(ii) there exist real constants K, a, and M, with K and M positive, such that 


f()| x Ke" whent > M. 


Then the Laplace transform £( f(t)} = F(s), defined by equation (4), exists for s > a. 


To establish this theorem, we must show that the integral in equation (4) converges for 
s > a. Splitting the improper integral into two parts, we have 


oo M oo 
f e" roi = | e" ro s | e" f(t)dt. (5) 
0 0 


M 


The first integral on the right-hand side of equation (5) exists by hypothesis (1) of the theorem; 
hence the existence of F(s) depends on the convergence of the second integral. By hypothesis 
(ii) we have, fort > M, 


le“ f(1)] < Ke "et -— Keo", 


and thus, by Theorem 6.1.1, F(s) exists provided that f% e(^^?'dt converges. Referring to 
Example 1 with c replaced by a — s, we see that this latter integral converges when a — s < 0, 
which establishes Theorem 6.1.2. 

In this chapter (except in Section 6.5), we deal almost exclusively with functions that 
satisfy the conditions of Theorem 6.1.2. Such functions are described as piecewise continuous 
and of exponential order as t — co. Note that there are functions that are not of exponential 
order as t — oo. One such function is f(t) = e^. Ast > oo, this function increases faster 
than Ke“ regardless of how large the constants K and a may be. 

The Laplace transforms of some important elementary functions are given in the following 
examples. 
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EXAMPLE 4 


Find L{1}. 


Solution: 

Let f(t) = 1, t > 0. Then, as in Example 2, 
oo e A 

L{1} -| e "dt = — lim =; $20. 

0 


A—oo S 0 s 


EXAMPLE 5 


Find £e } . 


Solution: 


Let f(t) =e“, t > 0. Then, again referring to Example 2, 


oo oo 
L{e"} = n e™ edt = f e SO dt 
0 0 


EXAMPLE 6 


Find the Laplace transform of the function graphed in Figure 6.1.2. 


ya 


—LL———M————— 
1 t 


| FIGURE 6.1.2 | Graph of the piecewise-defined function in Example 6. 


Solution: 
Let 
1, Oxt«l 
f= k, t=1, 
0, tl, 


where k is a constant. In engineering contexts f(t) often represents a unit pulse, perhaps of force or 


voltage. 
Note that f is a piecewise continuous function. Then 


oo 1 
eos f e roa f a= 
0 0 


Observe that L{ f(t) } does not depend on k, the function value at the point of discontinuity. Even if 
f(t) is not defined at this point, the Laplace transform of f remains the same. Thus there are many 
functions, differing only in their value at a single point, that have the same Laplace transform. 
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EXAMPLE 7 


Find £{sin(at) }. For what values of s is this transform defined? 


Solution: 
Let f(t) = sin(at), t > 0. Then 


L{sin(at)} = F(s) =] e "sin(at)dt, s> 0. 
0 


Since 


F(s) = lim 


A> o0 


A 
] mea 
0 
A A 
s —st 
e * cos(at)dt 
o € Jo 


oo 
f e™™ cos(at)dt. 
0 


A second integration by parts then yields 


upon integrating by parts, we obtain 


| 


S 


a 


e *' cos(at) 


F(s) lim 


a 


1 ye 
FS) = 2-4 e *' sin(at)dt 
a a? Jo 
1 s? F(s) 
= — — — F(s). 
a ae 
Now, solving for F(s), we have 
F(s) - 4— >0 
s) =——; 4 
s2 +a? 


In Problem 5 you will use a similar process to find L{cos(at)} = — fors > 0. Now 
s +a 


let us suppose that fı and f» are two functions whose Laplace transforms exist for s > a, and 
s > a5, respectively. Then, for s greater than the maximum of a, and a5, 


Lier fi(t) 9 eofa( 0] 


f e (cr fi(t) t eft) dt 


oo oo 

a | epode os | e fa(t)dt; 
0 0 

hence 


Lici fit) taf} = ect fi} + olf fro}. 


Equation (6) states that the Laplace transform is a linear operator, and we make frequent use 
of this property later. The sum in equation (6) can be readily extended to an arbitrary number 
of terms. 


(6) 


EXAMPLE 8 


Find the Laplace transform of f(t) = 5e~~! — 3sin(4t), t > 0. 


Solution: 


Using equation (6), we write 
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L{ f(t)} 2 5£(e ) — 3£{sin(4r) }. 


Then, from Examples 5 and 7, we obtain 


5 12 
L{ f(t)} = suy FONT. 


s>0 


Problems 


In each of Problems 1 through 3, sketch the graph of the given 
function. In each case determine whether f is continuous, piecewise 
continuous, or neither on the interval 0 < ¢ < 3. 


t, 0<zżt<1 
1. f(Ð)=4<2+t, 1<t<2 
6-t, 2<t<3 
t, 0<zż<1 
2 TW = (r=) 1l<t<2 
1, 2<t<3 
t?, 0<zż<1 
3. fi) =< 1, I< t2 
3-t, 2<t<3 
4. Find the Laplace transform of each of the following functions: 
a. f(t) 2t 
b. f(t) =? 


C. f(t) = t", where n is a positive integer 


5. Find the Laplace transform of f(t) = cos(at), where a is a real 
constant. 


Recall that 
1 1 
cosh(bt) = Y +e") and sinh(bt) = ue cet. 


In each of Problems 6 through 7, use the linearity of the Laplace 
transform to find the Laplace transform of the given function; a and b 
are real constants. 

6. f(t) = cosh(bt) 

7. f(t) = sinh(br) 
Recall that 
- e irj . 


I . l . 
cos(bt) = ye +e) and sin(bt) = 2,07 
l 


In each of Problems 8 through 11, use the linearity of the Laplace 
transform to find the Laplace transform of the given function; a and b 
are real constants. Assume that the necessary elementary integration 
formulas extend to this case. 


8. f(t) = sin(bt) 

9. f(t) = cos(bt) 
10. f(t) = e” sin(bt) 
11. f(t) = e” cos(bt) 


In each of Problems 12 through 15, use integration by parts to find the 
Laplace transform of the given function; n is a positive integer and a 
is a real constant. 


12. f(t) = te” 
13. f(t) = tsin(at) 
14. f) = t'e” 


15. f(t) = t? sin(at) 


In each of Problems 16 through 18, find the Laplace transform of the 
given function. 


1, 0Oxrt«m 
16. = n 
b. JO) ts T €t « oo 
t, Oxt«l 
17. = > 
FO f 1<t<o 
ü 0<t<l 
18. fi) =<2-t, 1<t<2 
0, 2<t< oo 


In each of Problems 19 through 21, determine whether the given 
integral converges or diverges. 


19. n aeth adr 
0 


20. i te ‘dt 
0 

21. f te'dt 
1 


22. Suppose that f and f’ are continuous for t > 0 and of 
exponential order as t — oo. Use integration by parts to show that 
if F(s) = L{ f(t)}, then lim F(s) = 0. The result is actually true 


SOO. 
under less restrictive conditions, such as those of Theorem 6.1.2. 


23. The Gamma Function. The gamma function is denoted by 
I'(p) and is defined by the integral 


(p+) 3 e *x?Pdx. (7) 
0 


The integral converges as x — oo for all p. For p « 0 it is also 
improper at x = 0, because the integrand becomes unbounded as 
x — 0. However, the integral can be shown to converge at x = 0 
for p > —1. 

a. Show that, for p 0, 

I(p +1) = pI(Cp). 
b. Show that '(1) = 1. 
C. If p is a positive integer n, show that 
T(n+1) =n!. 

Since I'( p) is also defined when p is not an integer, this function 

provides an extension of the factorial function to nonintegral 

values of the independent variable. Note that it is also consistent 

to define 0! = 1. 

d. Show that, for p > 0, 

Pp + D(p42)(pn-1)— 49. 
T(p) 

Thus I'( p) can be determined for all positive values of p if T (p) 
is known in a single interval of unit length—say, 0 < p < 1. It is 


1 = 3 11 
possible to show that (7) = vs Fr) mar (7); 
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24. Consider the Laplace transform of t”, where p > —1. It is possible to show that 
a. Referring to Problem 23, show that NM, Jr 
oo 1 oo n e^ dx = IE 
L{tP} = f edt = —— e *xPdx 0 
0 sPH Jy hence 
"o £g ym. wm 
(o sPHC : s 
b. Let p bea positive integer n in part a; show that d. Show that 
n! 1/2 Jn 
my = 
LEYS TE s>0. £r" 25 s>0. 


c. Show that 


£p? = A P erus 520 
x ] L 


62 Solution of Initial Value Problems 


In this section we show how the Laplace transform can be used to solve initial value problems 
for linear differential equations with constant coefficients. The usefulness of the Laplace 
transform for this purpose rests primarily on the fact that the transform of f’ is related in 
a simple way to the transform of f. The relationship is expressed in the following theorem. 


Theorem 6.2.1 


Suppose that f is continuous and f’ is piecewise continuous on any interval 0 < £ < A. Suppose 
further that there exist constants K, a, and M such that | f(t)| < Ke“ fort > M. Then L{ f'(1)) 
exists for s > a, and moreover, 


LFO = sL{ f()} — fO. Q0) 


To prove this theorem, we consider the integral 
A 
| e f'(t)dt, 
0 


whose limit as A — oo, if it exists, is the Laplace transform of f’. To calculate this limit we 
first need to write the integral in a suitable form. If f' has points of discontinuity in the interval 
0 x t <A, let them be denoted by ty, fo, ... , tj. Then we can write the integral as 


A f t; A 
| e" f'(t)dt = li e " f'(t)dt «f e" f'(t)dt +- «f e" f'(t)dt. 
0 0 


t tk 


Integrating each term on the right by parts yields 


A 
[ e^" fdt = e? fap e POP ep 


+s 


ti h A 
J e" for s [ e foa f eros] 
0 ti 


tk 


6.2 Solution of Initial Value Problems 


Since f is continuous, the contributions of the integrated terms at £4, t2, ... , tg cancel. Further, 
the integrals on the right-hand side can be combined into a single integral so that we obtain 


A 


A 
f e™ fode =e f(A) - 0) +5 [ e " f(t)dt. (2) 
0 0 


Now we let A — oo in equation (2). The integral on the right-hand side of this 
equation approaches £( f (1) ). Further, for A > M, we have | f( A)| < Ke^^; consequently, 
le^ f(A)| € Ke-6-9^, Hence e^ f(A) — 0 as A — oo whenever s > a. Thus the 
right-hand side of equation (2) has the limit sZ( f(t)} — f(0) . Consequently, the left-hand 
side of equation (2) also has a limit, and as noted above, this limit is C( f’(t)}. Therefore, for 
s > a, we conclude that 


LiF} = s£CÉCO) — FO), 
which completes the proof of Theorem 6.2.1. 

If f' and f" satisfy the same conditions that are imposed on f and f’, respectively, in 
Theorem 6.2.1, then it follows that the Laplace transform of f” also exists for s > a and is 
given by 

LF" = sL{ FO} — f£ (0) 
= s(sL{ f(t)} — f(0)) — f'(0) 
=s Lf} — sf(0) — f'(0). (3) 
Indeed, provided the function f and its derivatives satisfy suitable conditions, an expression 


for the transform of the n® derivative f™ can be derived by n successive applications of this 
theorem. The result is given in the following corollary. 


Corollary 6.2.2 


Suppose that the functions f, f’, ... , f ^ are continuous and that f? is piecewise continuous 
on any interval 0 < t < A. Suppose further that there exist constants K, a, and M such that 
|f(| € Ke", | f'()| € Ke”, ...,| f-P(0)| < Ke" fort > M. Then L{ f (1) exists 
for s > a and is given by 


L{ FPA} = s" EC f(t) } — 5" fco) —--- — sf 0) — fad). (4) 


We now show how the Laplace transform can be used to solve initial value problems. 
It is most useful for problems involving nonhomogeneous differential equations, as we 
will demonstrate in later sections of this chapter. However, we begin by looking at some 
homogeneous equations, which are a bit simpler. 


EXAMPLE 1 


Find the solution of the differential equation 
y"— y —2y=0 (5) 


that satisfies the initial conditions 
yx(0)=1, y(0)=0. (6) 


Solution: 
This initial value problem is easily solved by the methods of Section 3.1. The characteristic equation 
is 


r’—r—2=(r—2)(r+1)=0, 
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and consequently, the general solution of equation (5) is 


y = ce + cse. (7) 


To satisfy the initial conditions (6), we must have c, + c? = 1 and —c; + 2c? = 0; hence c; = 


Wildy 


1 
and c; = 3 so the solution of the initial value problem (4) and (5) is 


2 1 
y-2ó(n-2 "i + 3€ (8) 


Now let us solve the same problem using the Laplace transform. To do this, we must assume that 
the problem has a solution y = y(t), which with its first two derivatives satisfies the conditions of 
Corollary 6.2.2. Then, taking the Laplace transform of the differential equation (5), we obtain 


L{y" - y! — 2y} = £61 - Liy’) - 2£0) = 0, (9) 


where we have used the linearity of the Laplace transform to write the transform of a sum as the sum 
of the separate transforms. Upon using Corollary 6.2.2 to express C(y") and L{y’} in terms of L{y}, 
we find that equation (9) becomes 


s?2£(y) — sy(0) — y'(0) — (sL{y} — (0) — 2L{y} = 0, 


or 
(s? — s —2)Y(s) + (1 —s) (0) — y'(0) — 0, (10) 
where Y(s) = L{y}. Substituting for y(0) and y'(0) in equation (10) from the initial conditions (6), 
and then solving for Y (s), we obtain 
s—l s—l 


faber 2 ESI 


(11) 


We have thus obtained an expression for the Laplace transform Y (s) of the solution y(t) of the given 
initial value problem. To determine the solution, y(t), we must find the function whose Laplace 
transform is Y(s), as given in equation (11). 

This can be done most easily by expanding the right-hand side of equation (11) in partial 
fractions. Thus we write 


s—l o a à: b _ a(s +1) + b(s — 2) 
(s—2)(s-1 s—2 s-cl (s—2)(s+ 


where the coefficients a and b are to be determined. By equating numerators of the second and fourth 
members of equation (12), we obtain 


Y(s) = , (12) 


s—1l=a(s+1) +b(s —2), 


1 
an equation that must hold for all s. In particular, if we set s = 2 then it follows that a = 7 Similarly, 


2 
if we set s = —1 then we find that b = 7 Substituting these values for a and b into equation (12), 
we have 
1/3 2/3 
Y(s) 2 —- : 13 
(s) "me m FU (13) 


1 
Finally, if we use the result of Example 5 of Section 6.1, it follows that z” has the transform 


1 2 2 
36 — 2) -*; similarly, ge^ has the transform 36 + 1)7!. Hence, by the linearity of the Laplace 


transform, 


1, 2_, 
jj = 
y(t) 3° Tee 


has the transform (13) and is therefore the solution of the initial value problem (4), (5). Observe that 
it does satisfy the conditions of Corollary 6.2.2, as we assumed initially. Of course, this is the same 
solution that we obtained earlier. 


The same procedure can be applied to the general second-order linear equation with 
constant coefficients 


ay" + by + cy = f(t). (14) 


6.2 Solution of Initial Value Problems 


Assuming that the solution y(t) satisfies the conditions of Corollary 6.2.2 for n = 2, we can 
take the transform of equation (14) and thereby obtain 


a(s?¥(s) —sy(0) — y'(0)) + bYG) — y(O)) +e¥(8) = F(s), 3) 
where F(s) is the transform of f(t). By solving equation (15) for Y(s), we find that 
b j F 
Pues (as + D + ay'(0) : (s) l (16) 
as*+bs+c as*+bs+c 


The problem is then solved, provided that we can find the function y(t) whose transform is 
Y(s). 

Even at this early stage of our discussion we can point out four essential features of the 
Laplace transform method. In the first place, the transform Y(s) of the unknown function 
y(t) is found by solving an algebraic equation rather than a differential equation, algebraic 
equation (10) rather than differential equation (5) in Example 1, or in general algebraic 
equation (15) rather than differential equation (14). This is the key to the usefulness of Laplace 
transforms for solving linear, constant coefficient, ordinary differential equations —the 
problem is reduced from a differential equation to an algebraic one. Next, the solution 
satisfying given initial conditions is automatically found, so that the task of determining 
appropriate values for the arbitrary constants in the general solution does not arise. Further, as 
indicated in equation (15), nonhomogeneous equations are handled in exactly the same way 
as homogeneous ones; it is not necessary to solve the corresponding homogeneous equation 
first. Finally, the method can be applied in the same way to higher-order equations, as long 
as we assume that the solution satisfies the conditions of Corollary 6.2.2 for the appropriate 
value of n. 

Observe that the polynomial as? + bs + c in the denominator on the right-hand side of 
equation (16) is precisely the characteristic polynomial associated with equation (14). Since 
the use of a partial fraction expansion of Y(s) to determine y(t) requires us to factor this 
polynomial, the use of Laplace transforms does not avoid the necessity of finding roots of the 
characteristic equation. For equations of higher than second order, this may require a numerical 
approximation, particularly if the roots are irrational or complex. 

The main difficulty that occurs in solving initial value problems by Laplace transforms 
lies in the problem of determining the function y(t) corresponding to the transform Y(s). 
This problem is known as the inversion problem for the Laplace transform; y(t) is called the 
inverse Laplace transform corresponding to Y (5), and the process of finding y(t) from Y( s) 
is known as inverting the Laplace transform. We also use the notation £^! (Y(s)) to denote 
the inverse transform of Y (s). There is a general formula for the inverse Laplace transform, 
but its use requires a familiarity with functions of a complex variable, and we do not consider it 
in this book. However, it is still possible to develop many important properties of the Laplace 
transform, and to solve many interesting problems, without the use of complex variables. 

In solving the initial value problem (4), (5), we did not consider the question of whether 
there may be functions other than the one given by equation (8) that also have the transform 
(13). By Theorem 3.2.1 we know that the initial value problem has no other solutions. We also 
know that the unique solution (8) of the initial value problem is continuous. Consistent with this 
fact, it can be shown that if f and g are continuous functions with the same Laplace transform, 
then f and g must be identical. On the other hand, if f and g are only piecewise continuous, 
then they may differ at one or more points of discontinuity and yet have the same Laplace trans- 
form; see Example 6 in Section 6.1. This lack of uniqueness of the inverse Laplace transform 
for piecewise continuous functions is of no practical significance in applications. 

Thus there is essentially a one-to-one correspondence between functions and their Laplace 
transforms. This fact suggests the compilation of a table, such as Table 6.2.1, giving the 
Laplace transforms of functions frequently encountered, and vice versa. The entries in the 
second column of Table 6.2.1 are the Laplace transforms of those in the first column. Perhaps 
more important, the functions in the first column are the inverse Laplace transforms of those in 
the second column. Thus, for example, if the transform of the solution of a differential equation 
is known, the solution itself can often be found merely by looking it up in the table. Some of 
the entries in Table 6.2.1 have been used as examples, or appear as problems in Section 6.1, 
while others will be developed later in the chapter. The third column of the table indicates 
where the derivation of the given transforms may be found. Although Table 6.2.1 is sufficient 
for the examples and problems in this book, much larger tables are also available (see the list 
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TABLE 6.2.1 


10. 


11. 


LOL t<c 


F) =L {F(s)} 


t”, n a positive integer 


t?, p>-l 


sin(at) 


cos(at) 


. sinh(at) 


. cosh(at) 


. e” sin( bt) 


e“ cos(bt) 


t"e^'. n a positive integer 


tc 


- ust) f(t — c) 
. e? f(t) 
Eon 

e (fg) = fo ftt — 7) g(r) dr 
. ó(t—c) 


« fU 


(—t)" f(t) 


Elementary Laplace Transforms 


F(s) 2 £ (f(0) 


-, s»0 Sec. 6.1; Ex. 4 
5 
1 
;- «Od Sec. 6.1; Ex. 5 
s—a 
n! 
p 8 0 Sec. 6.1; Prob. 24 
rin 
E 1 
MU. aui Sec. 6.1; Prob. 24 
gPtl 
a 
s aS Secd6 AEN 
s? +a? 
> , s>0 Sec. 6.1; Prob. 5 
s2 + a? 
a 
ewe S> lal Sec. 6.1; Prob. 7 
s2—a 
€ s > |a| Sec. 6.1; Prob. 6 
s2—a 
5 5 2» Sec. 6.1; Prob. 10 
— . 6.1; Prob. 
(s — a)? +b? 
s—a 
—, s>a Sec. 6.1; Prob. 11 
(s — a)? +b? n 
n! 
————. 00 Sec. 6.1; Prob. 14 
(s — a)n*l 
es 
> X0 Sec. 6.3 
s 
e 5F(s) Sec. 6.3 
F(s —c) Sec. 6.3 
1 IS 
-F (2), e> Sec. 6.3; Prob. 17 
QW 
F(s)G(s) Sec. 6.6 
Qui ECHO: 
s" F(s) —s"-! f(0) —...— f"-U(0) Sec. 62; Cor. 6.2.2 
FO) (s) Sec. 6.2; Prob. 21 


of references at the end of the chapter). Transforms and inverse transforms can also be readily 
obtained electronically by using a computer algebra system. 
Frequently, a Laplace transform F(s) is expressible as a sum of several terms 


F(s) = Fi(s) + Fo(5) +--+ FC). 


(17) 


6.2 Solution of Initial Value Problems 


Suppose that fı(t) = LMF (s)}, ..., Sr) = L-'{F,(s)}. Then the function 
IO = fi) +--+ + fa 


has the Laplace transform F(s). By the uniqueness property stated previously, no other 
continuous function f has the same transform. Thus 


LOR (s)} = LON (s)} +e +L Fan(s) }; (18) 


that is, the inverse Laplace transform is also a linear operator. 

In many problems it is convenient to make use of the linearity property by decomposing 
a given transform into a sum of functions whose inverse transforms are already known or can 
be found in the table. Partial fraction expansions are particularly useful for this purpose, and a 
general result covering many cases is given in Problem 29. Other useful properties of Laplace 
transforms are derived later in this chapter. 

As further illustrations of the technique of solving initial value problems by means of the 
Laplace transform and partial fraction expansions, consider the following examples. 


EXAMPLE 2 


Find the solution of the differential equation 


y" + y = sin(2r) (19) 
satisfying the initial conditions 


y(0) =2, y(0) =1. (20) 


Solution: 


We assume that this initial value problem has a solution y(t), which with its first two derivatives 
satisfies the conditions of Corollary 6.2.2. Then, taking the Laplace transform of the differential 
equation (19), we have 


/ 2 
s?Y(s) — sy(0) — y'(0) + Y(s) = a 


, 


where the transform of sin(2t) has been obtained from line 5 of Table 6.2.1. Substituting for y(0) 
and y'(0) from the initial conditions (20) and solving for Y (s), we obtain 


25? +s? -- 8s +6 


Y(s) = ———————. 21 
eric uen Q1) 

Using partial fractions, we can write Y (s) in the form 
Y(s) = as+b cs+d _ (as+b)(s?+4) +(cs+d)(s? +1) l (22) 


~~ s24 1 5244 (s? + 1)(s? +4) 


By expanding the numerator on the right-hand side of equation (22) and equating it to the numerator 
in equation (21), we find that 


25? +s? + 8s 6 = (a - c)s? + (b - d)s? +(4a+c)s+(4b+d) (23) 
for all s. Then, comparing coefficients of like powers of s, we have? 
at+c=2, b+id=l, 
4a+c=8 4b+d=6. 


3We could find the value of the four coefficients by evaluating equation (23) for four different values of s, but, 
unlike Example 1, it is not obvious what four values of s will give trivial equations to solve for a, b, c, and d. 
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5 2 
Consequently, a = 2,c =0,b= y and d — — 3' from which it follows that 


YG) 2s E 5/3 2/3 (24) 
y= ; 
s2-1  s2?+1 5244 
From lines 5 and 6 of Table 6.2.1, the solution of the given initial value problem is 
I 1. 
Y= cost kasint — y sm2). (25) 


EXAMPLE 3 


Find the solution of the initial value problem 


y(9 — y —0, Q6) 


»x(0 —0, y(0)=1, y'(0 —0, y"(0)-O0. (27) 


Solution: 


In this problem we need to assume that the solution y(t) satisfies the conditions of Corollary 6.2.2 
for n = 4. The Laplace transform of the differential equation (26) is 


s^Y(s) — s°y(0) — s*y'(0) — sy"(0) — y"(0) — Y(s) = 0. 


Then, using the initial conditions (27) and solving for Y(s), we have 


2 


Y(s) = . 28 
(= 4— (28) 
A partial fraction expansion of Y(s) is 
as+b cs+d 
Y(s) = + ; 29 
(s) s —1 s +l (22) 
and it follows that 
(as + b)(s* +1) + (cs +d)(s? — 1) 2s? (30) 


for all s. In this problem we use a combination of substituting values of s and equating coefficients 
of like powers of s. First, setting s = 1 and s = — 1, respectively, in equation (30), we obtain the 
pair of equations 


Xa+b)=1, X—a+b)=1, 


1 1 
and therefore a = 0 and b = Y If we set s = 0 in equation (30), then b — d = 0, sod = y. Finally, 


equating the coefficients of the cubic terms on each side of equation (30), we find that a -- c — 0, so 
c = 0. Thus 


EE 4 19 G1) 
y= A 
s2—1 s +1 
and from lines 7 and 5 of Table 6.2.1, the solution of the initial value problem (26), (27) is 
1 
y(t) = 3 (sinh: + sinf). (32) 


We conclude by noting that we could have looked for a partial fraction expansion of Y(s) in 
the form 
a b cs +d 


Y(s) = + + 
6s) s—1 s+1 s?-c1 


We used the form in equation (29) because Table 6.2.1 includes entries for both 1/ (s? 1) and 
s/( +1). 
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The most important elementary applications of the Laplace transform are in the study of 
mechanical vibrations and in the analysis of electric circuits; the governing equations were 
derived in Section 3.7. A vibrating spring-mass system has the equation of motion 


a ae (33) 
m— — u = : 

dq 
where m is the mass, y the damping coefficient, k the spring constant, and F(t) the applied 
external force. The equation that describes an electric circuit containing an inductance L, a 


resistance R, and a capacitance C (an LRC circuit) is 
d? d 1 
,U9 , ,40 


aa tka *:97 EO. (34) 


where Q(t) is the charge on the capacitor and E(t) is the applied voltage. In terms of the 
current I(t) = d Q(t) /dt, we can differentiate equation (34) and write 


dI dl 1 dE 
L 


R I= t). 35 
ae aC a (33) 


Suitable initial conditions on u, Q, or J must also be prescribed. 

We have noted previously, in Section 3.7, that equation (33) for the spring-mass system 
and equations (34) or (35) for the electric circuit are identical mathematically, differing only 
in the interpretation of the constants and variables appearing in them. There are other physical 
problems that also lead to the same differential equation. Thus, once the mathematical problem 
is solved, its solution can be interpreted in terms of whichever corresponding physical problem 
is of immediate interest. 

In the problem lists following this and other sections in this chapter are numerous initial- 
value problems for second-order linear differential equations with constant coefficients. Many 
can be interpreted as models of particular physical systems, but usually we do not point this 
out explicitly. 


Problems 


In each of Problems 1 through 7, find the inverse Laplace transform 10. y”—2y'+2y=0; y(0)=0, y(0 21 


of the given m 11. y’—2y'+4y=0; y(0 —2, y'(0 —0 
l. F(s) = faa 12. y"+2y'+5y=0; y(0 22, y(0)—-1 
4 13. y(? — 4y" + 6y" —4y'+ y=0; y(0) =0, 
2. F(s)= TENE y(0) —1, y"(0) =0, y"(0 =1 
2 14. y9—y 20; y(0)=1, y(0) 20, y"(0) = 1, 
3. F(s)— "(0) =0 
(s) 2435 —4 y (0) 
2542 15. y"-w?y =cos(2t), w? Z4; y(0 =1, y'(0) 20 
4. F(s) — s2-c2s-45 16. y” —2y' -2y =e; y(0) = 0, y’(0) =1 
5 2s —3 In each of Problems 17 through 19, find the Laplace transform Y (s) = 
€ F(0— 5 4 L{y} of the solution of the given initial value problem. A method of 
TN 12 determining the inverse transform is developed in Section 6.3. You 
6. F(s)= i To s 5 may wish to refer to Problems 16 through 18 in Section 6.1. 
s^ 
1-25 17, y &yelb 95197 q.i yo 
7. F(s)= 24445 . i 0 t<t<@; 
s s 
t, O<t<1, 
In each of Problems 8 through 16, use the Laplace transform to solve 18. y’+4y= { x y(0) 20, y'(0 20 
: NEN t i l 1xt«oo; ^ 
the given initial value problem. 
t, 0xt«l, 


8. y"—y'—6y-20; y(0)=1, y(0 —-I 
9. y"+3y'+2y=0; y(0)=1, y(0)-0 


19. y"+y=<2-t, 1<t<2, y(0)=0, y(0) =0 
0, 2«t« oo 
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20. The Laplace transforms of certain functions can be found 
conveniently from their Taylor series expansions. 
a. Using the Taylor series for sin t 
oo 
. —1)” pent 
ina) e 


(2n +1)! 
n=0 


and assuming that the Laplace transform of this series can be 
computed term-by-term, verify that 


L{sint} = , >k 
(sint) Pai s 
b. Let 
sint TT 
fü)jes t’ ' 
1, t=0 


Show that f(t) is continuous for all real values of t. Find the 
Taylor series for f about f = 0. Assuming that the Laplace 
transform of this function can be computed term-by-term, verify 
that 


L{ f(t)} -acm( 1). sl. 


s 
C. The Bessel function of the first kind of order zero, Jo, has the 


Taylor series (see Section 5.7) 


is (=i): 
Jo(t) = 2- 22"(n!)2 i 
n=0 


Assuming that the following Laplace transforms can be computed 
term-by-term, verify that 


L{ J} = -D0 77, s»1 
and 
hihi y =se, a> 0: 


Problems 21 through 27 are concerned with differentiation of the 
Laplace transform. 


21. Let 
roy = | e™ f(t) dt. 
0 


It is possible to show that as long as f satisfies the conditions of 
Theorem 6.1.2, it is legitimate to differentiate under the integral sign 
with respect to the parameter s when s > a. 
a. Show that F'(s) = L{—tf(t)}. 
b. Show that FU? (s) = L{(—1)" f(t)}; hence differentiating 
the Laplace transform corresponds to multiplying the original 
function by —t. 


In each of Problems 22 through 25, use the result of Problem 21 to find 
the Laplace transform of the given function; a and b are real numbers 
and n is a positive integer. 


22. f(t)-te* 

23. f(t) = t? sin(bt) 
24. f(t) = t'e” 

25. f(t) = te“ sin(bt) 


26. Consider Bessel’s equation of order zero 
ty" +y'+ty =0. 


Recall from Section 5.7 that t = 0 is a regular singular point for this 
equation, and therefore solutions may become unbounded as t — 0. 
However, let us try to determine whether there are any solutions that 
remain finite at £ = 0 and have finite derivatives there. Assuming that 
there is such a solution y = o (t), let Y(s) = L{@(t)}. 

a. Show that Y(s) satisfies 


(1-7 52) Y'(s) -- sY(s) =0. 


b. Show that Y(s) = c(1 + s2)-1/2. where c is an arbitrary 
constant. 

c. Writing (1 + $2)-U/2 = g-l(1 +s)", expanding in 
a binomial series valid for s > 1, and assuming that it is 
permissible to take the inverse transform term-by-term, show that 


9o ( TEN D)" pn 
y= ey aD? cJo(t), 
n=0 
where Jp is the Bessel function of the first kind of order zero. Note 
that J9(0) = 1 and that Jọ has finite derivatives of all orders at 
t = 0. It was shown in Section 5.7 that the second solution of this 
equation becomes unbounded as t — 0. 


27. For each of the following initial value problems, use the results 
of Problem 21 to find the differential equation satisfied by Y(s) — 
L{y(t)}, where y(t) is the solution of the given initial value problem. 
y(0) 21, y'(0) 20 (Airy's equation) 
y(0) =0, y'(0) —1 


a. y" —ty 0; 
b. (1-2?) y’—2ty’+a(a+l) y = 0; 
(Legendre's equation) 


Note that the differential equation for Y(s) is of first-order in 
part a, but of second-order in part b. This is due to the fact 
that f appears at most to the first power in the equation of 
part a, whereas it appears to the second power in that of part b. 
This illustrates that the Laplace transform is not often useful in 
solving differential equations with variable coefficients, unless 
all the coefficients are at most linear functions of the independent 
variable. 


28. Suppose that 
t 
w= f f(r) dr. 
0 


If G(s) and F(s) are the Laplace transforms of g(t) and f(t), 
respectively, show that 


G(s) = Ln 


29. In this problem we show how a general partial fraction 
expansion can be used to calculate many inverse Laplace transforms. 
Suppose that 


P. 
Hys, 
Q(s) 
where Q(s) is a polynomial of degree n with n distinct zeros 
Tj, ... , Fn, and P(s) is a polynomial of degree less than n. In this case 


it is possible to show that P(s) / Q( s) has a partial fraction expansion 
of the form 


P(s) _ A1 An 
Qs sn ^ T 


where the coefficients A4, .. 


(36) 


> 
S—Ty 


. , A, must be determined. 
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a. Show that by s — ry because equation (36) is not defined at each root of 
Q(s). 
= Pon) k=1,...,n. b. Show that 
Ore) " 
= P(rx) 
1 
Hint: One way to do this is to multiply equation (36) by s — rg and LUTF(s)} = 2. Ory) ev. 
then to take the limit as s — rg. Note that limits are used because k=1 


it is not appropriate to simply evaluate equation (36) multiplied 
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In Section 6.2 we outlined the general procedure involved in solving initial value problems 
by means of the Laplace transform. Some of the most interesting elementary applications of 
the transform method occur in the solution of linear differential equations with discontinuous 
or impulsive forcing functions. Equations of this type frequently arise in the analysis of the 
flow of current in electric circuits or the vibrations of mechanical systems. In this section and 
the remaining sections in Chapter 6, we develop some additional properties of the Laplace 
transform that are useful in the solution of such problems. Unless a specific statement is made 
to the contrary, all functions appearing below will be assumed to be piecewise continuous and 
of exponential order, so that their Laplace transforms exist, at least for s sufficiently large. 

To deal effectively with functions having jump discontinuities, it is very helpful to 
introduce a function known as the unit step function or Heaviside function. This function 
will be denoted by uc and is defined by 


u(t) = n oe (1) 


l, tc. 


Since the Laplace transform involves values of t in the interval [0, oo), we are also interested 
only in nonnegative values of c. The graph of y = ue(t) is shown in Figure 6.3.1. We 
have somewhat arbitrarily assigned the value one to ue at t = c. However, for a piecewise 
continuous function such as u,, the value at a discontinuity point is usually irrelevant. The 
step can also be negative. For instance, Figure 6.3.2 shows the graph of y = 1 — u(t). 


| FIGURE 6.3.1 | Graph of y = u(t). | FIGURE 6.3.2 | Graph of y = 1—u,(t). 


If we associate the value | with “on” and 0 with “off,” then the function u,(t) represents 
a switch that is turned on at time c. Likewise, 1 — u,(t) represents a switch being turned off 
at time c. 


EXAMPLE 1 


Sketch the graph of y = h(t), where 


A(t) =u, (t) ug, (D, t» 0. 
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Solution: 


From the definition of u(t) in equation (1), we have 


0—0, Oxt«m, 
io - tt Se I e 1-0 «x <t<2n, 
1-1, 2m <t< oo, 
0, Qsim 
m<t< 2r, 
0, m <t< œ. 


Il 
an 


Thus the equation y = h(t) has the graph shown in Figure 6.3.3. This function can be thought of 
as a switch that is initially off, turned on at £ = m and then turned off at t = 27; this is also often 
referred to as a rectangular pulse. 


| FIGURE 6.3.3 | Graph of y = u; (t) — us; (t). 


EXAMPLE 2 


Consider the function 


2, 0xt«4A, 
5, 4<t<7, 
fü-4 1 Q) 
1, 


7<t< 49, 
t>9. 


Sketch the graph of y = f(t). Express f(t) in terms of ue(t). 


Solution: 


The graph of y = f(t) is piecewise constant. Paying attention to include the left endpoint of each 
horizontal segment, we arrive at Figure 6.3.4. 


A 


N 


| 


E — 


| FIGURE 6.3.4 | Graph of the function in equation (2). 


We start with the function f;(t) = 2, which agrees with f(t) on [0, 4). To produce the jump 
of three units at t = 4, we add 3u4(1) to f(t), obtaining 


fat) = 2+ 3u4(t), 


which agrees with f(t) on [0,7). The negative jump of six units at t+ = 7 corresponds to 
adding —6u7(t), which gives 


fa(t) = 2 + 3u4(t) — 6u7(t). 
Finally, we must add 2u(7) to match the jump of two units at t = 9. Thus we obtain 


F(t) =2 + 3u4(t) — 6u5(t) + 2ug(t). (3) 


The Laplace transform of ue for c > 0 is easily determined: 


L{u(t)} = l e "u(t)dt = [ edt 
0 c 


= y s> 0. (4) 
s 
Notice that 
e? 1 
L{uo(t)} = — = - = L{1}. 
s s 


This is true because uo(t) = 1 for all t > 0. 
For a given function f defined for t > 0, we will often want to consider the related 


function g defined by 
0, t « c, 
g(t) = { 


f(t-—c), tSc, 


which represents a translation of f a distance c in the positive t direction and is zero for t < c; 
see Figure 6.3.5. Making use of the unit step function, we can write g(t) in the convenient 
form 


g(t) — uc(t) f(t — c). 


f (0) 


(a) 
| FIGURE 6.3.5 | A translation of the given function. (a) y = f(t); (b) y = u-(t) f(t — c). 


The unit step function is particularly important in Laplace transform use because of the 
following relation between the transform of f (ft) and that of its translation u(t) f(t — c). 


Theorem 6.3.1 


If the Laplace transform of f(t), F(s) = L{ f(t)}, exists for s > a > Q, and if c is a positive 
constant, then 


Liu-(t) fa = c) 2e CL{ f(}=eCF(s), sa. (5) 
Conversely, if f(t) is the inverse Laplace transform of F(s), f(t) = £-! (F(s)), then 
u(t) f(t — c) = £7 Me F(s)}. (6) 
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Theorem 6.3.1 simply states that the translation of f(t) a distance c in the positive t 
direction corresponds to the multiplication of F(s) by e ^. To prove Theorem 6.3.1, it is 
sufficient to compute the transform of u.(1t) f(t — c): 


Lu,(t) f(t -o)- ] € fa - od 
0 


= 2 e "' f(t — c)dt. 


Introducing a new integration variable c = t — c, we have 
oo oo 
L{u-(t) f(t — c0) = | gite FCo) do =e f e" f(a) do 
0 0 


=e “F(s). 


Thus equation (5) is established; equation (6) follows by taking the inverse transform of both 
sides of equation (5). 

A simple example of this theorem occurs if we take f(t) = 1. Recalling that 
L{1} = 1/s, we immediately have from equation (5) that L{u.(t)} = e 5 /s. This result 
agrees with that of equation (4). Examples 3 and 4 illustrate further how Theorem 6.3.1 can 
be used in the calculation of Laplace transforms and inverse Laplace transforms. 


EXAMPLE 3 


Given the function f defined by 


sin f, 0st« -—, 


sketch the graph of y = f(t) on the interval 0 < t < 3. Find L{ f(1)}. 


Solution: 


The graph of y = f(t) is shown in Figure 6.3.6. 


1.5 


y =Sin £ + us, (t)cos(t — 4) 


| FIGURE 6.3.6 | Graph of the function in Example 3. 
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Note that f(t) = sint + g(t), where 


T 
4 T 
g(t) = ( z) m =  usya(t) os(: — 3! 
cosjt——], t» — 
4 4 
Thus 
L{ f(t)} = £[sini] Hef nad cos z 37 


= L{sint} + e "l^ P cost}. 


Recalling the Laplace transforms of sin t and cos t, we obtain 


1 pens s 1 + se775/4 
es = 


PU e s2? +1 s?4l 


You should compare this method with the calculation of £( f (t) } directly from the improper integrals 
in the definition of the Laplace transform. 


EXAMPLE 4 


Find the inverse Laplace transform of 
» 1—e-5 
(s) — 2 
Graph y = f(t). 


Solution: 


From the linearity of the inverse transform, we have 


—2s 
FO = £ FG) =e {ah -eift 2 j 
s sS 


ed gw e. 


To facilitate the graphing of y = f(t), it is helpful to write the function in a piecewise 
representation. Here that means 


0, 0.2 t, O<t< 2, 
t = f- NE = 
/"m { = f: t>2. 


2 4 6 t 


FIGURE 6.3.7 


The following theorem contains another very useful property of Laplace transforms that 
is somewhat analogous to that given in Theorem 6.3.1. 
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Problems 


Theorem 6.3.2 


If F(s) = L{ f(1)} exists for s > a > 0, and if c is a constant, then 
Lie" f(t)) 2 F(s-c), s>arte. (7) 
Conversely, if f(t) = £^! (F(s)), then 
e" f(t) = £C (FG - c). (8) 


According to Theorem 6.3.2, multiplication of f(t) by e^ results in a translation of the 
transform F(s) a distance c in the positive s direction, and conversely. To prove this theorem, 
we evaluate L{e“ f (t)}. Thus 


oo 


Lie" f(t)} = | e ‘te f(t)dt = n e 679! f(t)dt 
0 0 


= F(s—c), 


which is equation (7). The restriction s > a + c follows from the observation that, according 
to hypothesis (ii) of Theorem 6.1.2, | f(t)| < Ke“; hence Je“ f(1)| < Ke(**?'. Equation (8) 
is obtained by taking the inverse transform of equation (7), and the proof is complete. 

The principal application of Theorem 6.3.2 is in the evaluation of certain inverse 
transforms, as illustrated by Example 5. 


EXAMPLE 5 


Find the inverse Laplace transform of 


1 
G(s) = —————. 
(s) s? — 4s +5 
Solution: 


First, to avoid dealing with the complex-valued roots of the denominator s? — 4s + 5, we complete 
the square in the denominator: 


G(s) = F(s —2), 


(s—22+1 
where F(s) = (s? + 1)~!. Since £7'{F(s)} = sint, it follows from Theorem 6.3.2 that 


g(t) = £^ (G(s)) = e? sint. 


The results of this section are often useful in solving differential equations, particularly 
those that have discontinuous forcing functions. The next section is devoted to examples 
illustrating this point. 


In each of Problems 1 through 4, sketch the graph of the given function 2. g(t) = f(t —1)u4(t), where f(t) =t? 


on the interval t > 0. 
l. g(t) =uj(t) + 2u3(t) — 6u4(t) 


3. g(t) = f(t —3)u3(t), where f(t) = sint 
4. g(t) = (r — Dut) — Ut — 2)un(t) + (t — 3)u3(t) 


In each of Problems 5 through 8: 
a. Sketch the graph of the given function. 
b. Express f(t) in terms of the unit step function u-(t). 


0, O<t< 3, 
—2, 3<t<5, 
ia gerst 
1] tz]. 
1, Oxrt«l, 
—l 1zxtft«2, 
6. f()n- 1, 2<t< 3, 
—1, 3<t< 4, 
0, t>4. 
; 0<t< 2, 
A fo Ua t2. 
t, 0<t<2, 
2: 2-5, 
Sh HUC ue Bleep 
0, t>7. 


In each of Problems 9 through 12, find the Laplace transform of the 
given function. 


9. me t< 2 
(t—2)2, t>2 
0, tm 

10. f(n—-4t—-m, w<t<2r 
0, t> 2m 


11. f(t) = u(t) + 2u3(t) — 6u4(t) 
12. f(t) = (t —3)ux(t) — (t — 2)us(1) 


In each of Problems 13 through 16, find the inverse Laplace transform 
of the given function. 


13. F 2 
FO = Gao 
14 = 
. F(s) = ——~ 
(s) s?+s5—2 
2s — Des 
15. F(s) = ————_ 
2 s2—2s 42 
—s —2s _ p—3s _ p—4s 
16. FG) = Te e e 


s 
17. Suppose that F(s) = £( f(1)) exists for s > a > 0. 
a. Show that if c is a positive constant, then 


£n o P(E), so ca. 


5 
c 
b. Show that if k is a positive constant, then 
£7 (Fs) = H 
k” \k 
c. Show that ifa and b are constants with a > 0, then 


£^ F(as +b)} = zer p (2). 
a 


a 
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In each of Problems 18 through 20, use the results of Problem 17 to 
find the inverse Laplace transform of the given function. 


2n*ly! 
18. F(s) = — 
2. 1 
19. F(s) = -Stt 
4s? +4s +5 
1 
20. F(s) = ————— 
(5) 7 a 12:3 


In each of Problems 21 through 23, find the Laplace transform of the 
given function. In Problem 23, assume that term-by-term integration 
of the infinite series is permissible. 


1, O<t<l 
21. 5o - 45 ped 
> Oszrt«l 
b l<ł<2 
; A f«3 


1 
0 
22. f)- 4, 
0 t>3 


23. f(t) 21-3 (7 D'u().. See Figure 6.3.8. 


k=1 


The function f(t) in Problem 23; 


a square wave. 


24. Let f satisfy f(t + T) = f(t) for all t > 0 and for some 
fixed positive number T; f is said to be periodic with period T on 
0 « t < oo. Show that 

T 


e f(t)dt 
LFO = ETT E 


In each of Problems 25 through 28, use the result of Problem 24 to 
find the Laplace transform of the given function. 


25. ro - o 0Oxt«l f(t+2)= f(t). 
0 1<t<2; 
Compare with Problem 23. 
26. ro={ 1 O<t<1, f(t-2)-2 f(0. 
-l, 1<t<2; 


See Figure 6.3.9. 


=] 


| FIGURE 6.3.9 | The function f(t) in Problem 26; a 


Square wave. 
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27. f()—t OK<t< 1; f(t+1)= f(t). 29. a. If f(t) = 1 — u(t), find L{ f(t)}. Sketch the graph of 
See Figure 6.3.10. y= f(t). Compare with Problem 21. 


b. Let g(t) = | f(£) d£, where the function f is defined in 
0 

part a. Sketch the graph of y = g(t) and find L{g(t)}. Use your 
expression for £{g(t)} to find an explicit formula for g(t). 
Hint: See Problem 28 in Section 6.2. 

C. Let h(t) = g(t) — ui(t)g(t — 1), where g is defined in 
part b. Sketch the graph of y = h(t) and find £{h(t) }. Use your 
expression for £{h(t)} to find an explicit formula for h(t). 


The function f(t) in 30. Consider the function p defined by 
Problem 27; a sawtooth wave. t 0<t<1 

n=. i : t2) = p(t). 

p(t) EM peus p(t +2) = p(t) 
28. f(t)=sint, Oxt«m; f(t+7) = f(t). 
See Figure 6.3.11. a. Sketch the graph of y = p(t). 
b. Find Z(p(1)) by noting that p is the periodic extension of the 
y function / in Problem 29c; then use the result of Problem 24. 


c. Find £{p(t)} by noting that 


P(t) =) fdt, 
0 


where f is the function in Problem 26; then use Theorem 6.2.1. 


~y 


T 2m 3m 


The function f(t) in 
Problem 28; a rectified sine wave. 
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Discontinuous Forcing Functions 


In this section we turn our attention to some examples in which the nonhomogeneous term, or 
forcing function, is discontinuous. 


EXAMPLE 1 


Find the solution of the differential equation 


2y" + y'  2y = g(t), (1) 
where 
(t) (t) (t) M YETSA, (2) 
= —k = 
os 20 0, O's or t> 20. 


Assume that the initial conditions are 


y(0) —0, y0) =0. (3) 
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This problem governs the charge on the capacitor in a simple electric circuit with a unit voltage pulse 
for 5 < t < 20. Alternatively, y may represent the response of a damped oscillator subject to the 
applied force g(t). 

Solution: 


The Laplace transform of equation (1) is 


2s7¥(s) — 2sy(0) — 2y(0) + sY(s) — y(0) + 2Y(s) 


L{us(t)} — L{uro(t) } 


1 —5s —20s 
= (ee), 
Introducing the initial values (3) and solving for Y (s), we obtain 
—5s _ e 20s 
Y(s) = —————_.. 4 
(s) s(2s2 +5 +2) e 
To find y(t), it is convenient to write Y(s) as 
Y(s) «(e 7* =e ™) H(s), (5) 
where 
1 
H(s) = (6) 


s(252 +s +2) 
Then, if h(t) = £-! (H(s)), we have 
y(t) = us(t)h(t — 5) — uz(t)h(t — 20). (7) 


Observe that we have used Theorem 6.3.1 to write the inverse transforms of e^? H(s) and 
e 5 H(s), respectively. Finally, to determine h(t), we use the partial fraction expansion of H(s): 


Hog cem (8) 
s) = — + —————. 
s 252-5 4-2 
"m oF . 1 1 
Upon determining the coefficients, we find that a = 7 b = —1, and c = — 5 Thus 
1 à 1 " 1 
= s+- = 
1 sta MEE. 4) 4 
H(s) = =s =ý 
2 252 +s+ 2 2 2 1 2 15 
(: uy ET: 
1 4/15 
1 1 ae 1 4 
=S (9) 


Led) EA 


Then, by referring to lines 9 and 10 of Table 6.2.1, we obtain 


h(t) = E (e cos (45 4 lets in( ) (10) 
à 2 4 vA5 4 


In Figure 6.4.1 the graph of y(t) from equations (7) and (10) shows that the solution consists 
of three distinct parts. For 0 < t < 5, the differential equation is 


2y"+y'+2y — 0, (11) 


and the initial conditions are given by equation (3). Since the initial conditions impart no energy 
to the system, and since there is no external forcing, the system remains at rest; that is, y = 0 for 
0 < t < 5. This can be confirmed by solving equation (11) subject to the initial conditions (3). In 
particular, evaluating the solution and its derivative at t = 5, or, more precisely, as t approaches 5 
from below, we have 


x5)20, y(5)-0. (12) 
Once t > 5, the differential equation becomes 


2y"+y'+2y=1, (13) 
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whose solution is the sum of a constant (the response to the constant forcing function) and a damped 
oscillation (the solution of the corresponding homogeneous equation). The plot in Figure 6.4.1 shows 
this behavior clearly for the interval 5 < t < 20. An expression for this portion of the solution can 
be found by solving the differential equation (13) subject to the initial conditions (12). Alternatively, 
since u5(t) = 1 and u(t) = 0 for 5 < t < 20, equations (7) and (10) reduce to 


y(t) = h(t —5) 


1 1 -(1—5)/4 V 15(t — 5) 1 —(1-5)/4 s A/ 15(t — 5) 
=-— -e cos + e sin | ——— —— |. (14) 
4 24/15 4 


2 2 


Finally, for t > 20 the differential equation becomes equation (11) again, and the initial 
conditions are obtained by evaluating the solution of equations (13), (12), that is, equation (14) and 
its derivative, at t = 20. These values are, approximately, 


y(20) = 0.50162, y'(20) = 0.01125. (15) 


The initial value problem (7), (10) contains no external forcing, so its solution is a damped oscillation 
about y = 0, as can be seen in Figure 6.4.1. 


Solution of the initial value problem (1), (2), (3): 
2y" + y! +2y = us(t) — u(t), y(0) —0, y'(0) = 0. 


Although it may be helpful to visualize the solution shown in Figure 6.4.1 as composed 
of solutions of three separate initial value problems in three separate intervals, it is somewhat 
tedious to find the solution by solving these separate problems. Laplace transform methods 
provide a much more convenient and elegant approach to the problem in Example 1 and to 
others that have discontinuous forcing functions. 


The effect of the discontinuity in the forcing function can be seen if we examine the 
solution y(t) of Example 1 more closely. According to the existence and uniqueness theorem 
(Theorem 3.2.1), the solution y(t) and its first two derivatives are continuous except possibly 
at the points £ = 5 and t = 20, where g is discontinuous. This can also be seen at once from 
equation (7). One can also show by direct computation from equation (7) that y(t) and y'(1) 
are continuous even at t = 5 and t = 20. However, if we calculate y"(1), we find that 


: 7 : n 1 
lim y (f) 20, lim y (t) — —. 


t>5- t—5* 2 
1 
Consequently, y(t) has a jump of 5 at t = 5. Ina similar way, we can show that y"(1) has a 


1 
jump of — 2 att = 20. Thus the jump in the forcing term g(t) at these points is balanced by 


a corresponding jump in the highest order term 2y" on the left-hand side of the equation. 
Consider now the general second-order linear equation 


y" + POY + q(t)y = a(t), (16) 


where p and q are continuous on some interval a < t < (3, but g is only piecewise continuous 
there. If y(t) is a solution of equation (16), then y(t) and y'(t) are continuous ona < t < f, 
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but y(t) has jump discontinuities at the same points as g. Similar remarks apply to higher- 
order equations; the highest derivative of the solution appearing in the differential equation 
has jump discontinuities at the same points as the jump discontinuities in the forcing function, 
but the solution itself and its lower derivatives are continuous even at those points. 


EXAMPLE 2 


Describe the qualitative nature of the solution of the initial value problem 


y" +4y = g(t), (17) 
y(0) 20, y'(0)-0, (18) 
where 
0, 0<t<5, 
g(t) = s EE T (19) 
1, t > 10, 


and then find the solution. 


Solution: 


In this example the forcing function has the graph shown in Figure 6.4.2 and is known as ramp 
loading. It is relatively easy to identify the general form of the solution. For t < 5, the solution is 
simply y = 0. On the other hand, for t > 10, the solution has the form 


y = c, cos(2t) + co sin(2t) + B (20) 
The constant 1/4 is a particular solution of the nonhomogeneous equation, while the other two terms 
are the general solution of the corresponding homogeneous equation. Thus the solution (20) is a 
simple harmonic oscillation about y = 1/4. Similarly, in the intermediate range 5 < t « 10, the 
solution is an oscillation about a certain linear function. In an engineering context, for example, we 
might be interested in knowing the amplitude of the eventual steady oscillation. 


5 10 15 20 t 


Ramp loading; y = g(t) 
from equation (19) or equation (21). 


To solve the problem, it is convenient to write 
1 
a(t) = z (ust (t — 5) — wolt) — 10)), (21) 


as you may verify. Then we take the Laplace transform of the differential equation and use the initial 
conditions, thereby obtaining 


CET ORE 
or 
Y(s) = se —e 15) H(s), (22) 


where 


1 


HO) = saya" 


(23) 
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Thus the solution of the initial value problem (17), (18), (19) is 
1 
Y(t) = z (us(NACt —5) — wio( ACE — 10), (24) 


where A(t) is the inverse transform of H(s). 
The partial fraction expansion of H(s) is 


HG) 1/4 1/4 Q5) 
s) = — — ; 
seg? 44 
and it then follows from lines 3 and 5 of Table 6.2.1 that 
h(t) l, la (2t) (26) 
= -t — -sin : 
4 8° 


The graph of y(t) is shown in Figure 6.4.3. Observe that it has the qualitative form that we 
indicated earlier. To find the amplitude of the eventual steady oscillation, it is sufficient to locate one 
of the maximum or minimum points for t > 10. Setting the derivative of the solution (24) equal to 
Zero, we find that the first maximum is located approximately at ( 10.642, 0.2979), so the amplitude 
of the oscillation is approximately 0.2979 — 0.25 — 0.0479. 


| FIGURE 6.4.3 | Solution of the initial value problem (12), (13), (14). 


Note that in this example, the forcing function g is continuous, but g’ is discontinuous at t = 5 
and t = 10. It follows that the solution y(t) and its first two derivatives are continuous everywhere, 


"n 


but y 
points. 


(t) has discontinuities att = 5 and at t = 10 that match the discontinuities in g’ at those 


Problems 


In each of Problems 1 through 8: 
a. Sketch the graph of the forcing function on an appropriate 
interval. 
b. Find the solution of the given initial value problem. 
Q c. Plot the graph of the solution. 
d. Explain how the graphs of the forcing function and the 
solution are related. 


l. y"+y=f(t); y(0) =0, y'(0) =l; 
1, 0xt« 30 
5o -1 


0, 3% «t « oo 


2. y'r2y' -2y-h(t); 
1l m-xt«2m 
h(t) = 
0 O<t<a or t>27 


y(0) =0, y’(0) =1; 


3. y”+4y = sint — up, (t) sin(t — 27); 
4. y'43y +2y = f(t); 


y(0) 20, y'(0) =0 


y(0) =0, y'(0) =0; 
1, O<t< 10 


f) 
i t > 10 
5 T 
5, yere ciis (1-7): y(0) 20, y'(0) 20 
5 
6. Y +y +7580; 
sint, O<t<7 
g(t) = 


0, tn 


y(0) 20, y'(0) =0; 


Ts y" +4y = us (t) — uas E); y(0) = 0, y’(0) =0 


8. y? + 5y"+4y 2 1— ust); 
y"(0) =0, y"(0) =0 

9. Find an expression involving u(t) for a function f that ramps 
up from zero at t = tọ to the value h at t = tọ + k. 


y(0) 20, y'(0) — 0, 


10. Find an expression involving uc(t) for a function g that ramps 
up from zero at t = tọ to the value / at t = tọ +k and then ramps back 
down to zero at t = tg + 2k. 


11. Acertain spring-mass system satisfies the initial value problem 


u” + E +u =kg(t), u(0) —0, u'(0) = 0, 

where g(t) = u3/2(t) — us;5(1) and k > 0 is a parameter. 
a. Sketch the graph of g(t). Observe that it is a pulse of unit 
magnitude extending over one time unit. 
b. Solve the initial value problem. 
(G) C. Plot the solution for k = 1/2, k = 1, and k = 2. Describe 
the principal features of the solution and how they depend on k. 
(9 d. Find, to two decimal places, the smallest value of k for 
which the solution u(t) reaches the value 2. 
Q e. Suppose k = 2. Find the time 7 after which |u(1)| < 0.1 
forallt > 7. 


12. Modify the problem in Example 2 of this section by replacing 
the given forcing function g(t) by 


1 
FO = = (ust 5 — usal -5-0)/k. 


a. Sketch the graph of f(t) and describe how it depends on k. 
For what value of k is f(t) identical to g(t) in the example? 
b. Solve the initial value problem 


y”+4y= f(t), y(0 20, y'(0) — 0. 


Q c. The solution in part b depends on k, but for sufficiently 
large t, the solution is always a simple harmonic oscillation about 
y — 1/4. Try to decide how the amplitude of this eventual 
oscillation depends on k. Then confirm your conclusion by 
plotting the solution for a few different values of k. 


Resonance and Beats. In Section 3.8 we observed that an undamped 
harmonic oscillator (such as a spring-mass system) with a sinusoidal 
forcing term experiences resonance if the frequency of the forcing term 
is the same as the natural frequency. If the forcing frequency is slightly 
different from the natural frequency, then the system exhibits a beat. In 
Problems 13 through 17 we explore the effect of some nonsinusoidal 
periodic forcing functions. 


13. Consider the initial value problem 
y"+y=f(t), x0 =0, y(0) =0, 


where 


n 
F(t) = ut) € 25 Dus). 
k=l 
a. Draw the graph of f(t) on an interval such as 0 < t < 67. 
b. Find the solution of the initial value problem. 
Q c. Letn = 15. Plot the graph of the solution for 0 < t < 60. 
Describe the solution and explain why it behaves as it does. 
d. Investigate how the solution changes as n increases. What 
happens as n — oo? 


14. Consider the initial value problem 
y'-0.1y -y-f(), y0) =0, y(0) =0, 


where f (t) is the same as in Problem 13. 
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Q a. Plot the graph of the solution. Use a large enough value 
of n and a long enough t-interval so that the transient part of 
the solution has become negligible and the steady state is clearly 
shown. 

b. Estimate the amplitude and frequency of the steady-state part 
of the solution. 

C. Compare the results of part b with those from Section 3.8 for 
a sinusoidally forced oscillator. 


15. Consider the initial value problem 
y'*y-g(), y(0) =0, y(0) =0, 


where 


g(t) = ug) + 3 CD Ius). 


k=1 


a. Draw the graph of g(t) on an interval such as 0 € t < 67. 
Compare the graph with that of f(t) in Problem 13a. 

b. Find the solution of the initial value problem. 

Q c. Let n = 15. Plot the graph of the solution for 0 < t < 60. 
Describe the solution and explain why it behaves as it does. 
Compare it with the solution of Problem 13. 

d. Investigate how the solution changes as n increases. What 
happens as n — oo? 


16. Consider the initial value problem 


y” +0.1y +y=g8(t), y(0)=0, y(0) —0, 


where g(t) is the same as in Problem 15. 
Q a. Plot the graph of the solution. Use a large enough value 
of n and a long enough t-interval so that the transient part of 
the solution has become negligible and the steady state is clearly 
shown. 
@ b. Estimate the amplitude and frequency of the steady-state 
part of the solution. 
C. Compare the results of part b with those from Problem 15 and 
from Section 3.8 for a sinusoidally forced oscillator. 


17. Consider the initial value problem 
y’+y=hA(t), y(0 20, y'(0) — 0, 


where 


n 
h(t) = uo(t) 23 CD ua). 
k=l 

Observe that this problem is identical to Problem 15, except that the 
frequency of the forcing term has been increased somewhat. 

a. Find the solution of this initial value problem. 

Q b. Let n > 33 and plot the solution for 0 < t < 90 or longer. 

Your plot should show a clearly recognizable beat. 

@ c. From the graph in part b, estimate the “slow period" and 

the “fast period" for this oscillator. 

d. For a sinusoidally forced oscillator, it was shown in Section 


1 
3.8 that the “slow frequency” is given by zie — wo|, where 
wọ is the natural frequency of the system and w is the forcing 
1 
frequency. Similarly, the “fast frequency” is ove + wo). Use 


these expressions to calculate the “fast period” and the “slow 
period” for the oscillator in this problem. How well do the results 
compare with your estimates from part c? 
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65 Impulse Functions 


In some applications it is necessary to deal with phenomena of an impulsive nature—for 
example, voltages or forces of large magnitude that act over very short time intervals. Such 
problems often lead to differential equations of the form 


ay" + by' + cy = g(t), (1) 


where g(t) is large during a short interval t — 7 < t < tọ +T for some 7 > 0, and is 
otherwise zero. 
The integral 7(7), defined by 


to+T 
i) = f «od, Q) 
to—T 
or, since g(t) = 0 outside of the interval (to — T, to + T), by 
oo 
I(T) x g(t)dt, (3) 
—00 


is a measure of the strength of the forcing function. In a mechanical system, where 
g(t) is a force, I(T) is the total impulse of the force g(t) over the time interval 
(to — T, to + 7). Similarly, if y is the current in an electric circuit and g(t) is the time 
derivative of the voltage, then / (7) represents the total voltage impressed on the circuit during 
the interval (to — T, to +7). 

In particular, let us suppose that tọ is zero and that g(t) is given by 


=F <t-< T; 


g(t) =d,(t) = 4 27’ (4) 


0, t<—T or t=>T, 


where 7 is a small positive constant (see Figure 6.5.1). According to equation (2) or (3), 
it follows immediately that in this case, (7) = 1 independent of the value of 7, as long as 
T #0. Now let us idealize the forcing function d; by prescribing it to act over shorter and 
shorter time intervals; that is, we consider the functions d, (f) as r — O* (see Figure 6.5.2). 
As a result of this limiting operation, we obtain 

lim d;(f) 20, tz0. (5) 


Tt 


t -8 4 -2-101 2 4 8 t 
GY Graph of y = d; (t). A Graphs of y = d, (1) asr > 0t. 


Further, since /(7) = 1 foreach 7 Z 0, it follows that 


lim 7(7) = 1. (6) 


T—0* 


Equations (5) and (6) are used to define an idealized unit impulse function 6, which 
imparts an impulse of magnitude one at t = 0 but is zero for all values of t other than zero. 
That is, the “function” 6 is defined to have the properties 


ó(t) 20, +t 40; (7) 


J ó(t)dt=1. (8) 


There is no ordinary function of the kind studied in elementary calculus that satisfies both 
equations (7) and (8). The “function” 6, defined by those equations, is an example of what are 
known as generalized functions; it is usually called the Dirac* delta function. Since 6(1) 
corresponds to a unit impulse at £ = 0, a unit impulse at an arbitrary point t = tọ is given by 
6(t — to). From equations (7) and (8), it follows that 


6(t—t%) =0, t to; (9) 


oo 
l ó(t — todt = 1. (10) 
—Ooo0 

The Dirac delta function does not satisfy the conditions of Theorem 6.1.2, but its Laplace 
transform can nevertheless be formally defined. Since (+) is defined as the limit of d; (1) as 
T — O*,it is natural to define the Laplace transform of 6 as a similar limit of the transform 
of d,. In particular, we will assume that ty > O0 and will define Z(6(t — tọ) } by the equation 


L{6(t — to)} = lim £(d;(t — to)]. (11) 


T—0* 


To evaluate the limit in equation (11), we first observe that if 7 < tọ, which must eventually 
be the case as T — OT, then tg — 7 > 0. Since d, (t — ty) is nonzero only in the interval from 
to — T to fo + T, we have 


L{d,(t — to)} = [ e“'d,(t — to) dt 
0 


to+T 
-/ e "d.(t — t)dt. 
to 


Substituting for d, (t — t9) from equation (4), we obtain 


1 totr 1 t=to+T 
L{d,(t —19)) = e "dt = — —e 


2T Jy—r 2sT — 


= 1 e "he( eT = e) 
2sT 


or 


sinh( s7 ) gs. 


L{d,(t — t)} = (12) 


ST 
The quotient sinh(s7) /(sT) is indeterminate as r — O*, but its limit can be evaluated by 


l'Hópital's? rule. We obtain 
sinh( s7 ) . scosh(sT) 
m —— — = lim ————— =1 


T—0* ST T—0* 5 


^Paul A. M. Dirac (1902-1984), English mathematical physicist, received his Ph.D. from Cambridge in 1926 and 
was professor of mathematics there until 1969. He was awarded the Nobel Prize for Physics in 1933 (with Erwin 
Schródinger) for fundamental work in quantum mechanics. His most celebrated result was the relativistic equation 
for the electron, published in 1928. From this equation he predicted the existence of an "anti-electron," or positron, 
which was first observed in 1932. Following his retirement from Cambridge, Dirac moved to the United States and 
held a research professorship at Florida State University. 

>Marquis Guillaume de I’ Hôpital (1661-1704) was a French nobleman with deep interest in mathematics. For a time 
he employed Johann Bernoulli as his private tutor in calculus. L'Hópital published the first textbook on differential 
calculus in 1696; in it appears the property of limits that is named for him. 
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Then from equation (11) it follows that 
L{6(t — tg)) = e^. (13) 

Equation (13) defines Z(ó(t — t))} for any t; > 0. We extend this result, to allow fo to be 
zero, by letting ty — O* on the right-hand side of equation (13); thus 

L{6(t)} = lim e? 5 I. (14) 

to OF 

It is reassuring to see that the Laplace transform formulas derived in equations (13) and (14) 
are consistent with the Laplace transform of a horizontally shifted function: 


L{6(t — to) } = e^*^ £{6(t)} = ge. 


In a similar way, it is possible to define the integral of the product of the delta function 


and any continuous function f. We have 
[o] 


I. 6(t —t) f(t)dt = lim d,(t — to) f(t)dt. (15) 


T—0* J—co 


Using the definition (4) of d; (t) and the mean value theorem for integrals, we find that 


oo 


oo tod T 
] amne c f fai 
- 2T 


1 
= yr fü =f), 
T 


where fj — T < t* < to +7. Hence t* — fg as tT — O°, and it follows from equation (15) 
that 


J 6(t — to) f(t)dt = f (to). (16) 


oo 


The following example illustrates the use of the delta function in solving an initial value 
problem with an impulsive forcing function. 


EXAMPLE 1 


Find the solution of the initial value problem 


2y" + y' +2y = 6(6— 5), (17) 


y(0) =0, y0) = 0. (18) 


Solution: 


This initial value problem arises from the study of the same electric circuit or mechanical oscillator 
as in Example 1 of Section 6.4. The only difference is in the forcing term. 

To solve the given problem, we first take the Laplace transform of the differential equation and 
use the initial conditions, obtaining 


(232 +s 4-2)Y(s) =e. 
Thus 
—5s —5s 1 


Y(s) = = 
(s) 2s2 +s +2 2 ( E 
s 


(19) 


By Theorem 6.3.2, or from line 9 of Table 6.2.1, 


1 4 4/15 
Lo! = e ^ sin (45 (20) 
5 


Hence, by Theorem 6.3.1, we have 


yt) = LY (s)} = 


2 4/15 
us(t)e 0-79 /^ gin | “(4 — 5) (21) 
15 4 
which is the formal solution of the given problem. It is also possible to write y(t) in the form 


0, t<5, 


= 2 4/15 22 
T Ao vs), t>5. nd 
5 


Ni (4 


The graph of equation (22) is shown in Figure 6.5.3. Since the initial conditions at £ = 0 are 
homogeneous and there is no external excitation until t = 5, there is no response in the interval 
0 < t < 5. The impulse at f = 5 produces a decaying oscillation that persists indefinitely. The 
response is continuous at f = 5 despite the singularity in the forcing function at that point. However, 
the first derivative of the solution has a jump discontinuity at £ = 5, and the second derivative has 
an infinite discontinuity there. This is required by the differential equation (17), since a singularity 
on one side of the equation must be balanced by a corresponding singularity on the other side. 


Solution of the initial value problem (17), (18): 
2y"+ y'+2y = ó(t — 5), y(0) 20, y'(0) 20. 


In dealing with problems that involve impulsive forcing, the use of the delta function 
usually simplifies the mathematical calculations, often quite significantly. However, if the 
actual excitation extends over a short, but nonzero, time interval, then an error will be 
introduced by modeling the excitation as taking place instantaneously. This error may be 
negligible, but in a practical problem it should not be dismissed without consideration. In 
Problem 12 you are asked to investigate this issue for a simple harmonic oscillator. 


Problems 


In each of Problems 1 through 8: 
a. Find the solution of the given initial value problem. 
Q b. Plot a graph of the solution. 


1. y"+2y'+2y=6(t-7); y(0-1 y(0) =0 
2. y"+4y =d(t-—7) —6(t—27m); (0) =0, y(0 =0 

” / : / : y? —y 2é(r- 1); 
3. y”+3y' +2y =ô(t—5)+u10(t); (0) —0, y'(0) = 1/2 y'(0) =0, y"(0) =0 
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y’+2y'+3y =sint+6(t—37); y(0) =0, y'(0) =0 
. y’+y=6(t-27) cost; y(0) =0, y(0) =1 

y” +4y =26(t—7/4); y0) = 0, y(0 =0 

. y’+2y'+2y=cost+6(t—7/2); y(0) =0, y(0 =0 


y(0) =0, y'(0) — 0, 
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9. Consider again the system in Example 1 of this section, in which 
an oscillation is excited by a unit impulse at  — 5. Suppose that it is 
desired to bring the system to rest again after exactly one cycle— that 
is, when the response first returns to equilibrium moving in the positive 
direction. 

Q a. Determine the impulse k5(t — to) that should be applied 
to the system in order to accomplish this objective. Note that 
k is the magnitude of the impulse and tọ is the time of its 
application. 

Q b. Solve the resulting initial value problem, and plot its 
solution to confirm that it behaves in the specified manner. 


@ 10. Consider the initial value problem 


y"+yy +y=6(t-1), y(0) =0, y'(0) =0, 


where ^ is the damping coefficient (or resistance). 
Qa. Let y = L Find the solution of the initial value problem 
and plot its graph. 
b. Find the time f, at which the solution attains its maximum 
value. Also find the maximum value y, of the solution. 


Q c. Let, = i and repeat parts a and b. 


d. Determine how f, and y, vary as y decreases. What are the 
values of t; and y; when y = 0? 


11. Consider the initial value problem 
y” +y ty =ko(t—1), y(0 —0, y(0) — 0, 
where k is the magnitude of an impulse at t = 1, and y is the damping 
coefficient (or resistance). 
Qa. Lety = : Find the value of k for which the response has 
a peak value of 2; call this value k,. 
Q b. Repeat part (a) for y = 2 


€. Determine how k; varies as y decreases. What is the value of 


kı when y = 0? 
12. Consider the initial value problem 
y’+y= fet), y(0) =0, y(0)— 0, 


1 . 
where f;(t) = gx (usw —ug4x(t)) withO<k < 1. 


a. Find the solution y = ó(t, k) of the initial value problem. 


b. Calculate lim ¢(t, k) from the solution found in part a. 
kot 


C. Observe that lim f(t) = ó(t —4). Find the solution @0(t) 
k>0t 
of the given initial value problem with f(t) replaced by 6(t—4). 
Is it true that do(t) = lim ¢(t, k)? 
k—>0t 


1 1 
O d. Plot $t, z) Ó |t, il and ó(f) on the same axes. 


Describe the relation between ó (t, k) and oo(t). 


Problems 13 through 16 deal with the effect of a sequence of impulses 
on an undamped oscillator. Suppose that 
y't+y=f(t), y(0)=0, y'(0)—0. 
For each of the following choices for f(t): 
a. Try to predict the nature of the solution without solving the 
problem. 
@ b. Test your prediction by finding the solution and plotting 
its graph. 
c. Determine what happens after the sequence of impulses ends. 
20 


13. f(t) =F 6(t — km) 
k= 


14. f(t) 2 3 (7D — kr) 


k= 


15. f(t) = SS 6(t — Qk — 1)7) 


k= 


40 11 
16. f()-9»«-D*F!6 (: = 2) 


k= 


17. The position of a certain lightly damped oscillator satisfies the 
initial value problem 


20 
y” +0.1y +y = $ (-D'60 -kr), y(0) =0, y(0) =0. 
k=1 


Observe that, except for the damping term, this problem is the same 
as Problem 14. 
a. Try to predict the nature of the solution without solving the 
problem. 
@ b. Test your prediction by finding the solution and drawing 
its graph. 
c. Determine what happens after the sequence of impulses ends. 


@ 18. Proceed as in Problem 17 for the oscillator satisfying 
15 

y'-0.1y ye M 6G -Qk- 18), (0) 20, y(0) =0. 
k=1 


Observe that, except for the damping term, this problem is the same 
as Problem 15. 


19. a. By the method of variation of parameters, show that the 
solution of the initial value problem 


y" +2y’+2y 2 f(t); y(0)20, y'(0) 20 


y- ih eT) F(T) sin(t — 7) dr. 
0 


b. Show that if f(t) = ó(t — 7), then the solution of part a 
reduces to 


y = ur (the! sin(t — x). 


c. Use a Laplace transform to solve the given initial value 
problem with f(t) = ó(t — 7), and confirm that the solution 
agrees with the result of part b. 


66 The Convolution Integral 


Sometimes it is possible to identify a Laplace transform H(s) as the product of two other 
Laplace transforms F(s) and G(s), the latter transforms corresponding to known functions f 
and g, respectively. In this event, we might anticipate that H(s) would be the transform of the 
product of f and g. However, this is not the case; in other words, the Laplace transform cannot 
be commuted with ordinary multiplication. On the other hand, if an appropriately defined 
“generalized product" is introduced, then the situation changes, as stated in the following 
theorem. 


Theorem 6.6.1 | Convolution Theorem 


If F(s) = L{ f(t)} and G(s) = L{g(t)} both exist for s > a > 0, then 
H(s) = F(s)G(s) - £C(h(t), sa, (1) 


where 
t t 
m= f fa- nacer o | Aee- nan. Q) 
0 0 


The function h is known as the convolution of f and g; the integrals in equation (2) are called 
convolution integrals. 


The equality of the two integrals in equation (2) follows by making the change of variable 
t— T = € in the first integral. Before giving the proof of this theorem, let us make some 
observations about the convolution integral. According to this theorem, the transform of the 
convolution of two functions, rather than the transform of their ordinary product, is given by 
the product of the separate transforms. It is conventional to emphasize that the convolution 
integral can be thought of as a “generalized product" by writing 


h(t) — Cf *g)(t). (3) 
In particular, the notation (f*g)(t) serves to indicate the first integral appearing 
in equation (2); the second integral in equation (2) is denoted as (g  f)(1). 


The convolution f * g has many of the properties of ordinary multiplication. For example, 
it is relatively simple to show that 


f*gogsf (commutative law) (4) 
f*(gic- go) = f*gı + f*g (distributive law) (5) 
(f*g)xh-f*(gxh) (associative law) (6) 
fx0=0x f =0. (zero property) (7) 


In equation (7) the zeros denote not the number 0 but the function that has the value 0 for each 
value of t. The proofs of these properties are left to you as exercises. 

However, there are other properties of ordinary multiplication that the convolution 
integral does not have. For example, it is not true in general that f x 1 is equal to f. To see 
this, note that 


(roo = | fa- nado [ fa nar. 
0 0 


If, for example, f(t) = cost, then 


t T=t 
Cf *DCO a cos(t — T) dr = —sin(t — T) 
0 7=0 
= —sin0+sint 
= sint. 


Clearly, ( f *1)(t) # f(t) inthis case. Similarly, it may not be true that f x f is nonnegative. 
See Problem 3 for an example. 
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Convolution integrals arise in various applications in which the behavior of the system at 
time f depends not only on its state at time f but also on its past history. Systems of this kind 
are sometimes called hereditary systems and occur in such diverse fields as neutron transport, 
viscoelasticity, and population dynamics, among others. 

Turning now to the proof of Theorem 6.6.1, we note first that if 


hes n e f(é) dé and G(s) = | a" oar, 
0 0 
then 
F(s)G(s) = n e^* f(£) dé i Par cart (8) 
0 0 


Since the integrand of the first integral does not depend on the integration variable of the 
second, we can write F(s)G(s) as an iterated integral 


F(s)G(s) = n ar) | e (Co de ar 


= [a g(T) (| g ev) FCE) a) dr. (9) 
0 0 


The latter integral can be put into a more convenient form by introducing a change of variable. 
Let € = t — 7, for fixed 7, so that d€ = dt. Further, £ = 0 corresponds to t = 7, and £ = co 
corresponds to t = co; then the integral with respect to € in equation (9) is transformed into 
one with respect to t: 


F(s)G(s) = [ «o ([ ere - na) dr. (10) 
0 T 


The iterated integral on the right-hand side of equation (10) is carried out over the shaded 
triangular region extending to infinity in the tr -plane shown in Figure 6.6.1. Assuming that 
the order of integration can be reversed, we rewrite equation (10) so that the integration with 
respect to 7 is executed first. In this way we obtain 


F(s)G(s) x em fe — Dac ar Jar (11) 
0 0 
Or 
F(s)G(s) — l e "h(t)dt = L{h(t)}, (12) 
0 


where A(t) is defined by equation (2). This completes the proof of Theorem 6.6.1. 


T-0 


| FIGURE 6.6.1 | Region of integration in F(s) G(s). 


EXAMPLE 1 


Find the inverse Laplace transform of 


(13) 


Solution: 


It is convenient to think of H(s) as the product of 5^? and a/(s? +a’), which, according to lines 3 
and 5 of Table 6.2.1, are the transforms of t and sin(at), respectively. Hence, by Theorem 6.6.1, the 
inverse Laplace transform of H(s) is 

— sin(at) 


t 
h(t) — | (t—T)sin(ar) dr = = (14) 
0 


a2 


You can verify that the same result is obtained if A(t) is written in the alternative form 


r= f T sin(a(t—T))dT, 
0 


which confirms equation (2) in this case. Of course, h(t) can also be found by expanding H(s) in 
partial fractions. 


EXAMPLE 2 


Find the solution of the initial value problem 


y” +4y = g(t), (15) 


y(0) =3, y(0) =-1. (16) 


Solution: 


By taking the Laplace transform of the differential equation and using the initial conditions, we obtain 
s?Y(s) — 3s +14+4Y(s) = G(s) 


or 


3s— 1 G(s) 


Ya) = ; 
(s) s?4+4 s4 


(17) 


Observe that the first and second terms on the right-hand side of equation (17) contain the dependence 
of Y(s) on the initial conditions and forcing function, respectively. It is convenient to write Y(s) in 
the form 


Pes 25 * aig (18) 
8). — s Js 
s?+4 2544-4 25244 


Then, using lines 5 and 6 of Table 6.2.1 and Theorem 6.6.1, we obtain 
1. tf. 
y = 3cos(2t) — 2 sin(2t) + 2 sin(2(t — 7))g(7T) dT. (19) 
0 


If a specific forcing function g is given, then the integral in equation (19) can be evaluated (by 
numerical means, if necessary). 


Example 2 illustrates the power of the convolution integral as a tool for writing the solution 
of an initial value problem in terms of an integral. In fact, it is possible to proceed in much 
the same way in more general problems. Consider the problem consisting of the differential 
equation 


ay" + by + cy = g(t), (20) 
where a, b, and c are real constants and g is a given function, together with the initial conditions 
y(0) = ye  »y(0) = yo. (21) 


The Laplace transform approach yields some important insights concerning the structure of the 
solution of any problem of this type. 
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The initial value problem (15), (16) is often referred to as an input-output problem. The 
coefficients a, b, and c describe the properties of some physical system, and g(t) is the input 
to the system. The values yo and y, describe the initial state, and the solution y is the output 
at time f. 

Taking the Laplace transform of equation (20) and using initial conditions (21), we obtain 


(as? + bs + c) Y(s) — (as + b) yo — ayy = G(s). 


If we let 
(as + b) yo + ay; s 
(s) E and W(s) LIT Q2) 
then we can write 
Y(s) = @(s) + V(s). (23) 
Consequently, 
y(t) = (1) + V(t), (24) 


where $(t) = £-!($(s)) and w(t) = £-! (V (s)). Observe that ¢ (t) is the solution of the 
initial value problem 


ay" +by'+cy=0, y(0) = yo, y'(0) = y (25) 


obtained from equations (20) and (21) by setting g(t) equal to zero. Similarly, w(t) is the 
solution of 


ay" +by'+cy = g(t), y(0)=0, y(0) 20, (26) 


in which the initial values yo and y, are each replaced by zero. 

Once specific values of a, b, and c are given, we can use Table 6.2.1 to find 
p(t) = L~!{®(s)}, possibly in conjunction with a translation or a partial fraction expansion. 
To find (t) = £-! (W(s)), it is convenient to write V (s) as 


V(s) = H(s)G(s), (27) 


where H(s) = (as? + bs + c)-!. The function H is known as the transfer function? and 
depends only on the properties of the system under consideration; that is, H(s) is determined 
entirely by the coefficients a, b, and c. On the other hand, G(s) depends only on the external 
excitation g(t) that is applied to the system. By the Convolution Theorem (Theorem 6.6.1) 
we can write 


V) = £7 (HG)6G)] x h(t — T)g(T) dr, Q8) 
0 


where h(t) = £-! (H(s)), and g(t) is the given forcing function. 

To obtain a better understanding of the significance of h(t), we consider the case in which 
G(s) = 1; consequently, g(t) = ó(t) and V(s) = H(s). This means that y = A(t) is the 
solution of the initial value problem 


ay" + by'+cy=6(t), y(0) 20, y'(0) 20, (29) 


obtained from equation (26) by replacing g(t) by 6(t). Thus A(t) is the response of the system 
to a unit impulse applied at t = 0, and it is natural to call h(t) the impulse response of the 
system. Equation (28) then says that w(t) is the convolution of the impulse response and the 
forcing function. 

Referring to Example 2, we note that the transfer function is H(s) — 1/ (s? + 4) and 
the impulse response is h(t) = 1 sin(2t). Also, the first two terms on the right-hand side of 
equation (19) constitute the function ¢ (t), the solution of the corresponding homogeneous 
equation that satisfies the given initial conditions. 


This terminology arises from the fact that H(s) is the ratio of the transforms of the output and the input of the 
problem (20). 


Problems 


1. Prove the commutative, distributive, and associative properties 
of the convolution integral. 


a. fxg=gxf 
b. f«(gitg)—f*gtf*g 
C. fx*(gx*xh)-(fx*g)xh 


2. Findan example different from the one in the text showing that 
(f * 1) (f) need not be equal to f(t). 


3. Show, by means of the example f(t) = sint, that f x f is not 
necessarily nonnegative. 


In each of Problems 4 through 6, find the Laplace transform of the 
given function. 


4. fo = f e resa 
0 

5. ro - [ e UD sinr dr 
0 


6. ro - [ sin(t — T) cost dr 
0 


In each of Problems 7 through 9, find the inverse Laplace transform 
of the given function by using the convolution theorem. 


1 
7. F(s)= Sab 
s 
8. F(s) = (s+ DG +4) Dis? +4) 
9. F(s)— ! 


(s+ 32 +4) 
10. a. If f(t) = t" and g(t) = t", where m and n are positive 
integers, show that 


1 
f *g-— puis u" (1 - u)” du. 
0 


b. Use the convolution theorem to show that 


m!n! 


1 
D u" (1 —u)" du = ——————. 
0 (m 4 n 4 1)! 


c. Extend the result of part b to the case where m and n are 
positive numbers but not necessarily integers. 


In each of Problems 11 through 15, express the solution of the given 
initial value problem in terms of a convolution integral. 


1l. y"-w?y2g(); y(0 20, y(0)21 
12. 4y” +4y'+17y=g(t); y(0) =0, y'(0) =0 


5 
13. y"-y'- 32 1 — us(t); 


14. y”+3y'+2y —cos(at); 


15. y(9 + 5y" E 4y = g(t); 
y"(0) 20, y"(0) =0 


x0) 21, y(0 2-1 


y(0) =1, y'(0) =0 
y(0) =1, y'(0) — 0, 
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16. Consider the equation 
p(t) «f k(t — E)E (E) dE = f(t), 
0 


in which f and k are known functions, and ó is to be determined. Since 
the unknown function o appears under an integral sign, the given 
equation is called an integral equation; in particular, it belongs to 
a class of integral equations known as Volterra integral equations’. 
Take the Laplace transform of the given integral equation and obtain 
an expression for L{¢(t)} in terms of the transforms L{ f(t)} and 
L{k(t)} of the given functions f and k. The inverse transform of 
Lo (t)) is the solution of the original integral equation. 


17. Consider the Volterra integral equation (see Problem 16) 
t 
oo [ «- oec d& = sin(2r). (30) 
0 


a. Solve the integral equation (30) by using the Laplace 
transform. 

b. By differentiating equation (30) twice, show that ¢(f) 
satisfies the differential equation 


Q" (t) + b(t) = —4sin(2r). 
Show also that the initial conditions are 
(0) =0, (0) =2. 


c. Solve the initial value problem in part b, and verify that the 
solution is the same as the one in part a. 


In each of Problems 18 and 19: 
a. Solve the given Volterra integral equation by using the 
Laplace transform. 
b. Convert the integral equation into an initial value problem, as 
in Problem 17b. 
c. Solve the initial value problem in part b, and verify that the 
solution is the same as the one in part a. 


18. oi + [ a- oec 4t =I 
0 


19. so «2 f cos(t — E)E (£) dé =e! 
0 


There are also equations, known as integro-differential equations, in 
which both derivatives and integrals of the unknown function appear. 
In each of Problems 20 and 21: 
a. Solve the given integro-differential equation by using the 
Laplace transform. 
b. By differentiating the integro-differential equation a 
sufficient number of times, convert it into an initial value 
problem. 
c. Solve the initial value problem in part b, and verify that the 
solution is the same as the one in part a. 


20. so [ a-ootdt or $(0) =0 
0 


1 t 
21. g(t) — J G-E PE dE =t, $(0 =1 
0 


7See the footnote about Vito Volterra in Section 9.5. 
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22. The Tautochrone. A problem of interest in the history 
of mathematics is that of finding the tautochrone?—the curve 
down which a particle will slide freely under gravity alone, 
reaching the bottom in the same time regardless of its starting 
point on the curve. This problem arose in the construction of 
a clock pendulum whose period is independent of the amplitude 
of its motion. The tautochrone was found by Christian Huygens 
(1629-1695) in 1673 by geometric methods, and later by Leibniz 
and Jakob Bernoulli using analytic arguments. Bernoulli's solution 
(in 1690) was one of the first occasions on which a differential 
equation was explicitly solved. The geometric configuration is 
shown in Figure 6.6.2. The starting point P(a,b) is joined 
to the terminal point (0,0) by the arc C. Arc length s is 


ya 


P(a, b) 


4 


B 4 


RY 


| FIGURE 6.6.2 | The tautochrone. 


8The word “tautochrone” comes from the Greek words tauto, which means 
“same,” and chronos, which means “time.” 
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measured from the origin, and f(y) denotes the rate of change of s 
with respect to y: 


AN 1/2 
ju s 1+ = (31) 
i dy l 


Then it follows from the principle of conservation of energy that the 
time T (b) required for a particle to slide from P to the origin is 

^ fo) 
v28 Jo Nb —» 


a. Assume that T(b) = Tp, a constant, for each b. By taking 
the Laplace transform of equation (32) in this case, and using the 
convolution theorem, Theorem 6.6.1, show that 


2g To. 


T(b) = 


dy. (32) 


F(s) = «s (33) 
then show that 
2g T 
f= ve To (34) 


T Jy i 
Hint: See Problem 24 of Section 6.1. 
b. Combining equations (32) and (34), show that 


d 2a — 
"M = pucr Je (35) 
dy y 


C. Use the substitution y = 2a sin^(0 /2) to solve equation 
(35), and show that 


x — a(0 +sinð), 


where a = gT, /n?. 


y — a(1 — cos0). (36) 


Equations (36) can be identified as parametric equations of a 
cycloid. Thus the tautochrone is an arc of a cycloid. 
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CHAPTER? 


Systems of First-Order 
Linear Equations 


Many physical problems involve a number of separate but interconnected components. For 
example, the current and voltage in an electrical network, each mass in a mechanical system, 
each element (or compound) in a chemical system, or each species in a biological system have 
this character. In these and similar cases, the corresponding mathematical problem consists 
of a system of two or more differential equations, which can always be written as first-order 
differential equations. In this chapter we focus on systems of first-order linear differential 
equations and, in particular, differential equations having constant coefficients, utilizing some 
of the elementary aspects of linear algebra to unify the presentation. In many respects this 
chapter follows the same lines as the treatment of second-order linear differential equations in 
Chapter 3. 


71 Introduction 


Systems of simultaneous ordinary differential equations arise naturally in problems involving 
several dependent variables, each of which is a function of the same single independent 
variable. We will denote the independent variable by ¢ and will let x1, x2, x3, ... represent 


dependent variables that are functions of t. Differentiation! with respect to t will be denoted 
dx 


by, for example, p or x}. 

Let us begin by considering the spring—mass system in Figure 7.1.1. The two masses 
move on a frictionless surface under the influence of external forces F\(t) and F(t), and 
they are also constrained by the three springs whose constants are Kk, k2, and k3, respectively. 
We regard motion and displacement to the right as being positive. 


leith vets!) A two-mass, three-spring system. 


Using arguments similar to those in Section 3.7, we find the following equations for the 
coordinates x, and x» of the two masses: 


d?x 
mou = ka(x2 — x1) — kyxy + F(t) = —(K + ko) x1 + kaxa + Fit), 
2 (1) 
X 
"m = kax — ky — xi) + F(t) = kaxi — (Ko + ki) xa + F(t). 


See Problem 14 for a full derivation of the system of differential equations (1). 
Next, consider the parallel LRC circuit shown in Figure 7.1.2. Let V be the voltage drop 
across the capacitor and / the current through the inductor. Then, referring to Section 3.7 and 


lIn some treatments you will see differentiation with respect to time represented with a dot over the function, as in 

dx, d?xi 

di and i; = FU We reserve this notation for a specific purpose, which will be introduced in Section 9.6. 
t 


x= 
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to Problem 16 of this section, we can show that the voltage and current are described by the 
system of equations 


> 


di V 
dt^ L 
dV joy 

dt | C RC 


where L is the inductance, C is the capacitance, and R is the resistance. 


(2) 


C 


R 


A parallel LRC circuit. 


One reason why systems of first-order equations are particularly important is that 
equations of higher order can always be transformed into such systems. This is usually required 
if a numerical approach is planned, because, as we will see in Chapter 8, almost all codes 
for generating numerical approximations to solutions of differential equations are written for 
systems of first-order equations. The following example illustrates how easy it is to make 
the transformation from a second-order differential equation to a system of two first-order 
differential equations. 


EXAMPLE 1 


The motion of a certain spring- mass system (see Example 3 of Section 3.7) is described by the 
second-order differential equation 


n 1 rf 
u + g^ +u=0. (3) 
Rewrite this equation as a system of first-order equations. 


Solution: 


Let x, = u and x; = u'. Then it follows that x, = x». Further, u” = x5. Then, by substituting for u, 
u', and u” in equation (3), we obtain 


1 
x, + get e. 


Thus x, and x» satisfy the following system of two first-order differential equations: 


X5 = —X1 EZ —X2. (4) 


The general equation of motion of a spring—mass system 
mu" + yu' + ku = F(t) (5) 


can be transformed into a system of first-order differential equations in the same manner. If 
we let x; = u and x2 = u’, and proceed as in Example 1, we quickly obtain the system 


/ 
X1 = X2, 


, k y 1 
die San , "iU (6) 


To transform an arbitrary n" order equation 


y = Bi y, y, sog 9 (7) 


into a system of n first-order differential equations, we extend the method of Example 1 by 
introducing the variables x1, x2, ... , x, defined by 


dg. wy xa my Vues du Em v. (8) 


It then follows immediately that 


d ees 
ig 
X, = X3, 
(9) 
Esci = Xn, 
and, from equation (7), 
x» PF eon di: (10) 
Equations (9) and (10) are a special case of the more general system 
44 = PEG Els eds aS 
a = F(t, X1. X2, +++ Xi) 
(11) 
x, = Falle X1, X2, knaag sX): 


In a similar way, the system (1) can be reduced to a system of four first-order equations of the 
form (11), and the system (2) is already in this form. In fact, systems of the form (11) include 
almost all cases of interest. Much of the more advanced theory of differential equations is 
devoted to such systems. 

A solution of the system (11) on the interval J: a < t < 0 consists of n functions 


Xy = Qilt), x?) = Pt), ... Xn = Palt) (12) 


where each function is differentiable at all points in interval J and the system of equations (11) 
is satisfied at all points in interval 7. In addition to the given system of differential equations, 
there may also be given n initial conditions of the form 


0 0 
dug) =x], Go) = X3, ecu aw Ug) = x9, (13) 


where fo is a specified value of t in J, and M". OM xe are prescribed numbers. The differential 
equations (11) and the initial conditions (13) together form an initial value problem. 

A solution (12) can be viewed as a set of parametric equations in an n-dimensional space. 
For a given value of f, equations (12) give values for the coordinates x1, ... ,x, of a point 
in the space. As t changes, the coordinates in general also change. The collection of points 
corresponding toa < t < p forms a curve in the space. It is often helpful to think of the curve 
as the trajectory, or path, of a particle moving in accordance with the system of differential 
equations (11). The initial conditions (13) determine the starting point of the moving particle. 
This curve is most easily visualized when n = 2 and the curve lies in the x,x»-plane. 

The following conditions on Fj, Fy, ... , Fa, which are easily checked in specific 
problems, are sufficient to ensure that the initial value problem (11), (13) has a unique solution. 
Theorem 7.1.1 is analogous to Theorem 2.4.2, the existence and uniqueness theorem for a 
single first-order equation. 


Theorem 7.1.1 


. Es , -- OF, OF, 
Let each of the n functions F}, ..., F, and the n^ first partial derivatives ——, ...,—., 
Ox, OXn 
OF, OF, . : ; : 
T ane " be continuous in a region R of tx,xX.--+-X,-space defined by 
Ox, Ox, 
; 0 0 0 
act«f,o« xi < By, ...,AQn < Xn < n, and let the point (v x5. T sn) be 


in R. Then there is an interval |t — tọ| < h in which there exists a unique solution x; = @)(t),..., 
Xn = Ọn(t) of the system of differential equations (11) that also satisfies the initial conditions (13). 
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The proof of this theorem can be constructed by generalizing the argument in Section 2.8, 
but we do not give it here. However, note that, in the hypotheses of the theorem, nothing is said 
about the partial derivatives of Fi, ... , Fa with respect to the independent variable t. Also, 
in the conclusion, the length 2h of the interval in which the solution exists is not specified 
exactly, and in some cases it may be very short. Finally, the same result can be established on 
the basis of somewhat weaker but more complicated hypotheses, so the theorem as stated is 
not the most general one known, and the given conditions are sufficient, but not necessary, for 
the conclusion to hold. 

If each of the functions Fj, ... , Fa in equations (11) is a linear function of the dependent 
variables xj, ... , Xn, then the system of differential equations is said to be linear; otherwise, 
it is nonlinear. Thus the most general system of n first-order linear differential equations has 
the form 


x; = pu(t)xi +--+ + pu(Dxs + g(t), 
x) pa(t)xi +--+ + Pont) xn + gat), 
(14) 


X. = paltry c + Pan(t)xXn git). 


If each of the functions g(t), ... , g(t) is zero for all ¢ in the interval 7, then the system 
(14) is said to be homogeneous; otherwise, it is nonhomogeneous. Observe that the systems 
(1) and (2) are both linear. The system (1) is nonhomogeneous unless F(t) = F(t) = 0, 
while the system (2) is homogeneous. 

For the linear system (14), the existence and uniqueness theorem is simpler and also has 
a stronger conclusion. It is analogous to Theorems 2.4.1 and 3.2.1. 


If the functions pii, pis .... Pnn» £1 -....8, are continuous on an open interval 
I1: o < t < f, then there exists a unique solution x = (ft), ..., x, = y(t) of the system 


a oe M ; "un" 0 0 
(14) that also satisfies the initial conditions (13), where tọ is any point in 7, and Xp. +++ , Xn are any 
prescribed numbers. Moreover, the solution exists throughout the interval /. 


Note that, in contrast to the situation for a nonlinear system, the existence and uniqueness 
of the solution of a linear system are guaranteed throughout the interval in which the 
hypotheses are satisfied. Furthermore, for a linear system the initial values 5 Lakhs xe at 
t = to are completely arbitrary, whereas in the nonlinear case the initial point must lie in 
the region R defined in Theorem 7.1.1. 

The rest of this chapter is devoted to systems of linear first-order differential equations 
(nonlinear systems are included in the discussions in Chapters 8 and 9). Our presentation 
makes use of matrix notation and assumes that you have some familiarity with the properties 
of matrices. The basic facts about matrices needed for this discussion are presented in Sections 
7.2 and 7.3; some more advanced material is reviewed as needed in later sections. 


Problems 


In each of Problems 1 through 3, transform the given equation intoa In each of Problems 4 and 5, transform the given initial value problem 


system of first-order equations. into an initial value problem for two first-order equations. 
1. u"-40.5u' + 2u=0 4. u"--0.25u' + 4u = 2cos(3t), u(0) =1, u'(0) =—2 
2. Cw + tw! (0 —0.25)u 20 5. ul" + p(t)u' t q(r)u = g(t), u(0) 2 ug, u'(0) = uj 


3. u 4 20 


6. Systems of first-order equations can sometimes be transformed 
into a single equation of higher-order. Consider the system 


x = —2x, + x, X) =X, — 2x2. 

a. Solve the first differential equation for x». 

b. Substitute the result of a into the second differential equation, 
thereby obtaining a second-order differential equation for x,. 

C. Solve the differential equation found in b for x,. 

d. Use the results of a and c to find x32. 


In each of Problems 7 through 9, proceed as in Problem 6. 
a. Transform the given system into a single equation of second- 
order. 
b. Find x, and x» that also satisfy the given initial conditions. 
C. Sketch the graph of the solution in the x,x»?-plane for t > 0. 


7. xj—3x;-2x5 x,(0) =3 
1 
x, = 2x; — 2x2), x2(0) = = 
2 
8. xi = 2x5, x1(0) =3 
X5=—2x1, x2(0) =4 
9. x) = -ix 42x, x(0-2-2 
xh = 2x, — ix, 42(0) =2 


10. Transform equations (2) for the parallel circuit into a single 
second-order equation. 


11. Show that if a), a12, a21, and a»; are constants with a4? and a», 
not both zero, and if the functions gı and g» are differentiable, then 
the initial value problem 
0 

xi = axı +a gà). x(0 = x, 
0 
x, = axı d-doox3 + Q(t), — x3(0) = x, 
can be transformed into an initial value problem for a single second- 
order equation. Can the same procedure be carried out if aj), ... , a22 
are functions of t? 


12. Consider the linear homogeneous system 


x' = py(t)x + p(t)y, 
y! = pa(t)x + pox(1) y. 


Show that if x = x,(t), y = y,(t) and x = x(t), y = y(t) 
are two solutions of the given system, then x = c,x,(t) + c2x2(t), 
y = cyy\(t) + c2y2(t) is also a solution for any constants c, and 


c2. This is the principle of superposition; it will be discussed in much 
greater detail in Section 7.4. 


13. Letx = x(t), y = yı (t) and x = x2(t), y = y2(t) be any two 
solutions of the linear nonhomogeneous system 


x = py(t)x + prl(t)y + 81(t), 
y! = poi (t)x + pool(t)y + ga(1). 


Show that x = xi(f) — x2(t), y = yı(t) — y2(t) is a solution of the 
corresponding homogeneous system. 


14. Equations (1) can be derived by drawing a free-body diagram 
showing the forces acting on each mass. Figure 7.1.3a shows the 
situation when the displacements x, and x» of the two masses are both 
positive (to the right) and x2 > xı. Then springs 1 and 2 are elongated 
and spring 3 is compressed, giving rise to forces as shown in Figure 
7.1.3b. Use Newton’s law ( F = ma) to derive equations (1). 
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(a) 
klx — xi) hax» 
(b) 


JA: (a) The displacements x; and x? are both 
positive. (b) The free-body diagram for the spring — mass system. 


15. Transform the system (1) into a system of first-order differential 
equations by letting yj = x1, y? = x2, y3 = xj, and y4 = x5. 

Electric Circuits. The theory of electric circuits, such as that shown 
in Figure 7.1.2, consisting of inductors, resistors, and capacitors, is 
based on Kirchhoff's laws: (1) The net flow of current into each node 
(or junction) is zero, and (2) the net voltage drop around each closed 
loop is zero. In addition to Kirchhoff's laws, we also have the relation 
between the current 7, with units of amperes through each circuit 
element and the voltage drop V , measured in volts, across the element: 


V = RI, R resistance in ohms; 
dV : . 2 
E =], C = capacitance in farads; 
dl : ; 

Tr =); L = inductance in henrys. 


Kirchhoff’s laws and the current-voltage relation for each circuit 
element provide a system of algebraic and differential equations 
from which the voltage and current throughout the circuit can be 
determined. Problems 16 through 18 illustrate the procedure just 
described. 


16. Consider the circuit shown in Figure 7.1.2. Let 74, J, and 1; be 
the currents through the capacitor, resistor, and inductor, respectively. 
Likewise, let Vi, V2, and V3 be the corresponding voltage drops. The 
arrows denote the arbitrarily chosen directions in which currents and 
voltage drops will be taken to be positive. 
a. Applying Kirchhoff's second law to the upper loop in the 
circuit, show that 


V; — V; —0. (15) 
In a similar way, show that 
V — V; —0. (16) 


b. Applying Kirchhoff’s first law to either node in the circuit, 
show that 


c. Use the current-voltage relation through each element in the 
circuit to obtain the equations 


CV —Ih, V2=Rh, LI, = V3. (18) 
d. Eliminate V», V3, J}, and I, among equations (15) through 
(18) to obtain 
/ Vi / 
CV, --h- Fr LL = V- (19) 


Observe that if we omit the subscripts in equations (19), then we 
have the system (2) of this section. 


? Actual capacitors typically have capacitances measured in microfarads. 
We use farad as the unit for numerical convenience. 
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17. Consider the circuit shown in Figure 7.1.4. Use the method 
outlined in Problem 16 to show that the current 7 through the inductor 
and the voltage V across the capacitor satisfy the system of differential 
equations 


E. I-V ANON V 
dt — ^ dt 
R= 1 ohm 
L = 1 henry 
R = 2 ohms 
c 
C=, farad 


IJ [d 7]: 1239/7 9 The circuit in Problem 17. 


18. Consider the circuit shown in Figure 7.1.5. Use the method 
outlined in Problem 16 to show that the current / through the inductor 
and the voltage V across the capacitor satisfy the system of differential 
equations 


dl dV V 
L --HR - V, =I l 
dt a: e R5 
L 
R, 
C 


AIEA The circuit in Problem 18. 


19. Consider the two interconnected tanks shown in Figure 7.1.6. 
Tank 1 initially contains 30 gal of water and 25 oz of salt, and Tank 2 
initially contains 20 gal of water and 15 oz of salt. Water containing 
1 oz/gal of salt flows into Tank 1 at a rate of 1.5 gal/min. The 
mixture flows from Tank 1 to Tank 2 at a rate of 3 gal/min. Water 
containing 3 oz/gal of salt also flows into Tank 2 at a rate of 1 gal/min 
(from the outside). The mixture drains from Tank 2 at a rate of 4 gal/ 
min, of which some flows back into Tank 1 at a rate of 1.5 gal/min, 
while the remainder leaves the system. 
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1.5 gal/min 


—— 


1 gal/min 


—— 


1 oz/gal 3 oz/gal 


3 gal/min 


2.5 gal/min Tank 2 


IAU: IAE Two interconnected tanks (Problem 19). 


a. Let Q,(t) and Q(t), respectively, be the amount of salt in 
each tank at time t. Write down differential equations and initial 
conditions that model the flow process. Observe that the system 
of differential equations is nonhomogeneous. 

b. Find the values of Q; and Q, for which the system is in 
equilibrium —that is, does not change with time. Let Qr and 
oF be the equilibrium values. Can you predict which tank will 
approach its equilibrium state more rapidly? 

c. Let x, = Qi(1) — QF and x; = Q(t) — QF. Determine 
an initial value problem for x, and x2. Observe that the system of 
equations for x; and x» is homogeneous. 


20. Consider two interconnected tanks similar to those in Figure 
77.1.6. Initially, Tank 1 contains 60 gal of water and Q^ oz of salt, and 
Tank 2 contains 100 gal of water and Q3 oz of salt. Water containing 
qı oz/gal of salt flows into Tank 1 at a rate of 3 gal/min. The mixture 
in Tank 1 flows out at a rate of 4 gal/min, of which half flows into Tank 
2, while the remainder leaves the system. Water containing q2 oz/gal of 
salt also flows into Tank 2 from the outside at the rate of 1 gal/min. The 
mixture in Tank 2 leaves it at a rate of 3 gal/min, of which some flows 
back into Tank 1 at arate of 1 gal/min, while the rest leaves the system. 
a. Draw a diagram that depicts the flow process described 
above. Let Q(t) and Q»(1), respectively, be the amount of 
salt in each tank at time t. Write down differential equations and 
initial conditions for Qı and Q, that model the flow process. 
b. Find the equilibrium values OF and pr in terms of the 
concentrations q; and q». 
C. Is it possible (by adjusting q; and q2) to obtain Qr — 60 and 
of = 50 as an equilibrium state? 
d. Describe which equilibrium states are possible for this system 
for various values of q; and q2. 


For both theoretical and computational reasons, it is advisable to bring some of the results of 
matrix algebra? to bear on the initial value problem for a system of linear differential equations. 


3The properties of matrices were first extensively explored in 1858 in a paper by the English algebraist Arthur Cayley 
(1821-1895), although the word “matrix” was introduced by his good friend James Sylvester (18 14-1897) in 1850. 
Cayley did some of his best mathematical work while practicing law from 1849 to 1863; he then became professor 
of mathematics at Cambridge, a position he held for the rest of his life. After Cayley's groundbreaking work, the 
development of matrix theory proceeded rapidly, with significant contributions by Charles Hermite, Georg Frobenius, 


and Camille Jordan, among others. 


This section and the next are devoted to a brief summary of the facts that will be needed later. 
More details can be found in any elementary book on linear algebra. We assume, however, 
that you are familiar with determinants and how to evaluate them. 

We designate matrices by boldfaced capitals A, B, C, ... , occasionally using boldfaced 
Greek capitals , v, ... . A matrix A consists of a rectangular array of numbers, or elements, 
arranged in m rows and n columns — that is, 


aij 12 in 
a21 ü5) =s An 

A= . . "a (1) 
m] m2 se Amn 


We speak of A as an m x n matrix. Although later in the chapter we will often assume that the 
elements of certain matrices are real numbers, in this section we allow for the possibility that 
the elements of matrices may be complex numbers. The element lying in the i" row and j'^ 
column is designated by a;;, the first subscript identifying its row and the second its column. 
Sometimes the notation (ajj) is used to denote the matrix whose generic element is aj;. 
Associated with each matrix A is the matrix A^, which is known as the transpose of A 
and is obtained from A by interchanging the rows and columns of A. Thus, if A = (ai), then 
AT = (aji). Also, we will denote by a;; the complex conjugate of a;;, and by A the matrix 
obtained from A by replacing each element aj; by its conjugate a;;. The matrix A is called the 


—T 
conjugate of A. It will also be necessary to consider the transpose of the conjugate matrix A . 
This matrix is called the adjoint of A and will be denoted by A*. 


For example, let 
3 2-1 
A= . 
443i —5-42i 


AT 3 44 3i x 3 2+i 
“Mer 5 ^. Moa Lge gay’ 
T 3 4 — 3i 
TRG -52i 


We are particularly interested in two somewhat special kinds of matrices: square 
matrices, which have the same number of rows and columns— that is, m = n; and vectors 
(or column vectors), which can be thought of as n x 1 matrices, or matrices having only one 
column. Square matrices having n rows and n columns are said to be of order n. We denote 
(column) vectors by boldfaced lowercase letters: x, y, £, n, ... . The transpose x’ of ann x 1 
column vector is a 1 x n row vector — that is, the matrix consisting of one row whose elements 
are the same as the elements in the corresponding positions of x. 


Then 


Properties of Matrices. 

1. Equality. Two m x n matrices A and B are said to be equal if all corresponding elements 
are equal —that is, if a;; = bj; for each i and j. 

2. Zero. The symbol 0 will be used to denote the matrix (or vector) each of whose 
elements is zero. 

3. Addition. The sum of two m x n matrices A and B is defined as the matrix obtained 
by adding corresponding elements: 


A+ B= (aij) + (bij) = (aij + bij). (2) 
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With this definition, it follows that matrix addition is commutative and associative, so 
that 


A+B=B+A, A+(B+C) =(A+B)+C. (3) 


4. Multiplication by a Number. The product of a matrix A by a real or complex number 
a is defined as follows: 


aA = a(ajj) = (aaij); (4) 
that is, each element of A is multiplied by a. The distributive laws 
a(A+B)=aA+aB, (a+f)A=aA+ BA (5) 


are satisfied for this type of multiplication. In particular, the negative of A, denoted by —A, is 
defined by 


—A = (—1)A. (6) 
5. Subtraction. The difference A — B of two m x n matrices is defined by 
A-—B=A+(-B). (7) 
Thus 
A—B = (aij) — (bij) = (aij — bij). (8) 


6. Multiplication. The product AB of two matrices is defined whenever the number 
of columns in the first factor is the same as the number of rows in the second. 
If A and B are m x n and n x r matrices, respectively, then the product C = AB is anm x r 
matrix. The element in the i row and j'^ column of C is found by multiplying each element 
of the i" row of A by the corresponding element of the j" column of B and then adding the 
resulting products. In symbols, 


n 


Cjj = 3 aug. (9) 


k=1 


By direct calculation, it can be shown that matrix multiplication satisfies the associative 
law 


(AB) C = A(BC) (10) 
and the distributive law 
A(B 4- C) = AB+ AC. (11) 


However, in general, matrix multiplication is not commutative. For both products AB and BA 
to exist and to be of the same size, it is necessary that A and B be square matrices of the same 
order. Even in that case the two products are usually unequal. In general, 


AB z BA. (12) 


EXAMPLE 1 


To illustrate the multiplication of matrices, and also the fact that matrix multiplication is not 
necessarily commutative, consider the matrices 


Solution: 
From the definition of multiplication given in equation (9), we have 
2-242 1+2-1 -1+0+1 
AB = {042-2 0-241 0+0-1 
44142 2-1-1 -2+0+1 


2 2 0 
—-[0-1 -1 
7 0 -1 
Similarly, we find that 
0 -3 0 
BA= |1 —4 2 
4 -5 4 


Clearly, AB # BA. 


7. Multiplication of Vectors. There are several ways of forming a product of two vectors 
x and y, each with n components. One is a direct extension to n dimensions of the familiar dot 
product from physics and calculus; we denote it by x’ y and write 


n 
X y=) uy (13) 
i=l 


The result of equation (13) is a real or complex number, and it follows directly from 
equation (13) that 


x'y=y"x, x (y+z2) =x"y+x"z, (ax) y=a(x"y) =x (ay). (14 


There is another vector product that is also defined for any two vectors having the same 
number of components. This product, denoted by (x, y), is called the scalar or inner product 
and is defined by 


(x,y) =) xy. (15) 
i=l 


The scalar product is also a real or complex number, and by comparing equations (13) and 
(15), we see that they are related: 


(x,y) =x". (16) 


Thus, if all the elements of y are real, then the two products (13) and (15) are identical. From 
equation (15) it follows that 
(xy) =(y,x), (x,y +2) = (x, y) + (x, 2), T 
(ax,y) 2 o(x y) (x, ay) =a(x,y). 


Note that even if the vector x has elements with nonzero imaginary parts, the scalar 
product of x with itself yields a nonnegative real number 


n n 
(x,x) = by = 5 Ix; 2. (18) 
i=l i-l 


The nonnegative quantity (x, x) !/?. often denoted by ||x||, is called the length, or magnitude, 
of x. The only vector x with length zero, (x, x) = 0, is the zero vector x = 0; every other vector 
has positive length. If (x, y) = 0, then the two vectors x and y are said to be orthogonal. For 
example, the unit vectors i, j, k of three-dimensional vector geometry form an orthogonal set. 
On the other hand, if some of the elements of x are not real, then the product 


n 
xx= NC (19) 
i=l 
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T. 


may not be a real number. Moreover, x` x can be zero for some nonzero vectors. 


For example, let 


i 2—i 1 
x=/ -2 |, y= i and z= | 0 
1+i 3 i 


Then 
x’y = (i)(2— i) - (-2)() +1 + )(3) = 4 4 3i, 
(x, y) =DD + (-2)(-) (0 9 0(3) 224 7i, 
x’x = (i)? c (22)? t (1 i)? 23 4 2i, 
(x, x) = (Ð +(-2)(-2) c (0 D(1— i) =7, 
z'z = (1)(1) +(0)(0) +) 2140-120, 
(zz) = (1)(1) + (0)(0) + ()(—i) =1+0+1 =2. 


8. Identity. The n x n multiplicative identity, or simply the n x n identity matrix I, is 
given by 


n columns 
SS 
l Qu 
0 1 0 
I= |e 5 - n rows. (20) 
0.0 >. 1 
From the definition of matrix multiplication, we have 
AI=IA=A (21) 


for any (square) matrix A. Hence the commutative law does hold for square matrices if one of 
the matrices is the identity. 

9. Inverse and Determinant. The n x n square matrix A is said to be nonsingular or 
invertible if there is another matrix B such that AB = I and BA = I, where I is then x n 
identity matrix. If there is such a B, it can be shown that there is only one. It is called the 
multiplicative inverse, or simply the inverse, of A, and we write B = A-!. Then 


AA | 2 A'A — I. (22) 


Matrices that do not have an inverse are called singular or noninvertible. 

There are various ways to compute A^! from A, assuming that it exists. One way involves 
the use of determinants. Associated with each element aj; of a given matrix is the minor M;j, 
which is the determinant of the matrix obtained by deleting the i row and j" column of the 
original matrix — that is, the row and column containing a;;. Also associated with each element 
aj; is the cofactor C;; defined by the equation 


Ci; = (-1)* Mij. (23) 
The determinant of A, written det A, can be found as the sum of the cofactors along any row 
or column of A. For example, expanding along the first row yields the formula 
det A = Cj; + Cio b Cin- 

When det A Æ 0, A is nonsingular and A^! exists. If we write B = A^, then it can be 

shown that the general element b;; is given by 
Gx 
bj; 2 —L. 
7^ detA 


Although formula (24) is not an efficient way? to calculate A ^! , it does suggest a condition 
that A must satisfy for it to have an inverse. In fact, the condition is both necessary and 


(24) 


^For large n the number of multiplications required to evaluate A^! by equation (24) is proportional to n!. If we use 
a more efficient method, such as the row reduction procedure described in this section, the number of multiplications 
is proportional only to n?. Even for small values of n (such as n = 4), determinants are not an economical tool in 
calculating inverses, and row reduction methods are preferred. 


sufficient: A is nonsingular if and only if det A Z 0. Equivalently, A is singular if and only if 
det A = 0. 

Another (and usually better) way to compute A^! is by means of elementary row 
operations. There are three such operations: 


1. Interchange of two rows. 
2. Multiplication of a row by a nonzero scalar. 
3. Addition of any multiple of one row to another row. 


The transformation of a matrix by a sequence of elementary row operations is referred to as 
row reduction or Gaussian? elimination. Any nonsingular matrix A can be transformed into 
the identity I by a systematic sequence of these operations. It is possible to show that if the 
same sequence of operations is then performed on I, it is transformed into A^. It is most 
efficient to perform the sequence of operations on both matrices at the same time by forming 
the augmented matrix (A | D. The following example illustrates the calculation of an inverse 
matrix in this way. 


EXAMPLE 2 


Find the inverse of 


1 =I -1 
A-|3 -1 2 
2 2 3 


Solution: 


The first step in finding A^! is to form the augmented matrix (A | I): 


1 -1 —1 |1 0 0 
(A\I)=]3 -1 2. 0 1 0 
2 2 3 | 0 0 
The vertical line is a visual reminder that this 3 x 6 matrix is formed from two 3 x 3 matrices. The 


matrix A can be transformed into I by the following sequence of operations, and at the same time, I 
is transformed into A~!. The result of each step appears below the statement. 


(a) Obtain zeros in the off-diagonal positions (shaded) in the first column by adding (—3) times the 
first row to the second row and adding ( —2) times the first row to the third row. 


1 -1 =l 1 0 0 


(b) Obtain a 1 in the diagonal position (shaded) in the second column by multiplying the second 


row by 3 
1 -1 -1 1 0 0 
5 3 1 
1 =I- = 0 
" 2 2 2 
0 4 5| —2 0 1 


Carl Friedrich Gauss (1777-1855) was born in Brunswick (Germany) and spent most of his life as professor of 
astronomy and director of the Observatory at the University of Gottingen. Gauss made major contributions to many 
areas of mathematics, including number theory, algebra, non-Euclidean and differential geometry, and analysis, as 
well as to more applied fields such as geodesy, statistics, and celestial mechanics. He is generally considered to be 
among the half-dozen best mathematicians of all time. 
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(c) Obtain zeros in the off-diagonal positions (shaded) in the second column by adding the second 
row to the first row and adding (—4) times the second row to the third row. 


1 0 3 1 1 0 
2 2 2 

0 1 3 3 1 0 
2 2 2 


0 0 —5 4 -2 1 


(d) Obtain a 1 in the diagonal position (shaded) in the third column by multiplying the third row by 
1 


2 
1 1 
1 o 2). 2-4 
2] & 3 
5| 3 1 
i zl. = c 
0 1i IT» 2 
o o wl 2. 
5 5 5 


3 
(e) Obtain zeros in the off-diagonal positions (shaded) in the third column by adding (- >) times 


5 
the third row to the first row and adding (- 5) times the third row to the second row. 


7 1 3 
] 0 oO], = <= = 
10 10 10 
1 1 1 
=(I|A"!). 
0 1 0 E > (1147!) 
4 2 1 
E EM 1|-- = =e 
5 5 5 
Thus 
7 1 3 
10 10 10 
etn E QI og 
2 2 2 
4 2 1 
5 5 5 


That this matrix is, in fact, the inverse of A can be verified by direct multiplication with the original 
matrix A. 


This example was made slightly simpler by the fact that the given matrix A had a 1 in the 
upper left corner (aj; = 1). If this is not the case, then the first step is to produce a 1 there 
by multiplying the first row by 1/a;,, as long as aj; zz 0. If a4; = O, then the first row must 
be interchanged with some other row to bring a nonzero element into the upper left position 
before proceeding. If this cannot be done, because every element in the first column is zero, 
then the matrix has no inverse and is singular. A similar situation may occur at later stages of 
the process as well, and the remedy is the same: interchange the given row with a lower row 
so as to bring a nonzero element to the desired diagonal location. If, at any stage, this cannot 
be done, then the original matrix is singular. 


Matrix Functions. We sometimes need to consider vectors or matrices whose elements are 
functions of a real variable t. We write 


x(t) a(t) >>> a) 
x(t) = : and A(t) = : : ; (25) 
x,Ct) amı(t) dE: agp (t) 
respectively. 
The matrix A(f) is said to be continuous at t = fy or on an interval a. < t < f if each 


element of A is a continuous function at the given point or on the given interval. Similarly, 
A(t) is said to be differentiable if each of its elements is differentiable, and its derivative 


dA/ dt is defined by 
gx mn (26) 
dt dt J’ 
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that is, each element of dA / dt is the derivative of the corresponding element of A. In the same 
way, the integral of a matrix function is defined as 


b b 
/ A(t)dt = (| aj). (27) 


For example, if 


then 


4, _ [Cost 1 i EPVEE VE 
«o =( 0 EM and f aoa = (7 0 ) 


Many of the rules of elementary calculus extend easily to matrix functions; in particular, 


d dA 
— (CA) = C—., where Cisa constant matrix; (28) 
dt dt 
aA dB. 
—(A = — + — 29 
EA +B) = T d (29) 
dB | dA 
—(AB) = A— + —B. 30 
L )= di rs di (30) 


In equations (28) and (30), care must be taken in each term to avoid interchanging the 
order of multiplication. The definitions expressed by equations (26) and (27) also apply as 
special cases to vectors. 

We conclude this section with an important reminder: some operations on matrices are 
accomplished by applying the operation separately to each element of the matrix. Examples 
include multiplication by a number, differentiation, and integration. However, this is not true 
of many other operations. For instance, the square of a matrix is not calculated by squaring 
each of its elements. 


Problems 


] -2 0 4 —2 3 —2 1 2 1 2 3 
1. IfA= 3 2 -1] andB= |-1 5 0 |, find 3. IfA= 1 0 —3 | andB = 3 —] -1 |, find 
—2 1 3 6 1 2 2 —1 1 —2 1 0 
a. 20A+B a AT 
b. A —4B b B? 
c. AB c. AT - B? 
d. BA d (A+B)? 
3. a= (15. JE) an= ( DE aal Mii 
3+2i 2—i 2 —2i . —7i2-i 243i)" in 
a. A —2B 
b. 3A +B * d 
c. AB aes 
d. BA C. A 
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1 -2 0 2 1 — 
5. IfA= 3 2 -1] B2|-2 3 3],and 
-2 0 3 1 
2 1 0 
C= |1 2 2], verify that 


0 1 -1 


a. (AB)C = A(BC) 

b. (A+B) -C—-A- (B4 C) 
c. A(B - C) =AB+ AC 

d. a(A+B) =aA+aB 


6. Prove each of the following laws of matrix algebra: 
a. A+B=B+A 
b. A+(B+C) =(A+B)+C 
€. a(A+B) =aA+aB 
d. (o -8)A - aA BA 
e. A(BC) = (AB)C 
f. A(B-- C) =AB+ AC 


2 -1+i 
7. Ifx= 3i and y = 2 |, find 
1-i 3-i 
a. xy 
b. y^y 
c. (x,y) 
d. (yy) 


In each of Problems 8 through 14, if the given matrix is nonsingular, 
find its inverse. If the matrix is singular, verify that its determinant is 
Zero. 


1 2 3 
10. 2 4 
3 5 6 
1 2 1 
11. —2 1 8 
1.:—2 T 
2 0 
12. 0 2 1 
0 0 2 


2 3 1 
15 |-1 2 d 
AT i 


15. IfA is a square matrix, and if there are two matrices B and C 
such that AB = I and CA = I, show that B = C. Thus, if a matrix 
has an inverse, it can have only one. 


e! 26-* e 
16. IfA(t) = | 2e! e™  —e? | and 
—e! 3e 22! 
2e! e™ 3e” 
B(2|-e 2e” e% | , find 
3e! =e =e” 
a. A+3B 
b. AB 
dA 


dt 


c. 
1 
i A(t)dt 
0 


In each of Problems 17 and 18, verify that the given vector satisfies 
the given differential equation. 


x 


1 1 1 
18. x [2 1 —1 Íx, 
0 —1 1 
6e! 6 0 
x= | —8e™ +2e” | 2|-8]e* «2| 1]e* 
—4e^! — 2e” —4 ai 


In each of Problems 19 and 20, verify that the given matrix satisfies 
the given differential equation. 


1 1 e? et 
19. w= Y, Yv()-— 
P 5) ee J 
1 -<] 4 el e e 


20. W=(3 2 -1|w, P) = 


E —e -—e 


N 
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73 Systems of Linear Algebraic Equations; 
Linear Independence, Eigenvalues, 
Eigenvectors 


In this section we review some results from linear algebra that are particularly important for 
the solution of systems of linear differential equations. Some of these results are easily proved 
and others are not; since we are interested simply in summarizing some useful information in 
compact form, we give no indication of proofs in either case. All the results in this section 
depend on some basic facts about the solution of systems of linear algebraic equations. 


Systems of Linear Algebraic Equations. A set of n simultaneous linear algebraic 
equations in n variables 


4141X1 + 12X2 + +++ + AinXn = Dy, 


(1) 


1X1 + an2X2 + +++ + AnnXn = by, 
can be written in matrix form as 
Ax = b, (2) 


where the n x n matrix A and the n-dimensional vector b are given, and the components of the 
n-dimensional vector x are to be determined. If b — 0, the system is said to be homogeneous; 
otherwise, it is nonhomogeneous. 

If the coefficient matrix A is nonsingular— that is, if det A is not zero—then there is a 
unique solution of the system (2) for any vector b. Since A is nonsingular, A~! exists, and the 
solution can be found by multiplying each side of equation (2) on the left by A~!; thus 


x—A lb. (3) 


In particular, the homogeneous problem Ax = 0, corresponding to b = 0 in equation (2), has 
only the trivial solution x = 0. 

On the other hand, if A is singular—that is, if det A is zero —then, depending on the 
specific right-hand side b, solutions of equation (2) either do not exist, or do exist but are 
not unique. Since A is singular, A^! does not exist, so equation (3) is no longer valid. 

When A is singular, the homogeneous system 


Ax = 0 (4) 


has (infinitely many) nonzero solutions in addition to the trivial solution. The situation for 
the nonhomogeneous system (2) is more complicated. This system has no solution unless the 
vector b satisfies a certain further condition. This condition is that 


(b, y) = 0, (5) 


for all vectors y satisfying A*y = 0, where A* is the adjoint of A. If condition (5) is met, then 
the system (2) has (infinitely many) solutions. These solutions are of the form 


x 2 x 4 £, (6) 


where x‘ is a particular solution of equation (2), and £ is the most general solution of the 
homogeneous system (4). Note the resemblance between equation (6) and the solution of a 
nonhomogeneous linear differential equation. The proofs of some of the preceding statements 
are outlined in Problems 21 through 25. 

The results in the preceding paragraph are important as a means of classifying the solutions 
of linear systems. However, for solving particular systems, it is generally best to use row 
reduction to transform the system into a much simpler one from which the solution(s), if 
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there are any, can be written down easily. To do this efficiently, we can form the augmented 
matrix 


ayy oct Aan | bi 
(Alb) =|: pes (7) 


an1 ttt Am bn 


by adjoining the vector b to the coefficient matrix A as an additional column. The vertical 
line replaces the equals sign and is said to partition the augmented matrix. We now perform 
row operations on the augmented matrix so as to transform A into an upper triangular 
matrix — that is, a matrix whose elements below the main diagonal are all zero. Once this is 
done, itis easy to see whetherthe system has solutions, and to find them if it does. Observe that 
elementary row operations on the augmented matrix (7) correspond to legitimate operations 
on the equations in the system (1). The following examples illustrate the process. 


EXAMPLE 1 


Solve the system of equations 


xı — 2x2 + 3x3 = 7, 
—X| + X2 Mani 2x3 = —5, (8) 
2x1 — X2— Xa -— 4. 


Solution: 


The augmented matrix for the system (8) is 


-1 1 —2 | 5]. (9) 
2 —1 -1 4 


We now perform row operations on the matrix (9) with a view to introducing zeros in the lower 
left part of the matrix. Each step is described and the result recorded below. 


(a) Add the first row to the second row, and add ( —2) times the first row to the third row. 


1 —2 3 T 
0 -l 1 2 
0 3-7 | —10 


(b) Multiply the second row by —1. 


1 —2 3 7 
-1 -2 
3 -7 | —10 


c 


c 


(c) Add ( —3) times the second row to the third row. 


oo 
O m. 
| | 
e me 
| | 
e N 


(d) Divide the third row by —4. 


oo 
O - 
| 
—— 
l 
PN 
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The matrix obtained in this manner corresponds to the system of equations 
xı — 2x5 + 3x3 EE 
X)— x3 = —2, (10) 


X3 — l, 


which is equivalent to the original system (8). Note that the coefficients in equations (10) form a 
triangular matrix. From the last of equations (10) we conclude that x4 = 1, from the second equation 


X9 = —2 + x3 = —1, and from the first equation x, = 7 + 2x? — 3x3 = 2. Thus we obtain 
2 
x—|-1], 
1 


which is the solution of the given system (8). Incidentally, since the solution is unique, we conclude 
that the coefficient matrix is nonsingular. 


EXAMPLE 2 


Discuss solutions of the system 


xı — 2x. + 3x3 = bi, 
=x] + X2 — 2x3 = do, (11) 
2x1 — x2 + 3x4 = b3 
for various values of bi, b», and b3. 


Solution: 


Observe that the coefficients in the system (11) are the same as those in the system (8) except for the 
coefficient of x3 in the third equation. The augmented matrix for the system (11) is 


"EMT Ln (12) 
2 -1 WA 


By performing steps (a), (b), and (c) as in Example 1, we transform the augmented matrix (12) 
into 


poop. ST b 
Ü-. Ub oup | 5-55 (13) 
0 0 0| bi+3b +b 


The equation corresponding to the third row of the matrix (13) is 
bı + 3b; +b; = 0; (14) 


thus the system (11) has no solution unless the condition (14) is satisfied by b1, b2, and b3. It is 
possible to show that this condition is just equation (5) for the system (11). 

Let us now assume that b, = 2, b; = 1, and b} = —5, in which case equation (14) is satisfied. 
Then the first two rows of the matrix (13) correspond to the equations 


X1 — 2x2 + 3x3 — 2, (15) 


X2 — X3 = —3. 
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To solve the system (15), we can choose one of the unknowns arbitrarily and then solve for the other 
two. If we let x3 = a, where a is arbitrary, it then follows that 


xX, = x3 — 3 =Q — 3, 
ti = 2—3 422l 9) a eS oo MA, 


If we write the solution in vector notation, we have 


—a—4 -1 —4 
x= a-3]=a 1] + 4-3]. (16) 
Qa 1 0 


It is easy to verify that the second term on the right-hand side of equation (16) is a solution of the 
nonhomogeneous system (11) and that the first term is the most general solution of the homogeneous 
system corresponding to (11). 


Row reduction is also useful in solving homogeneous systems and systems in which the 
number of equations is different from the number of unknowns. 


Linear Dependence and Independence. A collection of k vectors x), ... , x is said 
to be linearly dependent if there exists a set of real or complex numbers c1, ... , Ck, at least 
one of which is nonzero, such that 


cix P Ter cx P? = 0. (17) 


In other words, x, ... , x“ are linearly dependent if there is a linear relation among them. 
On the other hand, if the only values of the coefficients c1, ... , c; for which equation (17) is 
satisfied are c; = c; = --- = cy = 0, then x 9, ... , x are said to be linearly independent. 

Consider now a collection of n vectors, each of which has n components. Form the n x n 


matrix X by putting the vector x‘/) into column j of X. Thus X = (xij), where xj; = am the 
jth component of the vector x". Also let c = (c j). Then equation (17) can be written as 
(1) (n) 
Xj C cec Xi Ca X111 beet XinCn 
: J= s > | =o 
Pc Tec xe, XniCy tess + XanCn 
or, equivalently, 
Xe = 0. (18) 
If detX + 0, then the only solution of equation (18) is c = 0, but if det X = 0, there are 
nonzero solutions. Thus the set of vectors xU, ... , x? is linearly independent if and only if 
det X + 0. 
1 =2 3 
For instance, the vectors | —1 |, 1 |, and —2 | are linearly independent; see 
2 -1 -1 
-2 3 
Example 1. Similarly, from Example 2, we know the vectors | —1 |. 1 |, and =2 
=] 3 


are linearly dependent. 


EXAMPLE 3 


Determine whether the vectors 


<=) 2h xe-pr. ae 1 (19) 
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are linearly independent or linearly dependent. If they are linearly dependent, find a linear relation 
among them. 


Solution: 


To determine whether x“), x2, and x)? are linearly dependent, we seek constants c1, cz, and c3 
such that 


cix? + cox + eux? = 9. Q0) 


Equation (20) can also be written in the form 
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Now let us extend the concepts of linear dependence and independence to a collection 


of vector functions xU (t), ... , x? (f) defined on an interval a < t < 3. The vectors 
x(t) (C718 1 Tererered 6. [(Nowe521 T12 8826.845 1.1955 0.3085 r 6 9.92)T ET8238.6(on)982T BT 0 Tc /F20 12. 743.293030: 
TT 
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To find the eigenvectors, we return to equation (29) and replace A by each of the eigenvalues 


in turn. For À = 2 we have 
1 -1 Xi 0 
= . (31) 
4 —4/ Nx, 0 


Hence each row of this vector equation leads to the condition x; — x. = 0, so x, and x» are equal 
but their value is not determined. If x, = c, then x; = c also, and the eigenvector x“! is 


x) = (;): c #0. (32) 


Thus, for the eigenvalue A, = 2 there is an infinite family of eigenvectors, indexed by the arbitrary 
constant c. We will choose a single member of this family as a representative of the rest; in this 
example it seems simplest to let c = 1. Then, instead of equation (32) we write 


1 
(1) — 
= B (33) 


and remember that any nonzero multiple of this vector is also an eigenvector. We say that x (P is the 
eigenvector corresponding to the eigenvalue A, = 2. 
Now, setting A = —1 in equation (29), we obtain 


4 —lIYfxi 0 
= . (34) 
4 —l/Nx5 0 
Again, we obtain a single condition on x, and x2, namely, 4x; — x; = 0. Thus the eigenvector 
corresponding to the eigenvalue A, = —1 is 


e - () "m 


or any nonzero multiple of this vector. 


As Example 4 illustrates, eigenvectors are determined only up to an arbitrary nonzero 
multiplicative constant; if this constant is specified in some way, then the eigenvectors are 
said to be normalized. In Example 4, we chose the constant c so that the components of 
the eigenvectors would be small integers. However, any other nonzero choice of c is equally 
valid, although perhaps less convenient. Sometimes it is useful to normalize an eigenvector x 
by choosing the constant so that its length ||x|| = (x, x) 1/2 2 ], 

Since the characteristic equation (27) for an n x n matrix A is a polynomial equation 
of degree n in A, each n x n matrix has n eigenvalues Àj, ... , Àn, some of which may 
be repeated. If a given eigenvalue appears m times as a root of equation (27), then that 
eigenvalue is said to have algebraic multiplicity m. Each eigenvalue has at least one 
associated eigenvector, but can have other linearly independent eigenvectors. If an eigenvalue 
has q linearly independent eigenvectors. we say the eigenvalue has geometric multiplicity q. 
It is possible to show that 


l<q<m. (36) 


That is, the geometric multiplicity never exceeds the algebraic multiplicity. Examples 
demonstrate that g may be any integer in this interval. If each eigenvalue of A is simple (has 
algebraic multiplicity 1), then each eigenvalue also has geometric multiplicity 1. 

It is possible to show that if A; and A» are two eigenvalues of A and if A; z Ao, 
then their corresponding eigenvectors x“) and x?) are linearly independent (Problem 29). 
This result extends to any collection of distinct eigenvalues Aj, ... , Ag: their eigenvectors 
x)... , x are linearly independent. Thus, if each eigenvalue of an n x n matrix is simple, 
then the n eigenvectors of A, one for each eigenvalue, are linearly independent. On the other 
hand, if A has one or more repeated eigenvalues, then there may be fewer than 7 linearly 
independent eigenvectors associated with A, since for a repeated eigenvalue we may have 
q « m. As we will see in Section 7.8, this fact may lead to complications later on in the 
solution of systems of differential equations. 
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EXAMPLE 5 


Find the eigenvalues and eigenvectors of the matrix 


O 1 T 
A=ļ|1 0 IJ. (37) 
1 1 0 


Solution: 


The eigenvalues A and eigenvectors x satisfy the equation (A — AI)x = 0, or 


—A 1 1 X1 0 
1 —A 1 x5] 210]. (38) 
1 1 —A X3 0 


The eigenvalues are the roots of the equation 


—A 1 1 
det(A-AD=| 1 -A 1|2—3 43A 4220. (39) 
1 1 —A 
The roots of equation (39) are A; = 2, Ap = —1, and A4 = —1. Thus 2 is a simple eigenvalue, and 
—] is an eigenvalue of algebraic multiplicity 2, or a double eigenvalue. 
To find the eigenvector x(? corresponding to the eigenvalue \,, we substitute A = 2 in 


equation (38); this gives the system 


-2 1 1\ fm 0 
1-2 i[||xl|-2][o]. (40) 
1 1 -2/ Wm 0 


We can use elementary row operations to reduce this to the equivalent system 


2 -1 -1 xy 0 
0 1 -1 x;|] 210]. (41) 
0 0 0 X3 0 
Solving this system yields the eigenvector 
1 
x9 = |1]. (42) 
1 
For \ = —1, equations (38) reduce immediately to the single equation 
X1 c X2 +x — 0. (43) 
Thus values for two of the quantities x1, x2, x3 can be chosen arbitrarily, and the third is determined 
from equation (43). For example, if x; = cı and x2 = c3, then x3 = —c, — c2. In vector notation we 
have 
Cl 1 0 
x= C2 = C] 0 + C2 1 . (44) 
=p] Cp —1 —1 


For example, by choosing c, = 1 and c; = 0, we obtain the eigenvector 


x? = 0 |. (45) 


Any nonzero multiple of x‘) is also an eigenvector, but a second linearly independent eigenvector 
can be found by making another choice of c, and c; —for instance, c; = 0 and c; = 1. In this case 
we obtain 


x? = 1], (46) 
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which is linearly independent of x(?. Therefore, in this example, two linearly independent 
eigenvectors are associated with the double eigenvalue. 


An important special class of matrices, called self-adjoint or Hermitian matrices, are 
those for which A* = A; that is, aj; = aj;. Hermitian matrices include as a subclass real 
symmetric matrices—that is, matrices that have real elements and for which AT =A. 
The eigenvalues and eigenvectors of Hermitian matrices always have the following useful 
properties: 


1. All eigenvalues are real. 


2. There always exists a full set of n linearly independent eigenvectors, regardless of the 
algebraic multiplicities of the eigenvalues. 


3. If x and x® are eigenvectors that correspond to different eigenvalues, then 
(xD, x2) = 0. Thus, if all eigenvalues are simple, then the associated eigenvectors 
form an orthogonal set of vectors. 

4. Corresponding to an eigenvalue of algebraic multiplicity m, it is possible to choose m 
eigenvectors that are mutually orthogonal. Thus the full set of n eigenvectors can always 
be chosen to be orthogonal as well as linearly independent. 


The proofs of statements 1 and 3 above are outlined in Problems 27 and 28. Example 5 
involves a real symmetric matrix and illustrates properties 1, 2, and 3, but the choice we have 
made for x? and x) does not illustrate property 4. However, it is always possible to choose 
an x) and x) so that (x) , x9?) = 0. For instance, in Example 5 we could have chosen x 


as before and x?) by using c = 1 and c; = —2 in equation (46). In this way we obtain 
1 1 
x?) = 0 * x! 3) = —2 
-1 1 
as the eigenvectors associated with the eigenvalue A = —1. These eigenvectors are orthogonal 


to each other as well as to the eigenvector x‘) that corresponds to the eigenvalue A = 2. 


Problems 


In each of Problems | through 5, either solve the given system of 5. XI — X420 


equations, or else show that there is no solution. 
q : 3x; + X2 + X3 — 0 


1. X1 = x3=0 —X, c X5 c 2x4 —0 
3x1 + x2 + x3 = 1 
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In each of Problems 6 through 9, determine whether the members of 


—x, + x2 + 2x3 = 2 the given set of vectors are linearly independent. If they are linearly 
2 db x =l dependent, find a linear relation among them. In Problems 6 to 9, 
11, and 12, vectors are written as row vectors to save space but may 


2x1 + X2 c X421 


xı — xXx: +2x;=1 vectors may be used instead of the vectors themselves. 
3. xı + 2x3 — X3 = 2 6. x D —(1,1,0), x 2 — (0,1, 1), 
PAM RÉP o) (2) 
7. = (2, 1,0), = (0, 1, 0), 
Xj— x +2x; = -1 x ( 0), x (0, 1, 0) 
4. xı +2x2— x420 8. x) = (1,2, —1,0), x2 49 3 ih, 


2x, + X2 cT X420 


Xi— X24 2x42 0 9. x = (1,2, 2, x? =(3,1,0), 
x? =(2,-1,1), x = (4,3, 2) 


x? = (1,0, 1) 


x? = (—1, 2,0) 


x® =(—1,0,2,2), 3x9 = (3, —1, 1,3) 


be considered as column vectors; that is, the transposes of the given 
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10. Suppose that each of the vectors x, ...,x(” has n 
components, where n « m. Show that x) , x"? are linearly 
dependent. 


In each of Problems 11 and 12, determine whether the members of the 
given set of vectors are linearly independent for —oo < t < oo.lf 
they are linearly dependent, find the linear relation among them. 
HL. x'(0)-(0675,267)- Sate tue 

x® (t) = (3e™, 0) 
12. x? (t) = (2sint, sint), 
13. Let 


x(t) = (sint, 2sint) 


xO) = e xr) = ('). 
te t 


Show that x(t) and x ? (t) are linearly dependent at each point in 
the interval 0 < t < 1. Nevertheless, show that x“! (+) and x) (1) are 
linearly independent on 0 < f < 1. 


In each of Problems 14 through 20, find all eigenvalues and 
eigenvectors of the given matrix. 


14. B E 
3 1 
15. b x 
4 -1 
16. © : 
1 —2 

1 


17. 


18. 


| 

| 
RE 

| 


oO] 0o | r5 


20. 


S| Sei weir 
= 


Problems 21 through 25 deal with the problem of solving Ax = b 
when det A = 0. 


21. a. Suppose that A is a real-valued n x n matrix. Show that 
(Ax, y) = (x, Aly) for any vectors x and y. Hint: You may 
find it simpler to consider first the case n = 2; then extend the 
result to an arbitrary value of n. 

b. If A is not necessarily real, show that (Ax, y) = (x, A*y) for 
any vectors x and y. 

C. If A is Hermitian, show that (Ax, y) = (x, Ay) for any 
vectors x and y. 


22. Suppose that, for a given matrix A, there is a nonzero vector x 
such that Ax — 0. Show that there is also a nonzero vector y such that 
A*y — 0. 

23. Suppose that det A = 0 and that Ax = b has solutions. Show 
that (b, y) = 0, where y is any solution of A*y = 0. Verify that this 
statement is true for the set of equations in Example 2. Hint: Use the 
result of Problem 21b. 


24. Suppose that det A = 0 and that x = x is a solution of Ax = b. 
Show that if € is a solution of A€ = 0 and a is any constant, then 
x = x0 + o£ is also a solution of Ax = b. 


25. Suppose that detA = 0 and that y is a solution of A*y = 0. 
Show that if (b, y) = 0 for every such y, then Ax = b has solutions. 
Note that this is the converse of Problem 23; the form of the solution is 
given by Problem 24. Hint: What does the relation A*y = 0 say about 
the rows of A? It may be helpful to consider the case n = 2 first. 


26. Prove that A = 0 is an eigenvalue of A if and only if A is 
singular. 


27. In this problem we show that the eigenvalues of a Hermitian 
matrix A are real. Let x be an eigenvector corresponding to the 
eigenvalue A. 

a. Show that (Ax, x) — (x, Ax). Hint: See Problem 21c. 

b. Show that A(x, x) = A(x, x). Hint: Recall that Ax = Ax. 

€. Show that \ = A; that is, the eigenvalue A is real. 


28. Show that if A, and > are eigenvalues of a Hermitian matrix A, 
and if A; # Ap, then the corresponding eigenvectors x‘) and x‘?) are 
orthogonal. Hint: Use the results of Problems 21c and 27 to show that 
(Ai = Ag (x), x) = 0. 

29. Show that if A, and A» are eigenvalues of any matrix A, and 
if A, # Ao, then the corresponding eigenvectors x!) and x(? are 
linearly independent. Hint: Start from cx‘? + cx?) = 0; multiply 
by A to obtain c, A x (P +.c)\ 7x = 0. Then show that c, = c; = 0. 
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Linear Equations 


The general theory of a system of n first-order linear equations 


x; = pu(t)xi +--+ + pu(Dxs + g(t), 


(1) 


X = pa (txi dd Panl) Xn git) 


closely parallels that of a single linear equation of nth order. The discussion in this section 
therefore follows the same general lines as that in Sections 3.2 and 4.1. To discuss the 
system (1) most effectively, we write it in matrix notation. That is, we consider 
xX; = xj(f),...,x, = X(t) to be components of a vector x = x(t), similarly, 
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gi(£), ... , &n(t) are components of a vector g(t), and p(t), ... , Pnn(t) are elements of 
ann x n matrix P(t). Equation (1) then takes the form 
x = P(t)x+ g(t). (2) 


The use of vectors and matrices not only saves a great deal of space and facilitates calculations 
but also emphasizes the similarity between systems of differential equations and single (scalar) 
differential equations. 

A vector x = x(f) is said to be a solution of equation (2) if its components satisfy the 
system of equations (1). Throughout this section we assume that P and g are continuous on 
some interval a < t < (3; that is, each of the scalar functions p11, ... , Pans £1, +--+ >» gnis 
continuous there. According to Theorem 7.1.2, this is sufficient to guarantee the existence of 
solutions of equation (2) on the interval o. < t < f. 

It is convenient to consider first the homogeneous equation 


x’ = P(t)x (3) 


obtained from equation (2) by setting g(t) = 0. Just as we have seen for a single linear 
differential equation (of any order), once the homogeneous equation has been solved, there 
are several methods that can be used to solve the nonhomogeneous equation (2); this is taken 
up in Section 7.9. 

We use the notation 


x(t) X(t) 
OG) = iod sux) = a T (4) 
NO NO 


to designate specific solutions of the system (3). Note that x;;(t) = xp (1) refers to the i‘ 
component of the j" solution x (t). The main facts about the structure of solutions of the 
system (3) are stated in Theorems 7.4.1 to 7.4.5. They closely resemble the corresponding 


theorems in Sections 3.2 and 4.1; some of the proofs are left to you as exercises. 


Theorem 7.4.1 | Principle of Superposition 


If the vector functions x and x are solutions of the system (3), then the linear combination 
ci X P + ex) is also a solution for any constants c, and c». 


The principle of superposition is proved simply by differentiating cıx? + cox? 
and using the facts that x“) and x? satisfy equation (3). As an example, it can be verified 


that 
Hoe (i-e meme Qe o 
satisfy the equation 
x = 6 ) X. (6) 
4 1 
Then, according to Theorem 7.4.1, 


1 IX... cie? + coe 
x = cx” (t) box? = c1 (2) e" e (= = (2% - ee! 0 


also satisfies equation (6). 
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By repeated application of Theorem 7.4.1, we can conclude that if x‘, ... , x? are 
solutions of equation (3), then 

x — cix P (1) +--- tex P (1) (8) 

is also a solution for any constants cj, ... , c. Thus every finite linear combination of 


solutions of equation (3) is also a solution. The question that now arises is whether all solutions 
of equation (3) can be found in this way. By analogy with previous cases, it is reasonable to 
expect that for the system (3) of n first-order differential equations it is sufficient to form linear 


combinations of n properly chosen solutions. Therefore, let xU, .. . , x ben solutions of the 
system (3), and consider the matrix X(t) whose columns are the vectors x P (1), ... , x(t): 
X14 (1) = xis) 
X(t) = : l ; (9) 
Xy) D Xnn(t) 


Recall from Section 7.3 that the columns of X(t) are linearly independent for a given value of 
t if and only if det X + 0 for that value of t. This determinant is called the Wronskian of the 


n solutions x P, ... , x? and is also denoted by W [x P, ... , x |; that is, 
W x», oM ei = det X(t). (10) 
The solutions x‘, ...,x‘” are then linearly independent at a point if and only if 
W [x ?, ... ,x™] is not zero there. 
Theorem 7.4.2 
If the vector functions x, ... , x? are linearly independent solutions of the system (3) for each 
point in the interval a < t < 0, then each solution x = x(t) of the system (3) can be expressed as 
a linear combination of x P, ... , x? 
x(t) = ex P(A) E ex P (1) (11) 
in exactly one way. 


Before proving Theorem 7.4.2, note that according to Theorem 7.4.1, all expressions of the 
form (11) are solutions of the system (3), while by Theorem 7.4.2 all solutions of equation (3) 
can be written in the form (11). If the constants c1, ... , c, are thought of as arbitrary, then 
equation (11) includes all solutions of the system (3), and it is customary to call it the general 
solution. Any set of solutions (x VU, ... , x?) of equation (3) that is linearly independent at 
each point in the interval a < t < f is said to be a fundamental set of solutions for that 
interval. 

To prove Theorem 7.4.2, we will show that any solution x(t) of equation (3) can be written 
as x(t) = cix P (1) +--+ + cx? (t) for suitable values of cj, ... , Cn. Let t = tọ be some 
point in the interval a < t < and let y = x(19). We now wish to determine whether there 
is any solution of the form x = cix P (t) + --- + c, x"? (t) that also satisfies the same initial 
condition x(t9) = y. That is, we wish to know whether there are values of cj, ... , c, such that 


cix” (to) ++ + eux" (to) = y, (12) 
or, in scalar form, 
c1Xi (to) +++ + CnXin(to) = yr 


(13) 
CiXni(to) + +++ + CnXnn(to) = Yn- 


The necessary and sufficient condition that equations (13) possess a unique solution 


Ci, ... ,Cn is precisely the nonvanishing of the determinant of coefficients, which is the 
Wronskian W [x 9, m x] evaluated at £ = to. The hypothesis that x‘), ... , x? 
are linearly independent throughout a < t < £ guarantees that W [x U, ... , x] is not 


zero at t = to, and therefore there is a (unique) solution of equation (3) of the form 
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x = cix P (t) +--+ + c, xU (t) that also satisfies the initial condition (12). By the uniqueness 
part of Theorem 7.1.2, this solution is x(t), and hence x(t) = cix P (t) +--+ + c,x? (t), as 
was to be proved. 


Theorem 7.4.3 | AbeUs Theorem 


If xU, ... ,x are solutions of equation (3) on the interval a < t < f, then in this interval 
WI[x( P, ... , x ?] either is identically zero or else never vanishes. 


The significance of Theorem 7.4.3 lies in the fact that it relieves us of the necessity 


of examining W [x P, ... , x] at all points in the interval of interest and enables us to 
determine whether x‘), ... , x? form a fundamental set of solutions merely by evaluating 
their Wronskian at any convenient point in the interval. 
Theorem 7.4.3 is proved by first establishing that the Wronskian of x‘), ... , x? 
satisfies the differential equation (see Problem 8) 
aw 
p (puit) + p) + +++ + Pan(t)) W. (14) 
Hence 
W(t) -cew( [truco ecc pm (ld), (15) 


where c is an arbitrary constant, and the conclusion of the theorem follows immediately. The 
expression for W(f) in equation (15) is known as Abel’s formula; note the similarity of this 
result to Theorem 3.2.7 and especially to equation (23) of Section 3.2. 

Alternatively, Theorem 7.4.3 can be established by showing that if n solutions 


x)... , x? of equation (3) are linearly dependent at one point f = fg, then they must 
be linearly dependent at each point ina < t < Jf (see Problem 14). Consequently, 
if xU, ... ,x” are linearly independent at one point, they must be linearly independent at 


each point in the interval. 
The next theorem states that the system (3) always has at least one fundamental set of 


solutions. 

Theorem 7.4.4 
Let 

1 0 0 

0 1 0 

e= lol, =] 0 em =| id; 

; 0 

0 0 1 
further, let x, ... , x be the solutions of the system (3) that satisfy the initial conditions 

x) (to) =e, 22. , x9 (to) = e, (16) 

respectively, where fọ is any point in a. < t < 0. Then x), ... , x" form a fundamental set of 
solutions of the system (3). 


To prove this theorem, note that the existence and uniqueness of the solutions 
x)... , x mentioned in Theorem 7.4.4 are ensured by Theorem 7.1.2. It is not hard to 
see that the Wronskian of these solutions is equal to 1 when t = tọ; therefore, xU, ... , x? 
are a fundamental set of solutions. 

Once one fundamental set of solutions has been found, other sets can be generated by 
forming (independent) linear combinations of the first set. For theoretical purposes, the set 
given by Theorem 7.4.4 is usually the simplest. 
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Finally, it may happen (just as for second-order linear equations) that a system whose 
coefficients are all real-valued may give rise to solutions that are complex-valued. In this 
case, the following theorem is analogous to Theorem 3.2.6 and enables us to obtain real-valued 
solutions instead. 


Theorem 7.4.5 


Consider the system (3) 
x’ = P(f)x, 
where each element of P is a real-valued continuous function. If x = u(t) +iv(t) isa complex-valued 


solution of equation (3), then its real part u(t) and its imaginary part v(t) are also solutions of this 
equation. 


To prove this result, we substitute u(t) + iv(1) for x in equation (3), thereby obtaining 
x — P(t)x = u(t) — P(t) u(t) +: i(v(t) — P(t)v(t)) = 0. (17) 


We have used the assumption that P(t) is real-valued to separate equation (17) into its real 
and imaginary parts. Since a complex number is zero if and only if its real and imaginary parts 
are both zero, we conclude that u/(t) — P(t) u(t) = 0 and v'(t) — P(t) v(t) = 0. Therefore, 
u(t) and v(t) are solutions of equation (3). 

To summarize the results of this section: 


1. Any set of n linearly independent solutions of the system x’ = P(7)x constitutes a 
fundamental set of solutions. 

2. Under the conditions given in this section, such fundamental sets always exist. 

3. Every solution of the system x’ = P(t)x can be represented as a linear combination of 


any fundamental set of solutions. 


Problems 


In Problems 1 through 6 you are given a homogeneous system of first- 
order linear differential equations and two vector-valued functions, 
x) and x. 
a. Show that the given functions are solutions of the given 
system of differential equations. 
b. Show that x = c;x + cox) is also a solution of the given 
system for any values of c, and c3. 
C. Show that the given functions form a fundamental set of 
solutions of the given system. 
d. Find the solution of the given system that satisfies the initial 
condition x(0) — (1, 2)7. 
e. Find W [x 5, x ?](»). 
f. Show that the Wronskian, W = W [x(9, x], found in e is 
a solution of Abel’s equation: W’ = ( p(t) + p22(t)) W. 


2 d "o 
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x = sint 
2sint — cost 


4 —2 2 2 0 
m .xX(D — (2) = 
4. = 2 a = (3) (1) 


x; x) = n 


—1 1 1 
val) o. (2). .— -1 
JI 36 s -(3) 
,_ [f3 -2 ogy i Naa 1 
6. ied J (t > 0); x = 5) mS E 


7. Prove the generalization of Theorem 7.4.1, as expressed in the 
sentence that includes equation (8), for an arbitrary value of the 
integer k. 


8. Inthis problem we outline a proof of Theorem 7.4.3 in the case 
n = 2. Let x‘ and x") be solutions of equation (3) fora < t < f, 
and let W be the Wronskian of x“ and x“). 


a. Show that 
dx — dx | |x x 
OM l Ge dt | 4 
dt di dx 
(1) (2) 2 aes 
*) do dt dt 


b. Using equation (3), show that 


dw 


E m W. 
P7 (Pii + P22) 


c. Find W(t) by solving the differential equation obtained in 
part b. Use this expression to obtain the conclusion stated in 
Theorem 7.4.3. 

d. Prove Theorem 7.4.3 for an arbitrary value of n by 
generalizing the procedure of parts a, b, and c. 


9. Show that the Wronskians of two fundamental sets of solutions 
of the system (3) can differ at most by a multiplicative constant. 
Hint: Use equation (15). 


10. If x; = y and x; = y’, then the second-order equation 
y" + p(t)y’ +q(thy =0 (18) 
corresponds to the system 
/ 
X1 = %2, 
x, = —q(t)xi — p(t) xo. (19) 


Show that if x‘ and x are a fundamental set of solutions of 
equations (19), and if y/! and y? are a fundamental set of solutions 
of equation (18), then W[y(9, y®] = cW[x,x], where c 
is a nonzero constant. Hint: yr) and y(t) must be linear 
combinations of x,,(t) and xj5(1). 


11. Show that the general solution of x’ = P(t)x + g(t) is the sum 
of any particular solution x‘”) of this equation and the general solution 
x of the corresponding homogeneous equation. 


2 
12. Consider the vectors x(t) = @ and x? (t) = p 


a. Compute the Wronskian of x) and x), 

b. In what intervals are x/! and x? linearly independent? 

c. What conclusion can be drawn about the coefficients in the 
system of homogeneous differential equations satisfied by x‘ 
and x ?? 

d. Find this system of equations and verify the conclusions of 
part c. 
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12 a el 
» and x“ (t) = r , and 


answer the same questions as in Problem 12. 


13. Consider the vectors x(t) = 


Problems 14 and 15 indicate an alternative derivation of 
Theorem 7.4.2. 
14. Let x, ... , x" be solutions of x’ = P(r)x on the interval 


a < t < D. Assume that P is continuous, and let tọ be an arbitrary 
point in the given interval. Show that x“), ... , x"? are linearly 
dependent fora < t < f if (and only if) x P (to), ... , x"? (to) 
are linearly dependent. In other words, x‘, ... , x"? are linearly 
dependent on the interval (a, (3) if they are linearly dependent at any 
point in it. Hint: There are constants c4, ... , Cm that satisfy 


cux P (to) Tec £u x (to) = 0. 


Let z(t) = cix P (t) +--+ + c x"? (t), and use the uniqueness 
theorem to show that z(t) = 0 for each t ina < t < f. 
15. Let xU, ... , x be linearly independent solutions of x’ = 
P(t)x, where P is continuous ona « t « f. 

a. Show that any solution x = z(t) can be written in the form 


zt) = cix (1) b eux Q2) 


for suitable constants cj, ... , c4. Hint: Use the result of Problem 
10 of Section 7.3, and also Problem 14 above. 

b. Show that the expression for the solution z(t) in part a is 
unique; that is, if z(r) = kıx P(t) + --- + k, x(t), then 
ki = C]; s Kn = Cg. 

Hint: Show that (k; — cj) x P (t) +--+ + (kn — en) X(t) = 0 
for each t ina < t < f, and use the linear independence of 
xD, xl), 


75 Homogeneous Linear Systems with 


Constant Coefficients 


We will concentrate most of our attention on systems of homogeneous linear equations with 


constant coefficients — that is, systems of the form 


(1) 


where A is a constant n x n matrix. Unless stated otherwise, we will assume further that all 


the elements of A are real (rather than complex) numbers. 


If 1 — 1, then the system reduces to a single first-order equation 


dx (2) 
— =ax, 
dt 
whose solution is x(t) = ce“’. Note that x = 0 is the only critical point when a # 0. 
If a < O, then all nontrivial solutions approach x(t) = O as t increases, and in this case 


we say that x(t) = 0 is an asymptotically stable equilibrium solution. On the other hand, if 
a > 0, then every solution (except the equilibrium solution x(t) = 0 itself) moves further 
from the equilibrium solution as ¢ increases. Thus, in this case, x(t) = 0 is unstable. 

For systems of n equations, the situation is similar but more complicated. Equilibrium 
solutions are found by solving Ax = 0. We usually assume that detA +4 0, sox = Ois 
the only equilibrium solution. An important question is whether other solutions approach this 
equilibrium solution or depart from it as t increases; in other words, is x = 0 asymptotically 


stable or unstable? Or are there still other possibilities? 
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The case n = 2 is particularly important and lends itself to visualization in the 
x1x2-plane, called the phase plane. By evaluating Ax at a large number of points and plotting 
the resulting vectors, we obtain a direction field of tangent vectors to solutions of the system 
of differential equations. A qualitative understanding of the behavior of solutions can usually 
be gained from a direction field. More precise information results from including in the plot 
some solution curves, or trajectories. A plot that shows a representative sample of trajectories 
for a given system is called a phase portrait. A well-constructed phase portrait provides easily 
understood information about all solutions of a two-dimensional system in a single graphical 
display. Although creating quantitatively accurate phase portraits generally requires computer 
assistance, it is usually possible to sketch qualitatively accurate phase portraits by hand, as we 
demonstrate in Examples 2 and 3 below. 

Our first task, however, is to show how to find solutions of systems such as equation (1). 
We start with a particularly simple example. 


EXAMPLE 1 


Find the general solution of the system 
; 2 0 (3) 
x= x. 
0 —3 


The most important feature of this system is that the off-diagonal entries of the coefficient matrix are 
zero; that is, it is a diagonal matrix. Thus, by writing the system in scalar form, we obtain 


Solution: 


x = 2x1, X) = —3x2. 
Each of these equations involves only one of the unknown variables, so we can solve the two 
equations separately. In this way we find that 


2t 3t 
> 


X| = Ce X2 = Ce , 


where c, and c» are arbitrary constants. Then, by writing the solution in vector form, we have 


ce” e” 0 IY ,, O*\ a 
x= n = C] 0 + C2 e = C] 0 e + C2 1 e . (4) 


Now we define the two solutions x/! and x") so that 


1 0 
(1) — 2t (2) = —3t 
x ^(f) = (5: ; Xt) (1- : (5) 


The Wronskian of these solutions is 
e” 0 


Ü. ques (6) 


Wis? a |) = 


which is never zero. Therefore, x) and x( form a fundamental set of solutions, and the 
general solution of equation (3) is given by equation (4). 


In Example 1 we found two independent solutions of the given system (3) in the form of 
an exponential function multiplied by a vector. This was perhaps to be expected since we have 
found other linear equations with constant coefficients to have exponential solutions, and the 
unknown x in the system (3) is a vector. So let us try to extend this idea to the general system 
(1) by seeking solutions of the form 


x — ĉe", (7) 


where the exponent r and the vector € are to be determined. Substituting from equation (7) for 
x in the system (1) gives 


rée" = A€e™. 
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Upon canceling the nonzero scalar factor e"', we obtain AE = r£, or 
(A-rD£-0, © (8) 


| 
where I is the n x n identity matrix. Thus, to solve the system of differential equations (1), we 


must solve the system of algebraic equations (8). This latter problem is precisely the one that 
determines the eigenvalues and eigenvectors of the matrix |A. Therefore, the vector x given by 
equation (7) is a solution of equation (1), provided that r i$ an eigenvalue and € an associated 
eigenvector of the coefficient matrix A. 

The following two examples are typical of 2 x 2 sys[ems with eigenvalues that are real 
and different. In each example we will solve the system and construct a corresponding phase 
portrait. We will see that solutions have very distinct Epometrical patterns, depending on 
whether the eigenvalues have the same sign or different bjgns. Later in the section we return 
to a further discussion of the general n x n system. 


EXAMPLE 2 


Consider the system 


^fi 1 P 
a 4]* (9) 


Plot a direction field and determine the qualitative behavior of solutions. Then find the general 
solution and draw a phase portrait showing several trajectorjgs. 


Solution: 


The direction field shown in Figure 7.5.1 consists of 441 aifows drawn on the 21 x 21 grid of the 
square —2.5 < x, < 2.5, —2.5 < x < 2.5. (The step se in both x, and x» is 0.25.) At each 
gridpoint (x1, x2), the arrow is drawn in the direction 


Eo 


For example, at ( 1, 0) the direction vector is (1, 4)" and at} 1, —1) it is (—2, —5)7. (These two 
direction vectors appear in green.) All direction vectors areflrawn with the same length, which is 
short enough to keep nearby arrows from crossing. 
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To find solutions explicitly, we assume that x = £e'' and substitute for x in equation (9). We 
are led to the system of algebraic equations 


Gn 59-0 " 


Equations (10) have a nontrivial solution if and only if the determinant of coefficients is zero. Thus 
allowable values of r are found from the equation 


—-(1—-r-42r?-2r -32(r —3)(r 4 1) 20. (11) 


]1—r 1 
4 l-r 


Equation (11) has the roots r; = 3 and r; = —1; these are the eigenvalues of the coefficient matrix 
in equation (9). 
When r — 3, the two equations in system (10) reduce to the single equation 


—X d &2 = 0. (12) 


Thus £5 = 2£,, and the eigenvector corresponding to r; = 3 can be taken as 


(D: 1 
€ ac ; (13) 


Similarly, corresponding to r; = —1, we find that £ = —2£,, so the eigenvector is 


1 
(2) — 
E = (3). (14) 


The corresponding solutions of the differential equation are 


x(t) = (Je x (t) - [e (15) 


The Wronskian of these solutions is 


3t =f 


W lx, x ](1) = | = —4e”, (16) 


2e! —2e 


which is never zero. Hence the solutions x“) and x(? form a fundamental set, and the general solution 
of the system (9) is 


x= cux Pr) + cox (1) 


1 3t 1 —t 
= C] 2 e + C2 E € , (17) 


where c; and c» are arbitrary constants. 
To visualize the solution (17), it is helpful to consider its graph in the x,x»-plane for various 
values of the constants c, and c3. We start with x = c,x (1) or, in scalar form, 


X1 = ce”, X2 = Iger, 


By eliminating t between these two equations, we see that this solution lies on the straight line 
x2 = 2x ,; see Figure 7.5.2a. This is the line through the origin in the direction of the eigenvector 
E(D, If we look at the solution as the trajectory of a moving particle, then the particle is in the first 
quadrant when c, > 0 and in the third quadrant when c, < 0. In either case the particle departs from 
the origin as f increases. 

Next consider x = c;x ) (1), or 


X] = coe !, X? = —2c5e !. 


This solution lies on the line x; = —2x,, whose direction is determined by the eigenvector £ (? . The 
solution is in the fourth quadrant when c; > 0 and in the second quadrant when c; < 0, as shown 
in Figure 7.5.2a. In both cases the particle moves toward the origin as t increases. 
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The general solution (17) is a linear combination of x“ (t) and x) (1). For large t, the term 
ci x Ü (t) is dominant and the term c>x‘”) (t) becomes negligible. Thus all solutions for which c, 4 0 
are asymptotic to the line x. = 2x; as t — oo. Similarly, all solutions for which c; # O0 are 
asymptotic to the line x. = —2x; ast — —oo. Figure 7.5.2(a) is a phase portrait for the system 
(9). The fundamental solutions of x‘) and x? are the solid and dashed black curves, respectively; 
several other trajectories are also displayed. The pattern of trajectories in this figure is typical of all 
2 x 2 systems x’ = Ax for which the eigenvalues are real and of opposite signs. The origin is called 
a saddle point in this case. Saddle points are always unstable because almost all trajectories depart 
from them as f increases. 
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(b) (c) 


AAU: A (a) A phase portrait for the system (9); the origin is a saddle point. (b) Typical 
plots of x, versus t for the system (9). (c) Typical plots of x2 versus ¢ for the system (9). The 
component plots in (b) and (c) are color-coded with their trajectory in (a). The solid black and 
dashed black curves show the fundamental solutions x‘) and x), respectively. The purple curves 
are for the solution that passes through ( —2, —1), red through ( —1, —1), green (0, —1), orange 
(1, — D), blue (3/2, —1), and gold (0, 1). 


In the preceding paragraph, we have described how to draw by hand a qualitatively correct 
sketch of the trajectories of a system such as equation (9), once the eigenvalues and eigenvectors have 
been determined. However, to produce a detailed and accurate drawing, such as Figure 7.5.2(a) and 
other figures that appear later in this chapter, a computer is extremely helpful, if not indispensable. 

As an alternative to Figure 7.5.2(a), you can also plot x, or x2 as a function of t; some typical 
plots of x, versus t are shown in Figure 7.5.2(b), and those of x2 versus t are shown in Figure 
7.5.2c. For certain initial conditions it follows that c; = 0 in equation (17) so that x, = c2e™ and 
xı — Oas f — oo. One such graph is shown in Figure 7.5.2(b), corresponding to a trajectory that 
approaches the origin in Figure 7.5.2(a). For most initial conditions, however, c; ZZ 0 and x, is 
given by x, = cie? + cye~'. Then the presence of the positive exponential term causes x, to grow 
exponentially in magnitude as f increases. Several graphs of this type are shown in Figure 7.5.2(b), 
corresponding to trajectories that depart from the neighborhood of the origin in Figure 7.5.2(a). It is 
important to understand the relation between the direction field in (a) and the component plots in (b) 
and (c) of Figure 7.5.2 and other similar figures that appear later, since you may want to visualize 
solutions either in the x;x»-plane or as functions of the independent variable t. 
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EXAMPLE 3 


Consider the system 


x - & IB (18) 


Draw a direction field for this system and find its general solution. Then plot a phase portrait showing 
several typical trajectories in the phase plane. 


Solution: 


The direction field for the system (18) in Figure 7.5.3 shows clearly that all solutions approach the 
origin. 
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Direction field for the system (18). 


To find the solutions, we assume that x = £e''; then we obtain the algebraic system 
—3-r y Ei 0 
= : (19) 
y2 -2-r] N& 0 


The eigenvalues satisfy 


(-3—-r)(-2-r) -22 r? E 5r c 4A 5 (r - D)(r 4-4) — 0, (20) 
sor; = —l and r = —4. For r = —1, equation (19) becomes 
-2 2YfüN (0 ai 
f2 -1J \& 0} 
Hence £; = 2E 1 and the eigenvector € ‘” corresponding to the eigenvalue r; = —1 can be taken 
as 
E = : . (22) 
A 
Similarly, corresponding to the eigenvalue r; = —4, we have €; = — v £5, so the eigenvector is 


g”? = [x Q3) 


1 


Thus a fundamental set of solutions of the system (18) is 


1 = 
x Pr) = (s) x(t) = ( eeu (24) 
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and the general solution is 


1 = 
x 2 cux P (0) tox” =c e+e, v2 gs. (25) 
JA 1 


A phase portrait for the system (18) shown in Figure 7.5.4(a). The trajectory corresponding to 
the solution x(t) is the solid black curve; it approaches the origin along the line x; = v2. 
The dashed black curve is the trajectory for the solution x(? (1); it approaches the origin along the 
line x; = -,/2 x2. The slopes of these lines are determined by the eigenvectors €“!) and €), 
respectively. In general, we have a combination of these two fundamental solutions. As f — oo, 
because e^ is much smaller than e^, the solution x ? (1) is negligible compared to x (t). Thus, 
unless c; = 0, the solution (25) approaches the origin tangent to the line x. = 2x1. The pattern 
of trajectories shown in Figure 7.5.4(a) is typical of all 2 x 2 systems x’ = Ax for which the 
eigenvalues are real, different, and of the same sign. The origin is called a node for such a system. If 
the eigenvalues were positive rather than negative, then the trajectories would be similar but traversed 
in the outward direction. Nodes are asymptotically stable if the eigenvalues are negative and unstable 
if the eigenvalues are positive. 
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(a) 


(b) (c) 


(a) A phase portrait for the system (18); the origin is an asymptotically stable 
node. (b) Typical plots of x; versus ¢ for the system (18). (c) Typical plots of x» versus t for the 
system (18). The component plots in (b) and (c) are color-coded to their trajectories in (a). The 
solid black and dashed black curves show the fundamental solutions x and x‘) , respectively. 
The purple curves are for the solution that passes through ( —2, —2), red through (0, —2), green 
(2, —2), orange ( —2, 2), blue (0, 2), and gold (2, 2). 


Although Figure 7.5.4(a) was computer-generated, a qualitatively correct sketch of the 
trajectories can be drawn quickly by hand on the basis of a knowledge of the eigenvalues and 
eigenvectors. 

Some typical plots of x, versus t are shown in Figure 7.5.4(b); the corresponding x; versus 
t plots appear in Figure 7.5.4(c). Observe that each of the coordinate graphs in Figures 7.5.4(b) 
and 7.5.4(c) approaches the t-axis asymptotically as t increases, corresponding to a trajectory that 
approaches the origin in Figure 7.5.2(a). 
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Examples 2 and 3 illustrate the two main cases for 2 x 2 systems having eigenvalues that 
are real and different. The eigenvalues have either opposite signs (Example 2) or the same sign 
(Example 3). The other possibility is that zero is an eigenvalue, but in this case it follows that 
det A = 0, which violates the assumption made at the beginning of this section. However, see 
Problems 5 and 6. 

Returning to the general n x n system (1), we proceed as in the examples. To find solutions 
of the differential equation (1), we must find the eigenvalues and eigenvectors of A from the 
associated algebraic system (8). The eigenvalues r;, ... , 7, (which need not all be different) 
are roots of the n" degree polynomial equation 


det(A — rI) = 0. (26) 


The nature of the eigenvalues and the corresponding eigenvectors determines the nature of the 
general solution of the system (1). If we assume that A is a real-valued matrix, then we must 
consider the following possibilities for the eigenvalues of A: 


1. All eigenvalues are real and different from each other. 
2. Some eigenvalues occur in complex conjugate pairs. 
3. Some eigenvalues, either real or complex, are repeated. 


If the n eigenvalues are all real and different, as in the three preceding examples, then 
each eigenvalue has algebraic and geometric multiplicity one. Thus, associated with each 
eigenvalue r; is a real eigenvector £ ?, and the n eigenvectors £ 0, ... , £? are linearly 
independent. The corresponding solutions of the differential system (1) are 


xO Ct) 2 £O gni, Ld xM rt) = go enit (27) 
To show that these solutions form a fundamental set, we evaluate their Wronskian: 


(D ont (1) pnt (1) (n) 
£i e T i e 1 e 1 


We cage” (Dex : "E eee : |. Q8 


(1) x (n) r (1) (n) 
En eh aa En eh M Vas ^ 


First, we observe that the exponential function is never zero. Next, since the eigenvectors 
£O, ... , £7 are linearly independent, the determinant in the last term of equation (28) 
is nonzero. As a consequence, the Wronskian W [x 9, hu x? | (t) is never zero; hence 
x)... , x? form a fundamental set of solutions. Thus the general solution of equation 


(1)is 


x= cy€ Dent JE eid ae c,& Me™, (29) 


If A is real and symmetric (a special case of Hermitian matrices), recall from Section 7.3 
that all the eigenvalues r1, ... , rn must be real. Further, even if some of the eigenvalues 
are repeated, there is always a full set of n eigenvectors £ ?, ... ,€ ” that are linearly 
independent (in fact, orthogonal). Hence the corresponding solutions of the differential system 
(1) given by equation (27) again form a fundamental set of solutions, and the general solution 
is again given by equation (29). The following example illustrates this case. 


EXAMPLE 4 


Find the general solution of 


x=[1 0 1x. (30) 


7.5 Homogeneous Linear Systems with Constant Coefficients 


Solution: 


Observe that the coefficient matrix is real and symmetric. The eigenvalues and eigenvectors of this 
matrix were found in Example 5 of Section 7.3: 


nez EV =) 1), Q1) 
1 
1 0 
m=-l n=-l; ¿g7 = 0], p = ME (32) 
-1 -1 


Hence a fundamental set of solutions of equation (30) is 


1 1 0 
x Ge Tp caue Ole", ae] 1 ie, (33) 
1 —1 —1 
and the general solution is 
1 1 0 
x—-cec|1]eé'x-c| O}et+e3{ 1]e*. (34) 
1 —1 -1 
This example illustrates the fact that even though an eigenvalue (r = —1) has algebraic multiplicity 


2, it may still be possible to find two linearly independent eigenvectors £(? and ‘° and, as a 
consequence, to construct the general solution (34). 

The behavior of the solution (34) depends critically on the initial conditions. For large t, the first 
term on the right-hand side of equation (34), because of its positive exponential, is the dominant one; 
therefore, if c; 4 0, all components of x become unbounded as t — oo. On the other hand, for certain 
initial points c; will be zero. In these cases, the solution involves only the negative exponential terms, 
and x — (0 ast — oo. The initial points that cause c, to be zero are precisely those that lie in the 


plane determined by the eigenvectors £? and £C? corresponding to the two negative eigenvalues. 
Thus solutions that start in this plane approach the origin as t — oo, while all other solutions become 
unbounded. 


If some of the eigenvalues occur in complex conjugate pairs, then there are still n linearly 
independent solutions of the form (27), provided that all the eigenvalues are different. Of 
course, the solutions arising from complex eigenvalues are complex-valued. However, as in 
Section 3.3, it is possible to obtain a full set of real-valued solutions. This is discussed in 
Section 7.6. 

More serious difficulties can occur if an eigenvalue is repeated. In this event the 
number of corresponding linearly independent eigenvectors may be smaller than the algebraic 
multiplicity of the eigenvalue. If so, the number of linearly independent solutions of the 
form £e" will be smaller than n. To construct a fundamental set of solutions, it is then 
necessary to seek additional solutions of another form. The situation is somewhat analogous 
to that for an n'" order linear equation with constant coefficients; a repeated root of the 
characteristic equation gave rise to solutions of the form e"', te”, 12e", ... . The case of 
repeated eigenvalues is treated in Section 7.8. 

Finally, if A is complex, then complex eigenvalues need not occur in conjugate pairs, and 
the eigenvectors are normally complex-valued even though the associated eigenvalue may be 
real-valued. The solutions of the differential equation (1) are still of the form (27), provided 
that there are n linearly independent eigenvectors, but in general all the solutions are complex- 
valued. 
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Problems 


In each of Problems | through 4: 
@ a. Draw a direction field. 
b. Find the general solution of the given system of equations and 
describe the behavior of the solution as tf — oo. 
Q c. Plot a few trajectories of the system. 


1. xo i^ 
2 -2 


I 
v 
Il 
Js] O2 | CA 


In each of Problems 5 and 6 the coefficient matrix has a zero 
eigenvalue. As a result, the pattern of trajectories is different from 
those in the examples in the text. For each system: 

Q a. Draw a direction field. 

b. Find the general solution of the given system of equations. 

Q c. Draw a few of the trajectories. 


5. x'— f s 
8 —6 
3 6 
6. x= (3 i 


In each of Problems 7 through 9, find the general solution of the given 
system of equations. 


1 1 2 
7. x «|1 2 14x 
2 1 1 
3 2 4 
8. x 2|[2 0 2)x 
4 2 3 
1 -l 4 
9, x'— [3 2 —l[x 
2 1 -l 


In each of Problems 10 through 12, solve the given initial value 
problem. Describe the behavior of the solution as t — oo. 


10. x'— f B x(0) — (2) 
1 0) = 1 
4 x, x(0) = 3 


0 0 -1 7 
12. x= 2 0 O|x x02|5 
-1 2 4 3 

13. The system tx’ = Ax is analogous to the second-order Euler 


equation (Section 5.4). Assuming that x = t”, where € is a constant 
vector, show that £ and r must satisfy (A —rI) £ = 0 in order to obtain 
nontrivial solutions of the given differential equation. 


Referring to Problem 13, solve the given system of equations in each 
of Problems 14 through 16. Assume that t > 0. 


14. tx’ = E 3^ 
3 -2 

15. x= E D. 
3 1 

16. tx = E) 
8 —6 


In each of Problems 17 through 19, the eigenvalues and eigenvectors 
of a matrix A are given. Consider the corresponding system x’ = Ax. 
Q a. Sketch a phase portrait of the system. 
Q b. Sketch the trajectory passing through the initial point 
(2, 3). 
Q c. For the trajectory in part b, sketch the graphs of x, versus 
t and of x> versus t. 


-L ¿0 = ( D r--2, EP = G) 
| g0 = ( J n=-2, 9 = C) 
19. 5-4, ES B n=2, EV = ( ) 

2 -2 


20. Consider a 2 x 2 system x’ = Ax. If we assume that r; Æ r2, 
the general solution is x = c, £ P e"! + c£ P e"?', provided that £ (P 
and £ Ó) are linearly independent. In this problem we establish the 
linear independence of £ (! and € ® by assuming that they are linearly 
dependent and then showing that this leads to a contradiction. 
a. Explain how we know that £ (U satisfies the matrix equation 
(A — riD£(O) = 0; similarly, explain why (A — r;D £(? = 0. 
b. Show that (A — r5D£(? = (rj — r)£&(V. 
c. Suppose that £(? and £(? are linearly dependent. Then 
ci £C? + o€ = 0 and at least one of c, and c; (say, c4) is 
not zero. Show that (A — r;D (ci £ + c5£ 0?) = 0, and also 
show that (A — r;D (cj) +e P) = c(r; —r;) é . Hence 
cı = 0, which is a contradiction. Therefore, € and £ are 
linearly independent. 
d. Modify the argument of part c if we assume that c; Æ 0. 
e. Carry outa similar argument for the case A is 3 x 3; note that 
the procedure can be extended to an arbitrary value of n. 


17. ri 


18. F1 


21. Consider the equation 


ay" + by' + cy =0, (35) 


where a, b, and c are constants with a + 0. In Chapter 3 it was shown 
that the general solution depended on the roots of the characteristic 
equation 


ar? +br+c=0. (36) 


a. Transform equation (35) into a system of first-order equations 
by letting x, = y, x2 = y’. Find the system of equations x’ = Ax 
xl 

X2 

b. Find the equation that determines the eigenvalues of the 
coefficient matrix A in part a. Note that this equation is just the 
characteristic equation (36) of equation (35). 


satisfied by x = 


22. The two-tank system of Problem 19 in Section 7.1 leads to the 
initial value problem 


1 3 
[m ~ 10 40 m -17 
x= 1 1> x(0) = e 
10 5 


where x, and x» are the deviations of the salt levels Q; and Q, from 
their respective equilibria. 
a. Find the solution of the given initial value problem. 
Q b. Plot x, versus t and x» versus ¢ on the same set of axes. 
Q c. Find the smallest time T such that |xj(7)| < 0.5 and 
|xo(t)| < 0.5 for allt > T. 


23. Consider the system 


1 
a. Solve the system fora = 3 What are the eigenvalues of the 


coefficient matrix? Classify the equilibrium point at the origin as 
to type. 

b. Solve the system for a = 2. What are the eigenvalues of the 
coefficient matrix? Classify the equilibrium point at the origin as 
to type. 

C. In parts a and b, solutions of the system exhibit two 
quite different types of behavior. Find the eigenvalues of the 
coefficient matrix in terms of o, and determine the value of a 


1 m i 
between — and 2 where the transition from one type of behavior 


to the other occurs. This value of a is called a bifurcation value 
for this problem. 
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Electric Circuits. Problems 24 and 25 are concerned with the electric 
circuit described by the system of differential equations in Problem 18 
of Section 7.1: 


Ry 1 
ct d = 5 : 1(0) = Ip, V(0) = V 
dV) | 1 1 2 LEER dis 
C CR, 
(37) 
24. a. Find the general solution of equation (37) if Ri = 1 Q, 
2 
Ry = S0.L-2HamdC- 3P 


b. Show that /(t) — O0 and V(t) — 0 as t — œ, regardless 
of the initial values Jy and Vo. 


25. Consider the preceding system of differential equations (37). 
a. Find a condition on A, R2, C, and L that must be satisfied 
if the eigenvalues of the coefficient matrix are to be real and 
different. 

b. If the condition found in part a is satisfied, show that both 
eigenvalues are negative. Then show that both Z(t) — 0 and 
V(t) — Oast — oco, regardless of the initial conditions. 

C. If the condition found in part a is not satisfied, then the 
eigenvalues are either complex or repeated. Do you think that 
I(t) — Oand V(t) — Oast — oo in these cases as well? 
Hint: In part c, one approach is to change the system (37) into 
a single second-order equation. We also discuss complex and 
repeated eigenvalues in Sections 7.6 and 7.8. 
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In this section we consider again a system of n linear homogeneous equations with constant 


coefficients 


(1) 


where the coefficient matrix A is real-valued. If we seek solutions of the form x = €e”, then 
it follows, as in Section 7.5, that r must be an eigenvalue and £ a corresponding eigenvector 


of the coefficient matrix A. Recall that the eigenvalues r1, ... 


characteristic equation 


det(A — rD =0 


, 4 of A are the roots of the 


(2) 


and that the corresponding eigenvectors are nonzero vectors that satisfy 


(A —rD£ — 0. 


(3) 


If A is real-valued, then the coefficients in the polynomial equation (2) for r are real- 
valued, and any complex-valued eigenvalues must occur in conjugate pairs. For example, if 
rı = À-c ip, where À and p are real, is an eigenvalue of A, then so is r2 = A — iji. To explore 
the effect of complex-valued eigenvalues, we begin with an example. 
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EXAMPLE 1 


Find a fundamental set of real-valued solutions of the system 


1 
2 d 
x-| ? | |x (4) 
= Mes 
2 


Plot a phase portrait and graphs of components of typical solutions. 


Solution: 


A direction field for the system (4) is shown in Figure 7.6.1. This plot suggests that the trajectories 
in the phase plane spiral clockwise toward the origin. 


Xo Å 
——————————MÀMMNNNNNNN 
—€——————— — ——MMFONNNNNNNN 
———————— RN NNN NNN LLY 
FO SANNA LLL LS 
FAFA Fee aN NNN NAY \ ND UNS 
FAA Ae NSNN ALAN YE EE | 
PAAZ ZS a a O E A S VV EL lA 
AP BP PA PPPS KAN V NOE o oW qj 
LAF PL PAPA BNA AE IX. gg 
AACA Cae ST STRE RN. 
AM RA) MM M HEHTLAEH M 
APPL PEA tt Nees FL SS ALL 
ALEEPPPANN Ree eR er SILLS IS 
LEELEE NNN eee wn ee uuu 
fh PLAN N NN NS Hee ant at att P P ut 
ES NNNM MR——————wwx 
V NN NN NNNM OOo oem nm nn 
UEAN NNN MIR AR em nm nm nme 


A direction field for the system (4). 


To find a fundamental set of solutions, we assume that 
x = £e" (5) 


and obtain the set of linear algebraic equations 


1 
og f 
2 fi \ (0 
i b os (&) - (9) ue 
2 


for the eigenvalues and eigenvectors of A. The characteristic equation is 


1 
——-—r 1 
5 
2 , Jere =o; (7) 
-1 —- -r 
2 
therefore, the eigenvalues are r; = -} +i andr = -i — i. From equation (6), a straightforward 


calculation shows that the corresponding eigenvectors are 


gO = (;). eO = (2). (8) 


Observe that the eigenvectors €“!) and € are also complex conjugates. Hence a fundamental set 
of solutions of the system (4) is 


1 ; 1 ; 
xDe) = (i) eID PD) = i (9) 


To obtain a set of real-valued solutions, we can (by Theorem 7.4.5) choose the real and 
imaginary parts of either x‘) or x). In fact, 


o) lY 22 E e/2cost\ | fe"? sint 
x (t)2l|.]e (cost +isint) = J2 +i E : (10) 
i —e sint e cost 


Hence a pair of real-valued solutions of equation (4) is 


u(t) se 2 v(t) oer): (11) 
— sint cost 


To verify that u(t) and v(t) are linearly independent, we compute their Wronskian: 


e^? cost e? sint 2,2 3 m 
W[u, v](t) = E 5 =e (cost + sint) =e`. 
—e7t? sint e™/ 2 cost 


The Wronskian W[u, v](t) is never zero, so it follows that u(t) and v(t) constitute a fundamental 
set of (real-valued) solutions of the system (4). 


The graphs of the solutions u(t) and v(t) are shown as the solid and dashed black curves in 
Figure 7.6.2(a), a phase portrait for the system (4). Since 


o-( o = (° 
u(0) = oJ v(0) — i} 


the graphs of u(t) and v(t) pass through the points (1, 0) and (0, 1), respectively. Other solutions 
of the system (4) are linear combinations of u(t) and v(t), and graphs of a few of these solutions are 
also shown in Figure 7.6.2(a). Each trajectory approaches the origin along a spiral path as t — oo, 
making infinitely many circuits about the origin; this is due to the fact that the solutions (11) are 
products of decaying exponential and sine or cosine factors. Some typical graphs of x, versus t are 
shown in Figure 7.6.2(b); each one represents a decaying oscillation in time. The corresponding 
graphs of x2 versus t are shown in Figure 7.6.2(c). 
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x14 XA 
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(a) A phase portrait for the system (4); the origin is a spiral point. (5) 
Plots of x, versus t for the system (4). (c) Plots of x» versus t for the system (4). The 
component plots in (b) and (c) are color-coded to their trajectories in (a). The solid 
black curves are for the solution u(t) that passes through (1, 0), the dashed black curves 
are v(t) that passes through (0.1), the purple curves pass through (1, 1), red through 
(—1, 1), green through ( — 1, —1), and orange through (1, — 1). 


Figure 7.6.2(a) is typical of all 2 x 2 systems x' — Ax whose eigenvalues are complex 
with negative real part. The origin is called a spiral point and is asymptotically stable because all 
trajectories approach it as t increases. For a system whose eigenvalues have a positive real part, the 
trajectories are similar to those in Figure 7.6.2(a), but the direction of motion is away from the origin, 
and the trajectories become unbounded. In this case, the origin is unstable. 
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If the real part of the eigenvalues is zero, then the trajectories neither approach the origin nor 
become unbounded, but instead repeatedly traverse a closed curve about the origin. Examples of 
this behavior can be seen in Figures 7.6.3(b) and 7.6.4(b) below. In this case, the origin is called 
a center and is said to be stable, but not asymptotically stable. In all three cases, the direction of 
motion may be either clockwise, as in this example, or counterclockwise, depending on the elements 
of the coefficient matrix A. 

The phase portrait in Figure 7.6.2(a) was drawn by a computer, but it is possible to produce 
a useful sketch of the phase portrait by hand. We have noted that when the eigenvalues A + iy 
are complex, then the trajectories either spiral in (A < 0), spiral out (A > 0), or repeatedly 
traverse a closed curve (A = 0). To determine whether the direction of motion is clockwise or 
counterclockwise, we only need to determine the direction of motion at a single convenient point. 


For instance, in the system (4) we might choose x = (0, 1)7. Then Ax = | 1, — E Thus at the 


point (0, 1) in the phase plane, the tangent vector x’ to the trajectory at that point has a positive x- 
component and therefore is directed from the second quadrant into the first. The direction of motion 
is thus clockwise for each trajectory of this system. 


Returning to the general system of linear differential equations (1) 


we can proceed just as in the example. Suppose that there is a pair of complex conjugate 
eigenvalues, rı = A + iu and r2 = A — ip. Then the corresponding eigenvectors €‘) and 
£ ) are also complex conjugates. To see that this is so, recall that r; and £ (! satisfy 


(A — rD£(? =0. (12) 


On taking the complex conjugate of this equation and noting that A and I are real-valued, we 
obtain 


(A— ruD)£? 5 (A - ri D£O — 0, (13) 


where 7; and €» are the complex conjugates of r; and (P, respectively. In other words, 


rə = T; is also an eigenvalue, and £ ? = £(" is a corresponding eigenvector. The corres- 
ponding solutions 


xD) = Ee, ati egg" (14) 


of the differential equation (1) are then complex conjugates of each other. Therefore, as 
in Example 1, we can find two real-valued solutions of equation (1) corresponding to the 
eigenvalues r; and r by taking the real and imaginary parts of x‘) (t) or x? (t) given by 
equation (14). 

Let us write £ (D = a + ib, where a and b are real; then we have 


x(t) = (a + ib)e^*'?! = (a + ib)e*(cos( pt) + i sin(ut)). (15) 


Upon separating x! (1) into its real and imaginary parts, we obtain 
x(t) = e(acos( ut) — bsin(jt)) + ie" (asin(ut) + bcos(pt)). (16) 
If we write x‘ (t) = u(t) + iv(t), then the vectors 


u(t) = e"(acos(ut) — bsin(gt)), 
(17) 
v(t) = e" (asin(yut) + bcos(qt)) 


are real-valued solutions of equation (1). It is possible to show that u and v are linearly 
independent solutions (see Problem 22). 

For example, suppose that the matrix A has two complex eigenvalues rj = A + ip, 
r2 = À —ip, and that rs, ... , r, are all real and distinct. Let the corresponding eigenvectors 


be £(? = a+ ib, €% = a— ib, €%,...,€™. Then the general solution of equation 
(iis 


X= cqu(1) + V(t) - e£ Ve $+ + c EU gt, (18) 


where u(t) and v(t) are given by equations (17). We emphasize that this analysis applies 
only if the coefficient matrix A in equation (1) is real, for it is only then that complex-valued 
eigenvalues and eigenvectors must occur in conjugate pairs. 

For 2 x 2 systems with real-valued coefficients, we have now completed our description 
of the three main cases that can occur. The three main cases are: 


1. Eigenvalues are real and have opposite signs; x = 0 is a saddle point. 
2. Eigenvalues are real and have the same sign but are unequal; x = 0 is a node. 
3. Eigenvalues are complex with nonzero real part; x = 0 is a spiral point. 


Other possibilities occur as transitions between two of the cases just listed and are less 
likely to be encountered in real-world applications. For example, a zero eigenvalue occurs 
during the transition between a saddle point and a node. Purely imaginary eigenvalues occur 
during a transition between asymptotically stable and unstable spiral points. Finally, real and 
equal eigenvalues appear during the transition between nodes and spiral points. 


EXAMPLE 2 


The system 


2 Q 2 19 
x*=\-2 0J" (19) 


contains a parameter a. Describe how the solutions depend qualitatively on a; in particular, find the 
bifurcation values of a, that is, the values of a where the qualitative behavior of the trajectories in 
the phase plane changes markedly. 


Solution: 


The behavior of the trajectories is controlled by the eigenvalues of the coefficient matrix. The 
characteristic equation is 


r? — ar --4 — 0, (20) 
so the eigenvalues are 
a Xt /a?— 16 
a 3 Q1) 


From equation (21) it follows that the eigenvalues are complex conjugates for —4 < a < 4 and are 
real otherwise. 

Two bifurcation values are a = —4 and a = 4, where the eigenvalues change from real-valued 
to complex-valued, or vice versa. For a < —4 both eigenvalues are negative, so all trajectories 
approach the origin, which is an asymptotically stable node. Fora > 4 both eigenvalues are positive, 
so the origin is again a node, this time unstable; all trajectories (except x = 0) become unbounded. 
In the intermediate range, —4 < a < 4, the eigenvalues are complex and the trajectories are spirals. 
However, for —4 < a < Othe real part of the eigenvalues is negative, the spirals are directed inward, 
and the origin is asymptotically stable, whereas for 0 < a < 4 the real part of the eigenvalues is 
positive and the origin is unstable. 

A third bifurcation value is œ = 0, where the direction of the spirals changes from inward 
to outward. When œa = O the origin is a center and the trajectories are closed curves about 
the origin, corresponding to solutions that are periodic in time. The other bifurcation values, 
a = +4, yield eigenvalues that are real and equal. In this case the origin is again a node, 
but the phase portrait differs somewhat from those in Section 7.5. We take up this case in 
Section 7.8. 


A Multiple Spring-Mass System. Consider the system of two masses and three springs 
shown in Figure 7.1.1, whose equations of motion are given by equations (1) in Section 7.1. 
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If we assume that there are no external forces, then Fi(t) = 0, F(t) = 0, and the resulting 
equations are 


d?x, 
ms —(K, + k3) x1 + k2x2, 
(22) 
d?xj 
ire ME Kx, — (Ko + k3) x2. 


These equations can be solved as a system of two second-order equations (see Problem 24), 
but, as is consistent with our approach in this chapter, we will transform them into a system 
of four first-order equations. Let y; = x1, yo = x», y3 = xj, and y4 = x5. Then 


Yi =¥3, Yo = Vas (23) 
and, from equations (22), 
my, = —(ki +ko) yi +hoy2, may; = kayı — (ko + k3) y2. (24) 


The following example deals with a particular case of this two-mass, three-spring system. 


EXAMPLE 3 


Suppose that m, = 2, m; = 9/4, kı = 1, ky = 3, and k3 = 15/4 in equations (23) and (24) so that 


these equations become 


4 1 / 3 / 4 
Y= y3 y= Ya, 3 =—2yı + Qj» *»-3n- 3y». (25) 
Analyze the possible motions described by equations (25), and draw graphs showing typical 
behavior. 
Solution: 


Let y = (yy, y. Ya. y4)” . We can write the system (25) in matrix form as 


0 0 1 0 
0 0 0 1 
y- zu ; 0 0ol|Y— Ay. (26) 
4 
3 —3 0 0 


Keep in mind that y; and y» are the positions of the two masses, relative to their equilibrium positions, 
and that y3 and y4 are their velocities. We assume, as usual, that y = £e'', where r must be an 
eigenvalue of the matrix A and € a corresponding eigenvector. It is possible, though a bit tedious, 
to find the eigenvalues and eigenvectors of A by hand, but it is easy with appropriate computational 
resources. The characteristic polynomial of A is 


r^ 562 4 — (a? +1)(r? +4), (27) 
so the eigenvalues are r} = i, r2 = —i, r3 = 2i, and ry = —2i. The corresponding eigenvectors 
are 

3 3 3 3 
2 —4 —4 
dQ) D 3) _ (4) 28 
£ ae 4 ale @ HEE i Q8) 
2i —2i —8i 8i 


The complex-valued solutions £ Ue! and £(? e^" are complex conjugates, so two real-valued 
solutions can be determined by finding the real and imaginary parts of either of them. For instance, 


we have 


3 
Cl) it 2 roi 
Ee" = 3i (cost + i sint) 
i 
2i 
3cost 3 sint 
2 cost .| 2sint (1) . (1) 
= . +i —u '(t)-iv (t). (29) 
—3 sint 3 cost 
—2 sint 2cost 
In a similar way, we obtain 
3 
EO e2it a (cos(2r) + i sin(21)) 
i 
—8i 
3 cos(2r) 3 sin(2r) 
—4cos(2r) .| —4sin(2r) D 2 
esinta | | gogn | ole eM 
8sin(2t) —8 cos(2t) 


We leave it to you to verify that u P , v» , u(? , and v) are linearly independent and therefore form 
a fundamental set of solutions. Thus the general solution of equation (26) is 


3cost 3sint 3cos(2t) 3sin(2t) 
2cost 2 sint —4cos(2t) —4 sin(2r) 
y-—c, : + C2 +3 N + C4 š (31) 
—3 sint 3 cost —6sin(2t) 6cos(2t) 
—2 sint 2 cost 8 sin(2r) —8 cos(2t) 


where c1, c5, C3, and c4 are arbitrary constants. 

The phase space for this system is four-dimensional, and each solution, obtained by a particular 
set of values for c1, C2, c3, and c4 in equation (31), corresponds to a trajectory in this space. Since 
each solution, given by equation (31), is periodic with period 27 , each trajectory is a closed curve. 
No matter where the trajectory starts at £ = 0, it returns to that point at t = 27, — 47, and so 
forth, repeatedly traversing the same curve in each time interval of length 27. We do not attempt 
to show any of these four-dimensional trajectories here. Instead, in the figures below we show 
projections of certain trajectories in the y; y3- or y2 y4-plane, thereby showing the motion of each mass 
separately. 

The first two terms on the right-hand side of equation (31) describe motions with frequency 1 


and period 27. Note that y; = 3 yı in these terms and that y4 = 3 ys. This means that the two masses 


move back and forth together, always going in the same direction, but with the second mass moving 
only two-thirds as far, and two-thirds as fast, as the first mass. If we focus on the solution u” (1) 
and plot y; versus f and y versus t on the same axes, we obtain the cosine graphs of amplitude 3 and 
2, respectively, shown in Figure 7.6.3(a). The trajectory of the first mass in the y; y3-plane lies on 
the circle of radius 3 shown in Figure 7.6.3(b), traversed clockwise starting at the point (3, 0) and 
completing a circuit in time 27. Also shown in this figure is the trajectory of the second mass in 
the yz y4-plane, which lies on the circle of radius 2, also traversed clockwise starting at (2, 0) and 
also completing a circuit in time 27. The origin is a center in the respective y; y3- and y; y4-planes. 
Similar graphs (with an appropriate shift in time) are obtained from v‘? or from a linear combination 
of u(U and vy), 

The remaining terms on the right-hand side of equation (31) describe motions with frequency 


4 
2 and period 7. Observe that in this case, y; = =z yı and y4 = —=y3. This means that the two 


masses are always moving in opposite directions and that the second mass moves four-thirds as far, 
and four-thirds as fast, as the first mass. If we look only at u2 (1) and plot y, versus t and y; versus 
t on the same axes, we obtain Figure 7.6.4(a). There is a phase difference of 7, and the amplitude 
of y» is four-thirds that of yı, confirming the preceding statements about the motions of the masses. 
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Figure 7.6.4(b) shows a superposition of the trajectories for the two masses in their respective phase 
planes. Both graphs are ellipses, the inner one corresponding to the first mass and the outer one to 
the second. The trajectory on the inner ellipse starts at (3, 0), and the trajectory on the outer ellipse 
starts at ( —4, 0). Both are traversed clockwise, and a circuit is completed in time 7. The origin is a 
center in the respective y; y3- and yy y4-planes. Once again, similar graphs are obtained from v? or 
from a linear combination of u(? and v? . 


(a) (b) 


(a) A plot of y, versus t (solid blue) and y» versus ¢ (dashed red) for the solution 


u(t). (b) Superposition of projections of trajectories in the y,y3- and y?y4-planes for the 
solution u P (1). 


~. 4y? +yz = 36 
bh 


s 


(a) (b) 


aUI (a) A plot of y, versus t (solid blue) and y» versus t (dashed red) for the solution 


u?)(t). (b) Superposition of projections of trajectories in the y; y3- and yz y4-planes for the solution 
u(2(1). 


The types of motion described in the two preceding paragraphs are called fundamental 
modes of vibration for the two-mass system. Each of them results from fairly special initial 
conditions. For example, to obtain the fundamental mode of frequency 1, both of the constants 
c3 and c4 in equation (31) must be zero. This occurs only for initial conditions in which 
3y2(0) = 2y,(0) and 3y4(0) = 2ys(0). Similarly, the mode of frequency 2 is obtained only when 
both of the constants c, and c; in equation (31) are zero— that is, when the initial conditions are such 
that 3y2(0) = —4y;(0) and 3y4(0) = —4y3(0). 

For more general initial conditions the solution is a combination of the two fundamental modes. 
A plot of y, versus t for a typical case is shown in Figure 7.6.5(a), and the projection of the 
corresponding trajectory in the y; y3-plane is shown in Figure 7.6.5(b). Observe that this latter figure 
may be a bit misleading in that it shows the projection of the trajectory crossing itself. This cannot be 
the case for the actual trajectory in four dimensions, because it would violate the general uniqueness 
theorem: there cannot be two different solutions issuing from the same initial point. 
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A solution of the system (25) satisfying the 
initial condition y(0) = (—1, 4, 1, 1)7. (a) A plot of y, versus t. 
(b) The projection of the trajectory in the y; y3-plane. As stated 
in the text, the actual trajectory in four dimensions does not 
intersect itself. 


Problems 


In each of Problems 1 through 4: ; 2 —5 
Q a. Draw a direction field and sketch a few trajectories. 2 x= * 3) T 
b. Express the general solution of the given system of equations 1-1 
in terms of real-valued functions. 3. x= ( ) 
C. Describe the behavior of the solutions as f — oo. 5 -3 
NE 1 2 
l. x= ( i ts 4. x= * E 
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In each of Problems 5 and 6, express the general solution of the given 
system of equations in terms of real-valued functions. 


1 0 0 
5e x = [2 1 —2 |x 
3 2 1 
—3 0 2 
6. x= ] -1 0 [x 
—2 -1 0 


In each of Problems 7 and 8, find the solution of the given initial-value 
problem. Describe the behavior of the solution as t — oo. 


, fa -5 E 
7. xs ( x x(0) = () 
2 _ 1 
E x, x(0)= 2 


In each of Problems 9 and 10: 
a. Find the eigenvalues of the given system. 
Q b. Choose an initial point (other than the origin) and draw the 
corresponding trajectory in the x,x»-plane. 
Q c. For your trajectory in part b, draw the graphs of x, versus 
t and of x» versus f. 
Q d. For your trajectory in part b, draw the corresponding graph 
in three-dimensional tx,x5-space. Note that the projections of 
this plot onto each of the coordinate planes should produce the 
three plots produced in parts b and c. 


ME. 
9, x'— 4 5 |* 
Î => 
4 
4 
= 9 
10. x = 5 6 |* 
—] 2 
5 


In each of Problems 11 through 15, the coefficient matrix contains a 
parameter a. In each of these problems: 
a. Determine the eigenvalues in terms of a. 
b. Find the bifurcation value or values of o; where the qualitative 
nature of the phase portrait for the system changes. 
O c. Draw a phase portrait for a value of a slightly below, and 
for another value slightly above, each bifurcation value. 


11. v= (4 ) 
—] a 


| 
EN 
»* 


12. put E 
1 Qa 
5 3 

13. x» | ^ p x 
a es 
4 

) 


14. x= e 
zi 

15. x= k A 
8 —6 


In each of Problems 16 and 17, solve the given system of equations by 
the method of Problem 13 of Section 7.5. Assume that t > 0. 


"E 
16. w= ( 2 zs 


17; w= (i i^ 
1 -2 


In each of Problems 18 and 19: 
a. Find the eigenvalues of the given system. 
O b. Choose an initial point (other than the origin) and draw 
the corresponding trajectory in the x;x5-plane. Also draw the 
trajectories in the x,x3- and x2x3-planes. 
Q c. For the initial point in part b, draw the corresponding 
trajectory in x4x2xa-space. 


: 1 0 
4 
18. x= 1 : 0 [x 
Ba 4 
1 
0 0 —- 
n 
1 
—- 1 0 
4 
19. x ı -l 96 
x=]|-— —-- x 
4 
0 0 l 
10 


20. Consider the electric circuit shown in Figure 7.6.6. Suppose that 
1 
Ri =R =402,C= j P andL 8H. 


a. Show that this circuit is described by the system of 
differential equations 


1 1 
i(r)= Li te) 
dt VV 5- LE] XV 
2 
where / is the current through the inductor and V is the voltage 
drop across the capacitor. Hint: See Problem 18 of Section 7.1. 
b. Find the general solution of equations (32) in terms of real- 
valued functions. 
c. Find I(t) and V(t) if (0) = 2 A and V(0) 23 V. 
d. Determine the limiting values of /(t) and V(t) as t > oo. 
Do these limiting values depend on the initial conditions? 


Ry 


(32) 


R2 


The circuit in Problem 20. 


21. The electric circuit shown in Figure 7.6.7 is described by the 
system of differential equations 


1 
0 — 
d (1 mr 
= = , 33 
iy) P (5) Em 
C RC 


where / is the current through the inductor and V is the voltage 
drop across the capacitor. These differential equations were derived 
in Problem 16 of Section 7.1. 
a. Show that the eigenvalues of the coefficient matrix are real 
and different if L > 4R?C; show that they are complex conju- 
gates if L < 4R?C. 


1 
b. Suppose that R = 1 Q, C = 2 F, and L — 1 H. Find the 


general solution of the system (33) in this case. 


c. Find /(t) and V(t) if 7(0) 22A and V(0) = 1 V. 

d. For the circuit of part b, determine the limiting values of / (t) 
and V(t) as t — oo. Do these limiting values depend on the 
initial conditions? 


C 


R 


idid:zr3 The circuit in Problem 21. 


22. In this problem we indicate how to show that u(t) and v(t), as 
given by equations (17), are linearly independent. Let r} = A 4 ip 
and 7; = A — ip be a pair of conjugate eigenvalues of the coefficient 
matrix A of equation (1); let £(U = a + ib and £(0 = a — ib be 
the corresponding eigenvectors. Recall that it was stated in Section 7.3 
that two different eigenvalues have linearly independent eigenvectors, 
so ifr; Æ rj, then £ U and € are linearly independent. 
a. First we show that a and b are linearly independent. Consider 
the equation c;a+cyb = 0. Express a and b in terms of £ (? and 
E(D, and then show that (c; — ic?) £? +(e; c ic;) £C? = 0. 
b. Showthatc; — ic; = 0 and c cic? = 0 and then that c; = 0 
and c; = 0. Consequently, a and b are linearly independent. 
c. To show that u(t) and v(t) are linearly independent, consider 
the equation cqu(£9) +¢2V( fo) = 0, where tọ is an arbitrary point. 
Rewrite this equation in terms of a and b, and then proceed as in 
part b to show that c; = 0 and c; = 0. Hence u(t) and v(t) are 
linearly independent at the arbitrary point tọ. Therefore, they are 
linearly independent at every point and on every interval. 


23. A mass m on a spring with constant k satisfies the differential 
equation (see Section 3.7) 


mu" + ku = 0, 
where u(t) is the displacement at time ¢ of the mass from its 
equilibrium position. 
a. Let x; = u, x =u’, and show that the resulting system is 


0 1 
x = k x. 
e ý 
m 


b. Find the eigenvalues of the matrix for the system in part a. 
C. Sketch several trajectories of the system. Choose one of your 
trajectories, and sketch the corresponding graphs of x, versus t 
and x» versus f. Sketch both graphs on one set of axes. 
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d. What is the relation between the eigenvalues of the 
coefficient matrix and the natural frequency of the spring-mass 
system? 
24. Consider the two-mass, three-spring system of Example 3 in 
the text. Instead of converting the problem into a system of four 
first-order equations, we indicate here how to proceed directly from 
equations (22). 
a. Show that equations (22) can be written in the form 


2 2 

2 
4 
3 


Ii 


x = Ax. (34) 


=3 


b. Assume that x = € e” and show that 
(A — r?D£ — 0. 


Note that r? (rather than r) is an eigenvalue of A corresponding 
to an eigenvector £. 

c. Find the eigenvalues and eigenvectors of A. 

d. Write down expressions for x, and xz. There should be four 
arbitrary constants in these expressions. 

e. By differentiating the results from part d, write down 
expressions for x} and x}. Your results from parts d and e should 
agree with equation (31) in the text. 


25. Consider the two-mass, three-spring system whose equations of 
motion are equations (22). Let mı = 1, m5 = 4/3, kı = 1, ko = 3, 
and k3 = 4/3. 
a. As in Example 3, convert the system to four first-order 
equations of the form y' — Ay. Determine the coefficient matrix 
A. 
b. Find the eigenvalues and eigenvectors of A. 
c. Write down the general solution of the system. 
O d. Describe the fundamental modes of vibration. For each 
fundamental mode, draw graphs of y, versus f and y» versus t. 
Also draw the corresponding trajectories in the y; y3- and y2y4- 
planes. 
O e. Consider the initial conditions y(0) = (2,1,0,0)7. 
Evaluate the arbitrary constants in the general solution in 
part c. What is the period of the motion in this case? Plot graphs 
of y, versus f and y? versus f. Also plot the corresponding 
trajectories in the y; y3- and y; y4-planes. Be sure you understand 
how the trajectories are traversed for a full period. 
Q f. Consider other initial conditions of your own choice, and 
plot graphs similar to those requested in part e. 


The structure of the solutions of systems of linear differential equations can be further 
illuminated by introducing the idea of a fundamental matrix. Suppose that x P (1), ... , x(t) 


form a fundamental set of solutions for the equation 
x = P(t)x 


on some interval a < t < f. Then the matrix 


W(t) = (x) xo | T ix») E 


(1) 


, (2) 
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whose columns are the vectors x(t), ... , x? (1), is said to be a fundamental matrix for 
the system (1). Note that a fundamental matrix is nonsingular since its columns are linearly 
independent vectors. 


EXAMPLE 1 


Find a fundamental matrix for the system 
; 1 1 Q) 
x = x. 
4 1 


In Example 2 of Section 7.5, we found that 


3t —t 
Quilt Qe) = d 
x = G4). x o=) 


are linearly independent solutions of equation (3). Thus a fundamental matrix for the system (3) is 
e e 
Wr) = (4) 
i a.) 


The solution of an initial value problem can be written very compactly in terms of a 
fundamental matrix. The general solution of equation (1) is 


Solution: 


x — cix (1) «eux P (1) 65) 

or, in terms of W(t), 
x= V(t)c, (6) 
where c is a constant vector with arbitrary components c1, ... , Cn. For an initial value problem 


consisting of the differential equation (1) and the initial condition 
x(t) = x, (7) 


where tọ is a given point ina < t < ( and x is a given initial vector, it is only necessary 
to choose the vector c in equation (6) so as to satisfy the initial condition (7). Hence c must 


satisfy 
V(1))c = x. (8) 
Therefore, since W(fp) is nonsingular, 
c= V (nx? (9) 
and 
x = V(1)W (nx? (10) 


is the solution of the initial value problem (1), (5). We emphasize, however, that to solve a 
given initial value problem, we would ordinarily solve equation (8) by row reduction and then 
substitute for c in equation (6), rather than compute v fo) and use equation (10). 

Recall that each column of the fundamental matrix W is a solution of equation (1). It 
follows that W satisfies the matrix differential equation 


V' — P). (11) 


This relation is readily confirmed by comparing the two sides of equation (11) column by 
column. 


Sometimes it is convenient to make use of the special fundamental matrix, denoted by 
@(t), whose columns are the vectors x! (1), ... , x"? (t) designated in Theorem 7.4.4. 
Besides the differential equation (1), these vectors satisfy the initial conditions 


x (1) =e, (12) 


where e” is the unit vector, defined in Theorem 7.4.4, with a 1 in the J th position and zeros 
elsewhere. Thus (1) has the property that 


$(-—|. - -| =L (13) 
OO a J 


We will always reserve the symbol ® to denote the fundamental matrix satisfying the initial 
condition (13) and use Y when an arbitrary fundamental matrix is intended. In terms of 
(t), the solution of the initial value problem (1), (7) is even simpler in appearance; since 
$ (1) = I, it follows from equation (10) that 


x = (nx. (14) 


Although the fundamental matrix (1) is often more complicated than W(1), it is especially 
helpful if the same system of differential equations is to be solved repeatedly subject to many 
different initial conditions. This corresponds to a given physical system that can be started 
from many different initial states. If the fundamental matrix (f) has been determined, then 
the solution for each set of initial conditions can be found simply by matrix multiplication, 
as indicated by equation (14). The matrix (f) thus represents a transformation of the initial 
conditions x? into the solution x(t) at an arbitrary time t. Comparing equations (10) and (14) 
makes it clear that P(t) = W(t) P! (to). 


EXAMPLE 2 


For the system (3) 


; 1 1 
x= x 
4 1 
in Example 1, find the fundamental matrix ® such that (0) = I. 


Solution: 


The columns of ® are solutions of equation (3) that satisfy the initial conditions 


x)(0) = 0 x?(0) = Wu (15) 


Since the general solution of equation (3) is 


we can find the solution satisfying the first set of these initial conditions by choosing 


1 
ĉi — h = 3 similarly, we obtain the solution satisfying the second set of initial conditions by 
1 1 
choosing c, = — and c; = — —. Hence 
8c) a C2 4 
1 1 1 
3t =t 3t —t 
af + 5t 44 j^ 
P(t) = F (16) 
1 
3f. 223 a =t 
e e 7° + 3* 


Note that the elements of (7) are more complicated than those of the fundamental matrix W(t) 
given by equation (4); however, it is now easy to determine the solution corresponding to any set of 
initial conditions. 
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The Matrix exp(A;). Recall that the solution of the scalar initial value problem 
x =ax, x(0) = x9, (17) 
where a is a constant, is 
X = xgexp(at). (18) 
Now consider the corresponding initial value problem for an n x n system, namely, 
x —Ax, x(0)-x, (19) 


where A is a constant matrix. Applying the results of this section to the initial value problem 
(19), we can write its solution as 


x = (t)x?, (20) 


where ®(0) = I. Comparing the initial value problems (17) and (19), and their solutions (18) 
and (20), suggests that the matrix (1) might have an exponential character. We now explore 
this possibility. 

The scalar exponential function exp(at) can be represented by the power series 
242 uL 


E at 
ERE E 


a'r 
dee Q1) 
n! 


exp(at) = 1+ b» 


n=1 


which converges for all t. Let us now replace the scalar a by the n x n constant matrix A, the 
scalar 1 by the n x n identity matrix I, and consider the corresponding series 


oc A”g" A2t2 A”g" 
I+ =I+At+ =+ 
n! 2! n! 


n=1 


pas (22) 


Each term in the series (22) is an n x n matrix. It is possible to show that each element of 
this matrix sum converges for all t as n — oo. Thus the series (22) defines as its sum a new 
matrix, which we denote by exp(At); that is, 


oo Ann 
exp(Ar) 2 1-97 a (23) 
n=1 
analogous to the expansion (21) of the scalar function exp(at). 
By differentiating the series (23) term by term, we obtain 
d oc A^ oo A^" 
j; CPAD) “Gam 2? —— | = Aexp(An). (24) 
Therefore, exp( At) satisfies the differential equation 
d 
— exp( At) = A exp( At). (25) 


dt 


Further, by setting t = 0 in equation (23), we find that exp( Aż) satisfies the initial condition 


oo A^ 
exp A)|  =I+ 5 a I. (26) 
t=0 n=1 i 


The fundamental matrix ® satisfies the same initial value problem as exp( At), namely, 


$ = A, (0)=I. (27) 


Then, by the uniqueness part of Theorem 7.1.2 (extended to matrix differential equations), we 
conclude that exp(At) and the fundamental matrix (1) are the same. Thus we can write the 
solution of the initial value problem (19) in the form 


x = exp( Ar) x, (28) 


which is analogous to the solution (18) of the initial value problem (17). 

In order to justify more conclusively the use of exp( At) for the sum of the series (22), 
we should demonstrate that this matrix function does indeed have the properties we associate 
with the exponential function. One way to do this is outlined in Problem 12. 


Diagonalizable Matrices. The basic reason why a system of linear (algebraic or 
differential) equations presents some difficulty is that the equations are usually coupled. In 
other words, some or all of the equations involve more than one—often all—of the unknown 
variables. Hence the equations in the system must be solved simultaneously. In contrast, if 
each equation involves only a single variable, then each equation can be solved independently 
of all the others, which is a much easier task. This observation suggests that one way to solve 
a system of equations might be by transforming it into an equivalent uncoupled system in 
which each equation contains only one unknown variable. This corresponds to transforming 
the coefficient matrix A into a diagonal matrix. 
Eigenvectors are useful in accomplishing such a transformation. Suppose that the 
n x n matrix A has a full set of n linearly independent eigenvectors. Recall that this 
will certainly be the case if the eigenvalues of A are all different, or if A is Hermitian. 
Letting €, ...,€™ denote these eigenvectors and A,,...,A, the corresponding 
eigenvalues, form the matrix T whose columns are the eigenvectors —that is, 
"EOD 
1 1 
T=|: e]. (29) 
(D o (n) 
n n 
Since the columns of T are linearly independent vectors, det T + 0; hence T is nonsingular 
and T^! exists. A straightforward calculation shows that the columns of the matrix AT are 


just the vectors A£”, ... , AE. Since AE = A,£ (9, it follows that 
1 n 
Ai& ! s Aet ? 
AT = . . = TD, (30) 
AE wes Gur 
where 
Ài 0 0 
Ü. Ago © 
D=]. LE (31) 
0 li. ue s 


is a diagonal matrix whose diagonal elements are the eigenvalues of A. From equation (30) it 
follows that 

TAT =D. (32) 
Thus, if the eigenvalues and eigenvectors of A are known, A can be transformed into a 
diagonal matrix by the process shown in equation (32). This process is known as a similarity 
transformation, and equation (32) is summed up in words by saying that A is similar 
to the diagonal matrix D. Alternatively, we may say that A is diagonalizable. Observe 
that a similarity transformation leaves the eigenvalues of A unchanged and transforms its 
eigenvectors into the coordinate vectors eD, ... el”, 

If A is Hermitian, then the determination of T-! is very simple. The eigenvectors 
ED, ... ,E™ of A are known to be mutually orthogonal, so let us choose them so that they 
are also normalized by (£?, £?) = 1 for each i. Then it is easy to verify that T^! = T*; in 
other words, the inverse of T is the same as its adjoint (the transpose of its complex conjugate). 

Finally, we note that if A has fewer than n linearly independent eigenvectors, then there 
is no matrix T such that T AT = D. In this case, A is not similar to a diagonal matrix and is 
not diagonalizable. This situation is discussed in more detail in Section 7.8. 
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EXAMPLE 3 


Consider the matrix 


kat? 3 33 
(«M4 1f (33) 


Find the similarity transformation matrix T and show that A can be diagonalized. 


Solution: 


In Example 2 of Section 7.5, we found that the eigenvalues and eigenvectors of A are 


1 1 
r =3, EV = (2 and r,—2-—1, £? = (3) (34) 


Thus the transformation matrix T and its inverse T~! are 


1 1 
1 1l ELE 4 
TS (; 3) and T = 1 1 (35) 
2 4 
Consequently, you can check that 
3 0 
T'AT = =D. (36) 
0 -l 
Now let us turn again to the system 
x’ = Ax, (37) 


where A is a constant matrix. In Sections 7.5 and 7.6 we described how to solve such a system 
by starting from the assumption that x = £e". Now we provide another viewpoint, one based 
on diagonalizing the coefficient matrix A. 

According to the results stated earlier in this section, it is possible to diagonalize A 
whenever A has a full set of n linearly independent eigenvectors. Let €, ... , £? be 
eigenvectors of A corresponding to the eigenvalues r;---,7, and form the transformation 
matrix T whose columns are £ U, ... , €”. Then, defining a new dependent variable y by 
the relation 


x — Ty, (38) 
we have from equation (37) that 
Ty — ATy. (39) 
Multiplying by T-!, we then obtain 
y = (T AT, (40) 
or, using equation (32), 
y = Dy. (41) 
Recall that D is the diagonal matrix with the eigenvalues r1, ... , 7, of A along the diagonal. 


A fundamental matrix for the system (41) is the diagonal matrix (see Problem 13) 


e 0 ... O0 


Q() = exp(Dr) = | . P a: (42) 


A fundamental matrix V for the system (37) is then found from Q by the transformation (38) 
Ww = TQ; (43) 
that is, 
Deni a orn 
W(t) = . : . (44) 
gei Lus gg 


The columns of W(t) are the same as the solutions in equation (27) of Section 7.5. Thus 
the diagonalization procedure does not offer any computational advantage over the method of 
Section 7.5, since in either case it is necessary to calculate the eigenvalues and eigenvectors 
of the coefficient matrix in the system of differential equations. 


EXAMPLE 4 


Consider once again the system of differential equations 


x’ = Ax, (45) 
where A is given by equation (33). Using the transformation x = Ty, where T is given by 
equation (35), the system (45) reduces to the diagonal system 

Kad er D (46) 
y= 0 —1 y= Vy. 


Obtain a fundamental matrix for the system (46), and then transform it to obtain a fundamental matrix 
for the original system (45). 


Solution: 


By multiplying D repeatedly with itself, we find that 


" b ;) " K ;) 
D? = , D= EUM (47) 
0 1 0 -1 


Therefore, it follows from equation (23) that exp( Dt) is a diagonal matrix with the entries e?' and 
e™ on the diagonal; that is, 
3t 
Di e 0 
e = ; 48 
( 0 et ) (18) 


Finally, we obtain the required fundamental matrix Y(t) by multiplying T and exp( Dr): 


sast e ace ae (49) 
E 2 0 e] (e —2e7 J^ 


Observe that this fundamental matrix is the same as the one found in Example 1. 
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Problems 


In each of Problems | through 8: 
a. Find a fundamental matrix for the given system of equations. 
b. Find the fundamental matrix (t) satisfying ®(0) = I. 


1. gs i^ 
2 -2 


col — RIO 
AIUCrF|— 
»* 


p 
* 
I 

| 


1 -1 4 
8. x =]3 2 -1|x 
2 1 -1 


9. Use the fundamental matrix (7) found in Problem 4 to solve 
the initial value problem 


n —-] -4 "EE E 
x — 1-1) x0 = e 


10. Show that E(t) = P(t) P !(to), where (7) and W(t) are as 
defined in this section. 


11. The fundamental matrix (7) for the system (3) was found in 
Example 2. Show that ®(t)®(s) = (t + s) by multiplying (t) 
and ®(s). 


12. Let (t) denote the fundamental matrix satisfying P’ = A®, 
(0) = I. In the text we also denoted this matrix by exp( Ar). In 
this problem we show that ® does indeed have the principal algebraic 
properties associated with the exponential function. 
a. Show that ®(t)®(s) = ®(t + s); that is, show that 
exp( At) exp( As) = exp(A(t + s)). Hint: Show that if s is fixed 
and f is variable, then both (7) (s) and ®(t + s) satisfy the 
initial value problem Z’ = AZ, Z(0) = ®(s). 


b. Show that ®(t) (—t) =I; that is, exp( At) exp(A(—1)) =I. 

Then show that &(—7) = ®7!(1). 

€. Show that &(1 — s) = P(t) !(s). 
13. Show that if A is a diagonal matrix with diagonal elements 
41, Q5, ... , An, then exp( Ar) is also a diagonal matrix with diagonal 
elements exp(at), exp(aot), ... , exp(ast). 


14. Consider an oscillator satisfying the initial value problem 


u" +w7u — 0, u(0) = ug, u'(0) = vg. (50) 


a. Let x; = u, x; = u’, and transform equations (53) into the 
form 


x —Ax, x(0)— x. (51) 
b. Use the series (23) to show that 


BECAS) silua) a RP (52) 
Ww 


c. Find the solution of the initial value problem (51). 


15. The method of successive approximations (see Section 2.8) can 
also be applied to systems of equations. For example, consider the 
initial value problem 


/ 


x —Ax, x(0)-x, (53) 


where A is a constant matrix and x? is a prescribed vector. 
a. Assuming that a solution x = (f) exists, show that it must 
satisfy the integral equation 


b(t) =x? + | Ad(s) ds. (54) 
0 


b. Start with the initial approximation (1) = x°. Substitute 
this expression for (s) on the right-hand side of equation (51) 
and obtain a new approximation e! (1). Show that 


bo (7) = (14+ A0xP. (55) 
c. Repeat this process and thereby obtain a sequence of 


approximations b, (P, (9, ...,b, ... . Use an induc- 
tive argument to show that 


t2 t 
b™ (t) = (ve eM eem t (56) 


d. Let n — oo and show that the solution of the initial value 
problem (53) is 


b(t) = exp( An x’. (57) 


78 Repeated Eigenvalues 


We conclude our consideration of the linear homogeneous system of differential equations 
with constant coefficients 


x’ = Ax (1) 


with a discussion of the case in which the matrix A has a repeated eigenvalue. Recall that 
in Section 7.3 we stated that a repeated eigenvalue with algebraic multiplicity m > 2 may 
have a geometric multiplicity less than m. In other words, there may be fewer than m linearly 
independent eigenvectors associated with this eigenvalue. The following example illustrates 
this possibility. 


EXAMPLE 1 


Find the eigenvalues and eigenvectors of the matrix 
R 1 -l (2) 
xU "E 


The eigenvalues r and eigenvectors € satisfy the equation (A — rD E = 0, or 


(v 2)9-0 " 


The eigenvalues are the roots of the equation 


Solution: 


—r —-i 


=r? —4r4+4=(r—2)? =0. (4) 
1 3—r 


1 
det(A — 7D — | 


Thus the two eigenvalues are r; = r = 2; that is, the eigenvalue 2 has algebraic multiplicity 2. 
To determine the eigenvectors, we must return to equation (3) and use r — 2. This gives 


—] -1\/& 0 
= : (5) 
1 1/ N£o 0 
Hence we obtain the single condition £; + 2 = 0, which determines £» in terms of £4, or vice versa. 
Thus the eigenvector corresponding to the eigenvalue r — 2 is 


1 
(D... 
gu. (a) (6) 


or any nonzero multiple of this vector. Observe that there is only one linearly independent eigenvector 
associated with the double eigenvalue. 


Returning to the system (1), suppose that r = p is an m-fold root of the characteristic 
equation 

det(A — rI) = 0. (7) 

Then p is an eigenvalue of algebraic multiplicity m of the matrix A. In this event, there 

are two possibilities: either there are m linearly independent eigenvectors corresponding to 


the eigenvalue p, or else, as in Example 1, there are fewer than m linearly independent 
eigenvectors. 


In the first case, let £, ... , £? be m linearly independent eigenvectors associated 
with the eigenvalue p of algebraic multiplicity m. Then there are m linearly independent 
solutions x(t) = Ee”, ... , x(t) = Ee?! of equation (1). Thus in this case, it 


makes no difference that the eigenvalue r — p is repeated; there is still a fundamental set of 
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solutions of equation (1) of the form £e"'. This case always occurs if the coefficient matrix A 
is Hermitian (or real and symmetric). 

However, if the coefficient matrix is not Hermitian, then there may be fewer than m 
independent eigenvectors corresponding to an eigenvalue p of algebraic multiplicity m, and if 
so, there will be fewer than m solutions of equation (1) of the form £e^' associated with this 
eigenvalue. Therefore, to construct the general solution of equation (1), it is necessary to find 
other solutions of a different form. Recall that a similar situation occurred in Section 3.4 for 
the linear equation ay" + by’ + cy = 0 when the characteristic equation has a double root r. 
In that case we found one exponential solution y; (1) = e"', but a second independent solution 
had the form y(t) = te". With that result in mind, consider the following example. 


EXAMPLE 2 


Find a fundamental set of solutions of 


Rx E sd " 


and draw a phase portrait for this system. 


Solution: 


A direction field for the system (8) is shown in Figure 7.8.1. From this figure it appears that all 
nonzero solutions depart from the origin. 


X2 
CARENTAN A EEEE 
PE NN OS 
NN XXX TT TIT 
SAN EAANA TE Att ft 
NOM M XXE Tog t t 
PIN Xd treet 
N.N CT t4 £14 dgdif 
SNA tL t 1 ELE LL 
SAN es ee a ae al a 
Sa EEL PAS SLL 
4 JAY TV YN mo 4te-- 
£4 Jet V T O a d 
4 44 4 d: 4 V VM FREU 
[ALL LLL MAYEN NA AM 
Jj4EEE LN N NNNM 
£L V NV PANNNNAN 
FITIIIEITA VN NNNMN 
EEUU VNNNAN 
PEEP V NEA ANNANN 
LEE V V YN N NNNM 


A direction field for the system (8). 


To solve the system, observe that the coefficient matrix A is the same matrix as in Example 1. 
Thus we know that r — 2 is a double eigenvalue and that it has only a single corresponding 
eigenvector, which we may take as € = (1, —1)". Thus one solution of the system (8) is 


x P (1) = [29 (9) 


but there is no second solution of the form x = £e". 
Based on the procedure used for second-order linear differential equations in Section 3.4, it may 
be natural to attempt to find a second independent solution of the system (8) of the form 


x = te”, (10) 
where € is a constant vector to be determined. Substituting for x in equation (8), we obtain 


2éte* + Ee = A£te^. (11) 


For equation (11) to be satisfied for all t, it is necessary to equate the coefficients of te” and e” on 
each side of equation (11). From the term in e?', we find that 


£- 9. (12) 


Hence there is no nonzero solution of the system (8) of the form (10). 


Since equation (11) contains terms in both te” and e”, it appears that in addition to £te?', the 
second solution must contain a term of the form ne? ; in other words, we need to assume that 


x = Ete” + ne”, (13) 


where € and 77 are constant vectors to be determined. Upon substituting this expression for x in 
equation (8), we obtain 


2éte* + (£ - 2n)e** = A( Ete + ye”). (14) 
Equating coefficients of te” and e” on each side of equation (14) gives the two conditions 
26 = AE 

and 

€+2n=An 
for the determination of € and 77. It is helpful to rewrite these conditions as 

(A -2D€=0 (15) 
and 

(A — 2D m =À, (16) 


respectively. Equation (15) is satisfied if £ is an eigenvector of A corresponding to the eigenvalue 
r = 2, such as € = (1, —1)7. Since det(A — 2I) is zero, equation (16) is solvable only if the right- 
hand side £ satisfies a certain condition. Fortunately, £ and its multiples are exactly the vectors that 
allow equation (16) to be solved." The augmented matrix for equation (16) is 


-1 -1 1 
1 1| -1)}° 
The second row of this matrix is proportional to the first, so the system is solvable. We have 


= — 2 =1, 


so if 7; = k, where k is arbitrary, then 7, = —k — 1. If we write 


TENERO " 


then by substituting for € and n in equation (13), we obtain 


x = &te + ne” = (ve + (1) e? + ire (18) 


The last term in equation (18) is merely a multiple of the first solution x‘ (7) and may be ignored, 
but the first two terms constitute a new solution: 


x(t) = (iye + ( (19) 


An elementary calculation shows that W[x ' , x? ](t) = —e^ + 0, and therefore x” and x) 
form a fundamental set of solutions of the system (8). The general solution is 


x= ax P(t) + Cx) (1) 


EET (ie +c ( (ie + (1) 2 (20) 


The main features of a phase portrait for the solution (20) follow from the presence of the exponential 
factor e” in every term. Therefore x — 0 as? — —oo and, unless both c, and c; are zero, x becomes 


"This condition is that (£, y) = 0 for any nontrivial solution y of (A — 2I) *y = 0. From the fact that every matrix 
and its adjoint have the same eigenvalues (see Problems 21-25 in Section 7.3), we conclude that the vectors y 
are the eigenvectors of A* corresponding to its (repeated) eigenvalue r — 2. An easy calculation shows that 
y=c(1,1) T (see Problem 16). Since € = (1, —1)", we see that (£, y) = 0. Thus, equation (16) is solvable. 
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unbounded as t — oo. If cı and c are not both zero, then along any trajectory we have 


: X(t) —c,e T cj (—te?' = =) . —C| — Cot — C3 
lim = 2: 3 = lim ==] 
1 —oo X(t) cje” + cote?! IE Cy + cot 
Therefore, as £ —  —oo, every trajectory approaches the origin tangent to the line x = —x, 


determined by the eigenvector; this behavior is clearly evident in Figure 7.8.2(a). Further, ast — oo, 
the slope of each trajectory also approaches —1. However, it is possible to show that trajectories do 
not approach any single asymptote as t — oo. Several trajectories, including x‘) (the solid black 
curve) and x”) (the dashed black curve), of the system (8) are shown in Figure 7.8.2(a), and some 
typical plots of x, versus t and of x» versus t are shown in Figures 7.8.2(b) and 7.8.2(c), respectively. 


X» 

NANANA EE tt att 
NNNNNSS b Vat t tit t 
AAAA NN AN IBN WEBEENI 
ANA PA CW Ht t tht t 
MMNNNNSN yA tyWitt t ptt 
MM NNNNNNSM 864 tart 
Sank SN E Intr d ng 
ewm SAN Vtt titt? 
4 an am a N NEP LULL 
L4 EE LLL. 
LID 1 His N SUAS Xi 
seed Wd WA SxS mm 
Edt dW WY AWAN ———— 
TE ERI HENN Ira- 
EBEN WV VPA ARS ASS 
EE VV V PAA ANAS 
PEATE MAA AAAS 
PETE E S ER AWN AA ANAS 
BIZ VY FN AAA AN AAA 
EAE EY VAWW VAN E NN NNNM 


(b) (c) 


(a) Phase portrait of the system (8); the origin is an improper node. (5) Plots of 
xı versus f for the system (8). (c) Plots of xz versus t for the system (8). The component plots in 
(b) and (c) are color-coded to their trajectories in (a). The purple curves are for the solution that 
passes through ( —1, 1/2), red through ( —1, —1/2), green (0, 1/2), orange (0, —1/2), blue 
(1, 1/2), and gold (1, —1/2). 


The pattern of trajectories in Figure 7.8.2(a) is typical of 2 x 2 systems x' — Ax with equal 
eigenvalues and only one independent eigenvector. The origin is called an improper node in 
this case. If the eigenvalues are negative, then the trajectories are similar but are traversed 
in the inward direction. An improper node is asymptotically stable or unstable, depending on 
whether the eigenvalues are negative or positive. 

One difference between a system of two first-order equations and a single second-order 
equation is evident from the preceding example. For a second-order linear equation with a 
repeated root r; of the characteristic equation, a term ce"! in the second solution is not required 
since it is a multiple of the first solution. On the other hand, for a system of two first-order 
equations, the term rje"'' of equation (13) with rı = 2 is not, in general, a multiple of the first 
solution £e", so the term ge"! must be retained. 


Example 2 is entirely typical of the general case when there is a double eigenvalue and 
a single associated eigenvector. Consider again the system (1), and suppose that r = p is 
a double eigenvalue of A, but that there is only one corresponding eigenvector €. Then one 
solution, similar to equation (9), is 


x (t) = Ee, (21) 
where € satisfies 
(A — pHE=0. (22) 


Following the process introduced in Example 2, we find that a second solution, similar to 
equation (19), is 


x) (t) = Ete + ne", (23) 
where € satisfies equation (22) and 7 is determined from 
(A — pDm = €. (24) 


Even though det(A — pI) = 0, it can be shown that it is always possible to solve equation (24) 
for 7. While we will not present all of the details, an important step is to note that if we multiply 
equation (24) by A — pI and use equation (22), then we obtain 


(A — pD?n — 0. 


The vector 77 is called a generalized eigenvector of the matrix A corresponding to the 
eigenvalue p. 


Fundamental Matrices. As explained in Section 7.7, fundamental matrices are formed by 
arranging linearly independent solutions in columns. Thus, for example, a fundamental matrix 
for the system (8) can be formed from the solutions x‘? (1) and x? (t) from equations (9) and 


(19), respectively: 
Ht) = e” te” | xaf 1 t T" 
Caza ajeta] Veg Lpog (25) 


The particular fundamental matrix ® that satisfies ®(0) = I can also be readily found from 
the relation ®(1) = W(t) W^ !(0). From equation (25), we have 


E 1 0 TT 1 0 
wo - (1 2 so that Y o=( 4 (26) 


and then 


= -1 = get ! d : : 
$(r = W(r) Ww (0) =e ( 1 2i Jt 1 ) 


l-t —t 
2t 
E ; 27 
? ( t TA ar 


Recall that the fundamental matrix (1) with (0) = I can also be written as exp( At). For 
the matrix A in Example 2, the solution to x' = Ax with x(0) = x? is x(t) = exp(At)x?, or 
x(t) = ®(t)x° with P(t) given by equation (27). 


Jordan Forms. Ann x n matrix A can be diagonalized as discussed in Section 7.7 only 
if it has a full complement of n linearly independent eigenvectors. If there is a shortage of 
eigenvectors (because of repeated eigenvalues), then A can always be transformed into a nearly 
diagonal matrix called its Jordan® form, which has the eigenvalues of A on the main diagonal, 
ones in certain positions on the diagonal above the main diagonal, and zeros elsewhere. 


8Marie Ennemond Camille Jordan (1838-1922) was professor at the Ecole Polytechnique and the Collége de France. 
He is known for his important contributions to analysis and to topology (the Jordan curve theorem) and especially 
for his foundational work in group theory. The Jordan form of a matrix appeared in his influential book Traité des 
substitutions et des équations algébriques, published in 1870. 
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Consider again the matrix A given by equation (2). To transform A into its Jordan form, 
we construct the transformation matrix T with the single eigenvector £ from equation (6) in its 
first column and the generalized eigenvector 7 from equation (17) with k = 0 in the second 
column. Then T and its inverse are given by 


T-(1 °? a T). -( 1 ° 28 
let <i 7 ^g. Q8) 


As you can verify, it follows that 


2 1 
= — uM 
T AT- ic ; = J. (29) 


The matrix J in equation (29) is the Jordan form of A. It is typical of all Jordan forms in 
that it has a 1 above the main diagonal in the column corresponding to the eigenvector that is 
lacking (and is replaced in T by the generalized eigenvector). 

If we start again from equation (1) 


the transformation x = Ty, where T is given by equation (28), produces the system 
y = Jy, (30) 
where J is given by equation (29). In scalar form the system (30) is 
yi = 2y ty» yy = 2yr. GD 


These equations can be solved readily in reverse order—that is, by starting with the equation 
for y2. In this way we obtain 


y(t) =cye~ and y(t) = cite? + ce”. (32) 


Thus two independent solutions of the system (30) are 


1 t 
y(t) = (o) e% and y(t) — D (33) 
and the corresponding fundamental matrix is 
n e” te” 
W(t) = i P (34) 


Since (0) = I, we can also identify the matrix in equation (34) as exp( Jt). The same result 
can be reached by calculating powers of J and substituting them into the exponential series 
(see Problems 19 through 21). To obtain a fundamental matrix for the original system, we now 
form the product 


2t 


e te?! 
W(t) = Texp(Jt) = ot at qx] (35) 


=e 


which is the same as the fundamental matrix given in equation (25). 

We will not discuss n xn systems x’ = Ax in more detail here. For large n it is possible that 
there may be eigenvalues of high algebraic multiplicity m, perhaps with much lower geometric 
multiplicity q, thus giving rise to m — q generalized eigenvectors. Problems 17 and 18 explore 
the use of Jordan forms for systems of three differential equations. For n > 4 there may also 
be repeated complex eigenvalues. A full discussion? of the Jordan form of a general n x n 
matrix requires a greater background in linear algebra than we assume for most readers of this 
book. 


?For example, see the books listed in the References at the end of this chapter. 
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The amount of arithmetic required in the analysis of a general n x n system may be 
prohibitive to do by hand even if n is no greater than 3 or 4. Consequently, suitable computer 
software should be used routinely in most cases. This does not overcome all difficulties by any 
means, but it does make many problems much more tractable. Finally, for a set of equations 
arising from modeling a physical system, it is likely that some of the elements in the coefficient 
matrix A result from measurements of some physical quantity. The inevitable uncertainties in 
such measurements lead to uncertainties in the values of the eigenvalues of A. For example, 
in such a case it may not be clear whether two eigenvalues are actually equal or are merely 


close together. 


Problems 


In each of Problems 1 through 3: 
a. Draw a direction field and sketch a few trajectories. 
O b. Describe how the solutions behave as t — oo. 
c. Find the general solution of the system of equations. 


2 1 
3. x= A LIX 
4 2 


In each of Problems 4 and 5, find the general solution of the given 
system of equations. 


1 1 1 
4. x =|2 1 -1 |x 
0 -1 1 
0 1 1 
5. x= [1 0 1 |x 
1 1 0 


In each of Problems 6 through 8: 
a. Find the solution of the given initial value problem. 
b. Sketch the trajectory of the solution in the x4x5-plane, and 
also sketch the graph of x, versus t. 


, fi -4 _ iB 
6. = x xo = (7) 
(0) — : 
x x0-|, 
4 3 3 (2 
8. Ee à) xo = (7) 


In each of Problems 9 and 10: 
a. Find the solution of the given initial value problem. 
Q b. Draw the corresponding trajectory in x,x2x5-space. 
C. Sketch the graph of x, versus f. 


= 

as 

Il 
NIVI CA 
NI 023] 02 


1 0 0 -1 
9. x = | -4 1 O}x, x(0)= 2 
3 6 2 —30 


c5 1 1 
j 5 
10. x’ = 1 E. 1 |x, x(0)= 3 
1 1 E P 
2 


In each of Problems 11 and 12, solve the given system of equations by 
the method of Problem 13 of Section 7.5. Assume that t > 0. 


12. «-( E 
4 -7 


13. Show that all solutions of the system 


; a b 
x= x 
fo d 
approach zero as t — oo if and only if a +d < 0 and ad — bc > 0. 


Compare this result with that of Problem 28 in Section 3.4. 


14. Consider again the electric circuit in Problem 21 of Section 7.6. 
This circuit is described by the system of differential equations 


s). 0 


C — RC 


a. Show that the eigenvalues are real and equal if L = 4R?C. 
b. Suppose that R = 1 0, = 1 F, and L = 4H. Suppose also 
that /(0) = 1 A and V(0) = 2 V. Find I(t) and V(t). 


15. Consider again the system 


‘=A do (36) 
X =Ax= x 
1 3 
that we discussed in Example 2. We found there that A has a double 
eigenvalue rj, = rọ = 2 with a single independent eigenvector 


€ = (1, —1)7, or any nonzero multiple thereof. Thus one solution 
of the system (36) is x(t) = ée” and a second independent 
solution has the form 


x (1) = éte” + ne”, 


where € and ņ satisfy 
(A-2D€=0, (A-2Dn=€. (37) 


In the text we solved the first equation for € and then the second 
equation for 7. Here, we ask you to proceed in the reverse order. 
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a. Show that 77 satisfies (A — 21)*n = 0. 

b. Show that (A — 21)? = 0. Thus the generalized eigenvector 
7 can be chosen arbitrarily, except that it must be independent of 
El D. 

c. Let 7 = (0, —1)7. Then determine £ from the second of 
equations (37) and observe that € = (1, —1)7 = £(U. This 
choice of 7) reproduces the solution found in Example 2. 

d. Let 7 = (1,0)? and determine the corresponding 
eigenvector £. 

e. Let 7 = (kj, ky)", where kı and k; are arbitrary numbers. 
What condition on k, and ky ensures that 7 and £(! are 
linearly independent? Then determine £. How is it related to the 
eigenvector €‘))? 


16. In Example 2, with A given in equation (36) above, it was 
claimed that equation (16) is solvable even though the matrix A — 2I 
is singular. This problem justifies that claim. 
a. Find all eigenvalues and eigenvectors for A*, the adjoint 
of A. 
b. Show that the eigenvectors of A and the corresponding 
eigenvectors of A* are orthogonal. 
c. Explain why this proves that equation (16) is solvable. 


Eigenvalues of Multiplicity 3. If the matrix A has an eigenvalue of 
algebraic multiplicity 3, then there may be either one, two, or three 
corresponding linearly independent eigenvectors. The general solution 
of the system x’ = Ax is different, depending on the number of 
eigenvectors associated with the triple eigenvalue. As noted in the text, 
there is no difficulty if there are three eigenvectors, since then there 
are three independent solutions of the form x = £e"'. The following 
two problems illustrate the solution procedure for a triple eigenvalue 
with one or two eigenvectors, respectively. 


17. Consider the system 


1 1 1 
x’ = Ax= 2 1 -1]|x (38) 
-3 2 4 


a. Show that r — 2 is an eigenvalue of algebraic multiplicity 
3 of the coefficient matrix A and that there is only one 
corresponding eigenvector, namely, 


0 
£u 31 
-1 


b. Using the information in part a, write down one solution 
x (P (1) of the system (38). There is no other solution of the purely 
exponential form x = £e". 

c. To find a second solution, assume that x = £te? + ne’. 
Show that £ and 77 satisfy the equations 


(A-2D£ —0, (A-2Dq —£. 


Since € has already been found in part a, solve the second 
equation for 7. Neglect the multiple of €P that appears in n, 
since it leads only to a multiple of the first solution x‘). Then 
write down a second solution x? (1) of the system (38). 
d. To find a third solution, assume that 
t ? 2t 2t 2t 
=E" + nte” + Ge". 


Show that €, 7, and ¢ satisfy the equations 


(A-2D£—0, (A-2Dq—& (A-2D¢ =n. 


18. 


19. 


20. 


The first two equations are the same as in part c, so solve the third 
equation for C, again neglecting the multiple of € that appears. 
Then write down a third solution x?) (1) of the system (38). 

e. Write down a fundamental matrix W(r) for the system (38). 
f. Forma matrix T with the eigenvector €“) in the first column 
and the generalized eigenvectors 7) and ¢ in the second and third 
columns. Then find T~! and form the product J — T-'AT. The 
matrix J is the Jordan form of A. 


Consider the system 


3e =o =2 
x =Ax= 8 —5 -4]x (39) 
—4 3 3 


a. Show that r = 1 is a triple eigenvalue of the coefficient 
matrix A and that there are only two linearly independent 
eigenvectors, which we may take as 

1 0 
0j, &@ =| 2 
2 =3 


= (40) 


Write down two linearly independent solutions x“ (f) and 
x? (t) of equation (39). 

b. To find a third solution, assume that x = £fe' + ne’; then 
show that € and 7 must satisfy 


(A -D£ — 0, 
(A-Dn=6. 


c. Equation (41) is satisfied if € is an eigenvector, so one way 
to proceed is to choose € to be a suitable linear combination of 
€ and £O) so that equation (42) is solvable, and then to solve 
that equation for rj. However, let us proceed in a different way 
and follow the pattern of Problem 15. First, show that y satisfies 


(A — D?g — 0. 


(41) 
(42) 


Further, show that (A—I)? = 0. Thus n can be chosen arbitrarily, 
except that it must be independent of €) and £O. 

d. A convenient choice for 7 is 7 = (0, 0, 1)”. Find the cor- 
responding € from equation (42). Verify that € is an eigen- 
vector of A. 

e. Write down a fundamental matrix V(r) for the system (39). 
f. Forma matrix T with the eigenvector £ P in the first column 
and with the eigenvector € from part d and the generalized 
eigenvector 77 in the other two columns. Find T~! and form the 
product J = T-!AT. The matrix J is the Jordan form of A. 


AÀ 1 
Let J = i 3 where A is an arbitrary real number. 
a. Find P2, J’, and J‘. 


A" Anc 
b. Usean inductive argument to show that J" — ( 0 ý yn ) 


c. Determine exp(Jr). 


d. Use exp(Jt) to solve the initial value problem x’ = Jx, 
x(0) = x?. 
Let 
À 0 0 
J2|0 À 1], 
0 0 A 


where A is an arbitrary real number. 
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a. Find J2, J’, and J^. where A is an arbitrary real number. 
b. Use an inductive argument to show that a. Find J’, J’, and J^. 

n 0 0 b. Use an inductive argument to show that 

J’ = 0 An nvr} . An nA”! Tun DA" 

0 0 M £s 2 
c. Determine exp(Jt). 0 
d. Observe that if you choose À = 1, then the matrix J in this 
problem is the same as the matrix J in Problem 18f. Using the C. Determine exp(Jt). 
matrix T from Problem 18f, form the product Texp(Jt) with d. Note that if you choose A = 2, then the matrix J in this 
A=1. problem is the same as the matrix J in Problem 17f. Using the 
e. Is the resulting matrix the same as the fundamental matrix matrix T from Problem 17f, form the product Texp(Jt) with 
W(t) in Problem 18e? If not, explain the discrepancy. Aà = 2. The resulting matrix is the same as the fundamental matrix 

21. Let W(t) in Problem 17e. If not, explain the discrepancy. 
À 1 0 
J= | 0 À 1], 
0 0 À 
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In this section we turn to the nonhomogeneous system of linear first-order differential 
equations 


x =P(t)x+ g(t), (1) 


where the n x n matrix P(t) and n x 1 vector g(t) are continuous fora < t < f. By the 
same argument as in Section 3.5 (see also Problem 12 in this section), the general solution of 
equation (1) can be expressed as 


x 2 cix P (0) E x eux (0) + V(t), (2) 


where c)x‘(t) +--+ + c, x? (1) is the general solution of the corresponding homogeneous 
system x’ = P(t)x, and v(t) is a particular solution of the nonhomogeneous system (1). We 
will briefly describe several methods for determining v(t). 


Diagonalization. We begin with systems of the form 
x = Ax + g(t), (3) 


where A is ann x n diagonalizable constant matrix. By diagonalizing the coefficient matrix 
A, as indicated in Section 7.7, we can transform equation (3) into a system of equations that 
is readily solved. 

Let T be the matrix whose columns are the eigenvectors €, ... , €” of A, and define 
a new dependent variable y by 


x — Ty. (4) 
Then, substituting for x in equation (3), we obtain 
Ty = ATy + g(t). 
When we multiply this equation (on the left) by T~!, it follows that 
y =(TUAT)y + T'g(t) = Dy + h(t), (5) 


where h(t) = T~'g(t) and where D is the diagonal matrix whose diagonal entries are the 
eigenvalues rj, ... ,r, of A, arranged in the same order as the corresponding eigenvectors 
E€, ...,E™ that appear as columns of T. Equation (5) is a system of n uncoupled first- 
order linear differential equations for yı(t), ... , y,(t); as a consequence, the differential 
equations can be solved separately. In scalar form, equation (5) has the form 


yi) 2rjyj(0 +h), jolL....n, (6) 
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where A (1) is a certain linear combination of g1(1), ... , g,(1). Equation (6) is a first-order 
linear differential equation and can be solved by the methods of Section 2.1. In fact, we have 


t 
yj) = e f e*hj(s)ds+cje", j=1,...,n, (7) 
to 

where the c; are arbitrary constants. Finally, the solution x of equation (3) is obtained from 
equation (4). When multiplied by the transformation matrix T, the second term on the right- 
hand side of equation (7) produces the general solution of the homogeneous equation x’ = Ax, 
while the first term on the right-hand side of equation (7) yields a particular solution of the 
nonhomogeneous system (3). 


EXAMPLE 1 


Find the general solution of the system 


pe —2 1 2e EP , " 
x= 1-2 x+ 3t = Ax + g(t). (8) 


Solution: 


Proceeding as in Section 7.5, we find that the eigenvalues of the coefficient matrix are r; = —3 and 
ra = —1 and that the corresponding eigenvectors are 


eM = (1) and £O = G) (9) 


Thus the general solution of the homogeneous system is 


x=c (ese (10) 


Before writing down the matrix T of eigenvectors, we recall that eventually we must find T^ !. The 
coefficient matrix A is real and symmetric, so we can use the result stated just above Example 3 in 
Section 7.7: T7! is simply the adjoint, which here (since T is real) is just the transpose of T, provided 
that the eigenvectors of A are normalized to have length 1, that is, (£, €) = 1. Hence, because both 
€ and £ Ó have length 4/2, we define 


"O E oo ee eee edd A 
TA -1 ME and so EZ WE 


Letting x = Ty and substituting for x in equation (8), we obtain the following system of 
equations for the new dependent variable y: 
xad uates (12) 
SE BHEN oaJ * 5 Oe ta Bt) 
Thus 
1 -t 3 
yi +3y, = vRe — —t, 
A 


(13) 
3 
X*»-v2e T 


v 


Each of equations (13) is a first-order linear differential equation and so can be solved by the methods 
of Section 2.1. In this way we obtain 


2 3 t] 
y= v2 ., ( ) +cje, 


3 9 
(14) 


3 
y2 = vDte^ + —(t — 1) cce. 
2 


Fi 
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Finally, we write the solution in terms of the original variables: 


2 — 5 


adel ead ee Med TX E. is 
hp aje Ab Je talal €. pe Phe male 0 


where kı = c,/ v2 and ky = c2/ 4/2. The first two terms on the right-hand side of equation (15) 
form the general solution of the homogeneous system corresponding to equation (8). The remaining 
terms are a particular solution of the nonhomogeneous system. 


If the coefficient matrix A in equation (3) is not diagonalizable (because of repeated 
eigenvalues and a shortage of eigenvectors), it can nevertheless be reduced to a Jordan 
form J by a suitable transformation matrix T involving both eigenvectors and generalized 
eigenvectors. In this case the differential equations for y;, ... , y, are not totally uncoupled 
since some rows of J have two nonzero elements: an eigenvalue in the diagonal position and a 
1 in the adjacent position to the right. However, the equations for y1, ... , y, can still be solved 
consecutively, starting with y,. Then the solution of the original system (3) can be found by the 
relation x = Ty. 


Undetermined Coefficients. A second way to find a particular solution of the 
nonhomogeneous system (1) is the method of undetermined coefficients that we first discussed 
in Section 3.5. To use this method, we assume the form of the solution with some or all of 
the coefficients unspecified, and then seek to determine these coefficients so as to satisfy the 
differential equation. As a practical matter, this method is applicable only if the coefficient 
matrix P is a constant matrix, and if the components of g are polynomial, exponential, or 
sinusoidal functions, or sums or products of these. In these cases the correct form of the 
solution can be predicted in a simple and systematic manner. The procedure for choosing 
the form of the solution is substantially the same as that given in Section 3.5 for a single 
linear second-order differential equation. The main difference is illustrated by the case of 
a nonhomogeneous term of the form ue^', where A is a simple root of the characteristic 
equation. In this situation, rather than assuming a solution of the form are’, it is necessary to 
use ate + be*’, where a and b are determined by substituting into the differential equation. 


EXAMPLE 2 


Use the method of undetermined coefficients to find a particular solution of 


r —2 1 2e -— ! 16 
x= 12 x+ 3r = Ax + g(t). (16) 


Solution: 


This is the same system of equations as in Example 1. To use the method of undetermined 
coefficients, we write g(t) in the form 


NS M 17 
go =o) [Je (17) 


Observe that r = —1 is an eigenvalue of the coefficient matrix, and therefore we must include both 
ate™ and be™ in the assumed solution. Thus we assume that 


x=v(t) = ate" -- be ! 4- ct +d, (18) 
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where a, b, c, and d are vectors to be determined. Substituting equation (18) into equation (16) and 
collecting like terms, we obtain the following algebraic equations for a, b, c, and d: 


Aa — —a, 
Ab b E 
—a—b- 0 i 
0 (19) 
se=-(5). 
3 
Ad — c. 


From the first of equations (19), we see that a is an eigenvector of A corresponding to the eigenvalue 
r = —]. Thus a = (a,a)!, where a is any nonzero constant. Then we find that the second of 
equations (19) can be solved only if œ = 1 and that, in this case, 


-Q-0 ^ 


for any constant k. The simplest choice is k = 0, from which b = (0, —1)7. 
1 
Then the third and fourth of equations (19) yield c = (1, 2) T andd = — 3(^ 5) T respectively. 


Then, substituting a, b, c, and d into equation (18) we obtain the particular solution 


PY ze Wu ng A 1/4 
v= (1 et-(uje* tor 3l.) (21) 


The particular solution (21) is not identical to the one contained in equation (15) of Example 1 because 


the term in e™ is different. However, if we choose k = 2 in equation (20), then b = — 2i 1, 1) T 


and the two particular solutions agree. 


Variation of Parameters. Now letus turn to more general problems in which the coefficient 
matrix is not constant or not diagonalizable. Let 


x = P(r)x + g(t), (22) 


where P(t) and g(t) are continuous on o < t < 8. Assume that a fundamental matrix W(t) 
for the corresponding homogeneous system 


x’ = P(t)x (23) 
has been found. We use the method of variation of parameters to construct a particular solution, 
and hence the general solution, of the nonhomogeneous system (22). 

Since the general solution of the homogeneous system (23) is W(f)e, it is natural to 


proceed as in Section 3.6 and to seek a solution of the nonhomogeneous system (22) by 
replacing the constant vector c by a vector function u(t). Thus we assume that 


x = V(r)u(t), (24) 


where u(t) is a vector to be found. Upon differentiating x as given by equation (24) and 
requiring that equation (22) be satisfied, we obtain 


V'(r)u(f) + V(r)u(r) = P(t) W(t)u(t) +g). (25) 
Since W(t) is a fundamental matrix, W(t) = P(t) W(t); hence equation (25) reduces to 
Put) = g). (26) 


Recall that Y(t) is nonsingular on any interval where P is continuous. Hence V^! (1) exists, 
and therefore 


u(t) = V (ng). (27) 


Thus for u(t) we can select any vector from the class of vectors that satisfy equation (27). 
These vectors are determined only up to an arbitrary additive constant vector; therefore, we 
denote u(t) by 


u(t) = [ waar +e, (28) 
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where the constant vector c is arbitrary. If the integrals in equation (28) can be evaluated, then 
the general solution of the system (22) is found by substituting for u(t) from equation (28) 
in equation (24). However, even if the integrals cannot be evaluated, we can still write the 
general solution of equation (22) in the form 


x= woe wo | V !(s)g(s)ds, (29) 


where f, is any point in the interval (o, 9). The first term on the right-hand side of 
equation (29) is the general solution of the corresponding homogeneous system (23), and the 
second term is a particular solution of equation (22). 

Now let us consider the initial value problem consisting of the differential equation (22) 
and the initial condition 


x(t) = x. (30) 


We can find the solution of this problem most conveniently if we choose the lower limit of 
integration in equation (29) to be the initial point £y. Then the general solution of the differential 
equation is 


x= woe wo | V !(s)g(s)ds. (31) 


For t = tọ the integral in equation (31) is zero, so the initial condition (30) is also satisfied if 
we choose 


c= vV-!(nyx. (32) 


Therefore, 
t 
x = W()9- (x + V) f V7! (s)gs)ds 63) 
to 


is the solution of the given initial value problem. Again, although it is helpful to use V ^! to 
write the solutions (29) and (33), it is usually better in particular cases to solve the necessary 
equations by row reduction than to calculate V! and to substitute into equations (29) and 
(33). 

The solution (33) takes a slightly simpler form if we use the fundamental matrix (1) 
satisfying ®(t 9) = I. In this case we have 


x= bx! eo | $ !(s)g(s)ds. (34) 


Equation (34) can be simplified further if the coefficient matrix P(t) is a constant matrix (see 
Problem 16). 


EXAMPLE 3 


Use the method of variation of parameters to find the general solution of the system 


3 —2 1 ert = : 45 
x = um x+ 3r = Ax + g(t). (35) 


This is the same system of equations as in Examples 1 and 2. 


Solution: 


The general solution of the corresponding homogeneous system was given in equation (10). Thus 


e? et 
W(t) = Ge 2 36) 
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is a fundamental matrix. Then the solution x of equation (35) is given by x = W(r)u(7), where u(t) 


satisfies W(f)u'(t) = g(t), or 
eg? e ui 2e^! 37 
—e9 e)un Nt JE ep 


Solving equation (37) by row reduction, we obtain 


uj = e” — ste 
3 
,=1+4+ ite’. 
Uy + 2 e 
Hence 
1 1 1 
u(t) = ae end zte” + Tc, 
3 3 
uy(t) =t + rad — xe +c, 
and 


x = W(r)u(r) 


sal isal rs mE S 
CENE. See Nat ey T. 1) 32] BAS) 


which is the same as the solution obtained in Example 1 (compare with equation (15)) and is 
equivalent to the solution obtained in Example 2 (compare with equation (21)). 


Laplace Transforms. We used the Laplace transform in Chapter 6 to solve linear equations 
of arbitrary order. It can also be used in very much the same way to solve systems of equations. 
Since the transform is an integral, the transform of a vector is computed component by 
component. Thus £{x(t) } is the vector whose components are the transforms of the respective 
components of x(t), and similarly for Z(x'(1)). We will denote C(x(1)] by X(s). Then, by an 
extension of Theorem 6.2.1 to vectors, we also have 


L{x'(t)} = sX(s) — x(0). (39) 


EXAMPLE 4 


Use the method of Laplace transforms to solve the system 


7 i e e err. 40 
x = 1-2 x+ by = Ax + g(t). (40) 


This is the same system of equations as in Examples 1, 2, and 3. 


Solution: 
We take the Laplace transform of each term in equation (40), obtaining 
sX(s) — x(0) = AX(s) + G(s), (41) 
where G(s) is the transform of g(t). The transform G(s) is given by 
2 


G(s) = 1). (42) 


s+ 
3 
s2 


To proceed further we need to choose the initial vector x(0). For simplicity let us choose x(0) = 0. 
Then equation (41) becomes 


(sI — A)X(s) = G(s), (43) 
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where, as usual, Lis the 2 x 2 identity matrix. Consequently, X(s) is given by 
X(s) = (sI — A) !G(s). (44) 


The matrix (sI — A) ^! is called the transfer matrix because multiplying it by the transform of the 
input vector g(t) yields the transform of the output vector x(t). In this example we have 


s+2 E 
sI-A= ] (45) 
—1 s+2 


and by a straightforward calculation, we obtain 


(I-A) t= l es : ) (46) 


"(s-DG-3V 1 542 


Then, substituting from equations (42) and (46) in equation (44) and carrying out the 
indicated multiplication, we find that 


2(s 4-2) 3 
1)? 3 A 1 3 
X) = (s4-1)(s +3)  s?(s +1)(s 4-3) l (47) 
2 3(s + 2) 


(s + 1)2(s + 3) s2(s +1)(s +3) 


Finally, we need to obtain the solution x(t) from its transform X(s). This can be done by expanding 
the expressions in equation (47) in partial fractions and using Table 6.2.1, or (more efficiently) by 
using appropriate computational tools. In any case, after some simplification the result is 


a it Oe Tee ee quu m 
Ts dim LIU Oa E hee (48) 


Equation (48) gives the particular solution of system (40) that satisfies the initial condition x(0) = 0. 
As a result, it differs slightly from the particular solutions obtained in the preceding three examples. 
To obtain the general solution of equation (40), you must add to the expression in equation (48) the 
general solution (10) of the homogeneous system corresponding to equation (40). 


Each of the methods for solving nonhomogeneous equations has its own advantages and 
disadvantages. The method of undetermined coefficients requires no integration, but it is 
limited in scope and may entail the solution of several sets of algebraic equations. The method 
of diagonalization requires finding the inverse of the transformation matrix and the solution of 
a set of uncoupled first-order linear differential equations, followed by a matrix multiplication. 
Its main advantage is that for Hermitian coefficient matrices, the inverse of the transformation 
matrix can be written down without calculation—a feature that is more important for large 
systems. The method of Laplace transforms involves a matrix inversion to find the transfer 
matrix, followed by a multiplication, and finally by the determination of the inverse transform 
of each term in the resulting expression. It is particularly useful in problems with forcing 
functions that involve discontinuous or impulsive terms. Variation of parameters is the most 
general method. On the other hand, it involves the solution of a set of linear algebraic equations 
with variable coefficients, followed by an integration and a matrix multiplication, so it may 
also be the most complicated from a computational viewpoint. For many small systems with 
constant coefficients, such as the one in the examples in this section, all of these methods work 
well, and there may be little reason to select one over another. 


Problems 


In each of Problems 1 through 8 find the general solution of the given 1 1 et 
system of equations. 3. 4 —2 —~2e! 
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1 1 2 

Jo t 

5. x= (| "2 (i): 
2 -1 1 

t an, t 

6. x= E 3 (i) 


2 —5 0 
8. x’ = x+ , 0«t«m 
] -2 cost 


9. The electric circuit shown in Figure 7.9.1 is described by the 
system of differential equations 


1 1 
E o 1 
dx 2 8 z 
, ae 0 


where x, is the current through the inductor, x? is the voltage drop 
across the capacitor, and /(t) is the current supplied by the external 
source. 
a. Determine a fundamental matrix W(t) for the homogeneous 
system corresponding to equation (49). Refer to Problem 20 of 
Section 7.6. 
b. If I(t) = e~*/?, determine the solution of the system (49) 
that also satisfies the initial conditions x(0) — 0. 


L = 8 henrys 


(e) 
/ C= 3 farad 


R=4ohms R=4ohms 


AAIR AAN The circuit in Problem 9. 


In each of Problems 10 and 11, verify that the given vector is the 
general solution of the corresponding homogeneous system, and then 
solve the nonhomogeneous system. Assume that t > 0. 


2 =l 1-2 

10. EE ! 

s (| al a 
IY 


12. Let x = (ft) be the general solution of x’ = P(t)x + g(t), 
and let x = v(t) be some particular solution of the same system. By 
considering the difference c(t) —v(t), show that b(t) = u(t) 4-v(t), 
where u(t) is the general solution of the corresponding homogeneous 
system x’ = P(r)x. 

Alternate Derivation of Variation of Parameters. When we 
first encountered variation of parameters for a second-order linear 
differential equation in Section 3.6 and again for higher-order linear 


differential equations in Section 4.4, some of the equations used 
to determine the unknown variable coefficients appeared to have 
been chosen primarily to prevent higher-order derivatives of the 
unknown variable coefficients from entering into the process. In fact, 
as we show in Problems 13 through 15, the variation of parameter 
equations are completely explained when viewed from the perspective 
of the equivalent system of first-order linear differential equations. 
Problems 13 and 14 reconsider two problems from Section 3.6; 
Problem 15 shows that this connection is true for any second-order 
linear differential equation. The same ideas can be used to explain 
variation of parameters for higher-order linear differential equations. !° 
In Problems 13 and 14, you are given a nonhomogeneous 
second-order linear differential equation and two linearly independent 
solutions, y, and y», to the corresponding homogeneous differential 
equation. Use this information to complete the following steps: 
a. Find the equivalent nonhomogeneous system of first-order 
linear differential equations for x; = y and x» = y’. 
b. Show that x? = (yj, y)? and x? = (y, yj)" are 
solutions to the homogeneous system of differential equations 
corresponding to the system found in a. (As a consequence, 
w= (x() |x@)) is a fundamental matrix for the same homo- 
geneous system.) 
c. Find the variation of parameters equations that have to be 
satisfied for y = y,(t)u,(t) + yo(t)u2(t) to be a particular 
solution of the given nonhomogeneous second-order differential 
equation. 
d. Find the variation of parameters equations that have to be 
satisfied for x = W(t)u(t) to be a particular solution of 
the nonhomogeneous system of first-order linear differential 
equations found in a. 
e. Use the definition of x“) and x in b to show that the 
systems of equations found in c and the equations found in d are 
equivalent. 


13. y”—Sy’+6y = 2e!, y, = e”, y; = e? (Problem 1, Section 3.6) 


14. D?y"—t(t -2)y' c (t -2)y = 2P (t > 0), y =t, yo = te 
(Problem 11, Section 3.6) 

15. Carry out steps a through e for the general nonhomogeneous 
second-order linear differential equation y" + p(t) y' 4-q(1) y = g(t), 
where y; = y;(f) and y2 = y2(t) form a fundamental set of solutions 
to the corresponding homogeneous differential equation. 


16. Consider the initial value problem 
x = Ax +g), x(0) =x°. 


a. By referring to Problem 12c in Section 7.7, show that 
x= (t)x? + f (t — s)g(s)ds. 
0 
b. Show also that 
x = exp(Ar)x? + [ exp(A(t — s)) g(s)ds. 
0 


Compare these results with those of Problem 22 in Section 3.6. 


lÜThese problems were motivated by correspondence with Weishi Liu, 
University of Kansas. 


17. Use the Laplace transform to solve the system 


pope A 2e-t 
v=( l ahg =o (50) 


used in the examples in this section. Instead of using zero initial 
conditions, as in Example 4, let 
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x(0) = (2). (51) 
a2 


where a, and o» are arbitrary. How must a, and o» be chosen so 
that the solution is identical to equation (38)? 
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Up to this point we have discussed methods for solving differential equations by using 
analytical techniques such as integration or series expansions. Usually, the emphasis was on 
finding an exact expression for the solution. Unfortunately, there are many important problems 
in engineering and science, especially nonlinear ones, to which these methods either do not 
apply or are very complicated to use. In this chapter we discuss an alternative approach, the use 
of numerical approximation methods to obtain an accurate approximation to the solution of an 
initial value problem. We present these methods in the simplest possible context, namely, 
a single scalar first-order equation. However, they can readily be extended to systems of 
first-order equations, and this is outlined briefly in Section 8.5. The procedures described 
here can be executed easily on a wide variety of computational devices, from smartphones 
to supercomputers. 
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To discuss the development and use of numerical approximation procedures, we will 
concentrate mainly on the first-order initial value problem consisting of the differential 
equation 

dy 

— = Jtt, 1 

g 0 USD (1) 


and the initial condition 


y(to) = yo- (2) 


We assume that the functions f and f, are continuous on some rectangle in the ty-plane 
containing the point (to, yo). Then, by Theorem 2.4.2, there exists a unique solution y = ¢ (t) 
of the given problem in some interval about fo. If equation (1) is nonlinear, then the interval 
of existence of the solution may be difficult to determine and may have no simple relationship 
to the function f. However, in all our discussions we assume that there is a unique solution of 
the initial value problem (1), (2) in the interval of interest. 

In Section 2.7 we described the oldest and simplest numerical approximation method, 
namely, the Euler or tangent line method. To derive this method, let us write the differential 
equation (1) at the point t = f, in the form 


do u 
a = f(t: $(1,)). (3) 


Then we approximate the derivative in equation (3) by the corresponding (forward) difference 
quotient, obtaining 


Cha) — [167 


tnt — Ín 


l & (i (ty). 2 


Finally, if we replace $(1,,1) and @(t,) by their approximate values y,,; and yn, respectively, 
and solve for y,,,, we obtain the Euler formula 


Yn+1 = yn + fta, Yn) (tati = fi); n = 0, 1, 2, eero (5) 
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If the step size t, 4; — t, has a uniform value A for all n and if we denote f(t,, Yn) by fn, then 
equation (5) simplifies to 


Yn+1 = Yn + hfn, n —0,1, 2, eth (6) 
Euler's method consists of repeatedly evaluating equation (5) or (6), using the result of each 
step to execute the next step. In this way we obtain a sequence of values yo, yj, Yo, <<- 5 Yns +++ 
that approximate the values of the solution ¢ (t) at the points to, ti, £5, ... p tns =<. 


A computer program for Euler's method has a structure such as that shown below. The 
specific instructions can be written in any convenient programming language. 


The Euler Method 


Step 1. define f(t, y) 

Step 2. input initial values ż = 10 and y = yO 
Step 3. input step size h and number of steps n 
Step 4. output /0 and yO 

Step 5. for j from 1 to n do 


Step 6. fn = f(t, y) 


y=y+ h x " 

t=t+h 
Step 7. output t and y 
Step 8. end 


Some examples of Euler's method appear in Section 2.7. As another example, consider 
the initial value problem 


y =1-1+4y, (7) 
y(0) — I. (8) 


Equation (7) is a first-order linear equation, and you can easily verify that the solution 
satisfying the initial condition (8) is 


1 
a EU ee dE (9) 


Since the exact solution is known, we do not need numerical methods to approximate the 
solution of the initial value problem (7), (8). On the other hand, the availability of the exact 
solution makes it easy to monitor the accuracy of any numerical procedure that we use on this 
problem. We will use this problem throughout the chapter to illustrate and to compare different 
numerical methods. The solutions of equation (7) diverge rather rapidly from each other, so 
we should expect that it will be fairly difficult to approximate the solution (9) well over any 
interval of moderate length. Indeed, this is the reason for choosing this particular problem; it 
will be relatively easy to observe the benefits of using more efficient methods. 


EXAMPLE 1 


Using the Euler formula (6) and step sizes h = 0.05, 0.025, 0.01, and 0.001, determine approximate 
values of the solution y = ¢ (t) of the problem (7), (8) on the interval 0 < t < 2. 


Solution: 


The indicated calculations were carried out on a computer, and some of the results are shown in 
Table 8.1.1. Their accuracy is not particularly impressive. For h = 0.01 the percentage error is 
3.85% at t = 0.5, 7.49% at t = 1.0, and 14.4% at t = 2.0. The corresponding percentage errors for 
h = 0.001 are 0.40%, 0.79%, and 1.58%, respectively. Observe that if h = 0.001, then it requires 
2000 steps to traverse the interval from t = 0 to t = 2. Thus considerable computation is needed 
to obtain even reasonably good accuracy for this problem using the Euler method. When we discuss 
other numerical approximation methods later in this chapter, we will find that it is possible to obtain 
comparable or better accuracy with much larger step sizes and many fewer computational steps. 
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A Comparison of Results for the Numerical Approximation of 
TABLE 8.1.1 the Solution of y’ = 1 — t + 4y, y (0) = 1 Using the Euler 
Method for Different Step Sizes h 


h = 0.05 h = 0.025 h=0.01 h = 0.001 
0.0 1.0000000 1.0000000 1.0000000 1.0000000 1.0000000 
0.1 1.5475000 1.5761188 1.5952901 1.6076289 1.6090418 
0.2 2.3249000 2.4080117 2.4644587 2.5011159 2.5053299 
0.3 3.4333560 3.6143837 3.7390345 3.8207130 3.8301388 
0.4 5.0185326 5.3690304 5.6137120 5.7754845 5.7942260 
0.5 7.2901870 7.9264062 8.3766865 8.6770692 8.7120041 
1.0 45.588400 53.807866 60.037126 64.382558 64.897803 
1:5 282.07187 361.75945 426.40818 473.55979 479.25919 
2.0 1745.6662 2432.7878 3029.3279 3484. 1608 3540.2001 


To begin to investigate the errors in using numerical approximations, and also to suggest 
ways to construct more accurate algorithms, it is helpful to mention some alternative ways to 
look at the Euler method. 

One way to proceed is to write the problem as an integral equation. Let y = @(f) be the 
solution of the initial value problem (1), (2); then, by integrating from t, to f,,., we obtain 


[ERI laşi 


$'(r)dt = f(t, pA) dt, 


tn In 


or 


tnt 


$43) = 65) + f(t, o) dt. (10) 
tn 
The integral in equation (10) is represented geometrically as the area under the curve in 
Figure 8.1.1 between t = ¢, and f = t,+ı. If we approximate the integral by replacing 
f(t, d(t)) by its value f(t,, O(t,)) att = tn, then we are approximating the actual area 
by the area of the shaded rectangle. Assuming each step is of size h, that is, t41 — t, = h for 
all n, we obtain 


(tapı) = O (tn) F fCin, $ (tn) ) (trys - tn) 
= (tn) + hf (tn, P(tn))- (11) 


Finally, to obtain an approximation y,44 for $ (1,41), we make a second approximation by 
replacing #(t,) by its approximate value y, in equation (11). This gives the Euler formula 
Yn+1 = yn + hf (tn, Yn). A more accurate algorithm can be obtained by approximating the 
integral more accurately. This is discussed in Section 8.2. 


y'A 


y'= f(t, ot) 


Ftna: o(t,)) 


> 
t 


ty tn+1 


| FIGURE 8.1.1 | Integral derivation of the Euler method. 


8.1 The Euler or Tangent Line Method 


Another approach is to assume that the solution y = (t) has a Taylor series about the 
point tn. Then 


h2 
$ (t, th) = Ptn) - ó'(G)h + XORT T 


or 


h2 
(toot) = O(tr) + fI, O(n) h + COE Ten (12) 


If the series is terminated after the first two terms, and ¢(t,+1) and ¢ (t) are replaced by 
their approximate values y,,, and y,, we again obtain the Euler formula (6). If more terms in 
the series are retained, a more accurate formula is obtained. Further, by using a Taylor series 
with a remainder, it is possible to estimate the magnitude of the error in the formula. This is 
discussed later in this section. 


The Backward Euler Formula. A variation on the Euler formula can be obtained by 
approximating the definite integral in integral equation (10) by the area of the rectangle with 
height determined by the value of the function at the right-hand end of the interval. That is, 


Q toi) = $ (15) RE F (teens O D) ssi E tn). 
This yields what is commonly known as the backward Euler formula 
Yat = Yn + Af tiis Ynş1)- (13) 


Assuming that y, is known and y„+ı is to be calculated, observe that equation (13) does not 
provide an explicit formula for y,,,. Rather, it is an equation that implicitly defines y, 
and must be solved to determine the value of y„+1. For this reason this method is sometimes 
referred to as the implicit Euler formula. How difficult it is to solve for y, ; depends entirely 
on the nature of the function f. 


EXAMPLE 2 


Use the backward Euler formula (13) and step sizes h = 0.05, 0.025, 0.01, and 0.001 to find 
approximate values of the solution of the initial value problem (7), (8) on the interval 0 < t < 2. 


Solution: 


For this problem, the backward Euler formula (13) becomes 
Yn+1 = Yn + h(1 — Igi Ayn41)- 


Because the differential equation (7) is linear, the above equation for y,,; is also linear. Solving this 
equation for y, 4), we obtain 


Yn + h(1— tayı) 
Yn+1 = 1-4h C 
At the first step with h = 0.05 and n = 0, we have 


Solving this equation for yı, we obtain 


_ 1.0475 


= = 1.309375. 
Hag 
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Then, using the formula for y,,, with n = 1, we find that 
yo = yı - h(1— t + 4y2) = 1.309375 + (0.05) (1 — 0.1 + 4y2), 


and thus 
.. 1.354375 
m WS 

Continuing the computations, we obtain the results shown in Table 8.1.2. The values given by 

the backward Euler method are uniformly too large for this problem, whereas the values obtained 

from the Euler method were too small. In this problem the errors are somewhat larger for the 

backward Euler method than for the Euler method, although for small values of h the differences 

are insignificant. 


— 1.69296875. 


A Comparison of Results for the Numerical Approximation of the 
TABLE 8.1.2 Solution of y’ = 1 — t + 4y, y(0) = 1 Using the Backward Euler 
Method for Different Step Sizes h 


h = 0.05 h — 0.025 h — 9.01 h — 0.001 
0.0 1.0000000 1.0000000 1.0000000 1.0000000 1.0000000 
0.1 1.6929688 1.6474375 1.6236638 1.6104634 1.6090418 
0.2 2.7616699 2.6211306 2.5491368 2.5095731 2.5053299 
0.3 4.4174530 4.0920886 3.9285724 3.8396379 3.8301388 
0.4 6.99055 16 6.3209569 5.9908303 5.8131282 5.942260 
0.5 10.996956 9.7050002 9.0801473 8.7472667 8.7120041 
1.0  103.06171 80.402761 10.452395 65.419964 64.897803 
1.5 959.44236 661.00731 542.12432 485.05825 479.25919 
2.0  8934.0696 5435.7294 4172.7228 3597.4478 3540.2001 


Observe that the backward Euler formula gets its name from the fact that it can be obtained 


1 
by using the backward difference quotient — (ó (tr) — $t 1) ) to approximate the derivative 
n 


in the differential equation (3) instead of the forward difference quotient used in equation (4). 

Since the backward Euler method appears to be no more accurate than the Euler method, 
and is somewhat more complicated, a natural question is why it should even be mentioned. 
The answer is that it is the simplest example of a class of methods known as backward 
differentiation formulas that are very useful and more accurate for certain types of differential 
equations. We will return to this issue at the end of Section 8.4. 


Errorsin Numerical Approximations. The use of a numerical procedure, such as the Euler 
formula, on an initial value problem raises a number of questions that must be answered 
before the approximate numerical solution can be accepted as satisfactory. One of these is the 
question of convergence. That is, as the step size h tends to zero, do the values of the numerical 
approximation y1, Yo. ... >Yn, .-- approach the corresponding values of the actual solution? 
If we assume that the answer is affirmative, there remains the important practical question of 
how rapidly the numerical approximation converges to the solution. In other words, how small 
a step size is needed in order to guarantee a given level of accuracy? We want to use a step 
size that is small enough to ensure the required accuracy, but not too small. An unnecessarily 
small step size slows down the calculations and in some cases may even cause a loss of 
accuracy. 

There are three fundamental sources of error in approximating the solution of an initial 
value problem numerically. 


1. The formula, or algorithm, used in the calculations is an approximate one. For instance, 
the Euler formula uses straight-line approximations to the actual solution. 


2. Except for the first step, the input data used in the calculations are only approximations 
to the actual values of the solution at the specified points. 
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3. The computer used for the calculations has finite precision; in other words, at each stage 
only a finite number of digits can be retained. 


Let us temporarily assume that our computer can execute all computations exactly; that is, 
it can retain infinitely many digits (if necessary) at each step. Then the difference E, between 
the solution y = $(t) of the initial value problem (1), (2) and its numerical approximation y; 
at the point t = f, is given by 


En = (tn) — Yn. (14) 


The error E, is known as the global truncation error. It arises entirely from the first 
two error sources listed above—that is, by applying an approximate formula to approximate 
data. 

However, in reality we must carry out the computations using finite-precision arithmetic, 
which means that we can keep only a finite number of digits at each step. This leads to a 
round-off error R, defined by 


R, = Yn — Yn, (15) 


where Y, is the value actually computed from the given numerical method. 
The absolute value of the total error in computing ¢ (f,) is given by 


I$ (tn) — Yn| = 16 (tn) — Yn + Yn — Ynl. (16) 
Making use of the triangle inequality, |a + b| < |a| + |b|, we obtain, from equation (16), 


IO (th) — Yal € lé) — yal + lys — Yanl 
< |E,| + [Ra]. (7) 


Thus the total error is bounded by the sum of the absolute values of the global truncation and 
round-off errors. 

For the numerical procedures discussed in this book, it is possible to obtain useful 
estimates of the global truncation error. The round-off error is more difficult to analyze, since it 
depends on the type of computer used, the sequence in which the computations are carried out, 
the method of rounding off, and so forth. A careful examination of round-off error is beyond 
the scope of this book, but see, for example, the book by Henrici listed in the references. Some 
of the dangers of round-off error are discussed in Problems 22 through 24 and in Section 8.6. 

It is often useful to consider separately the part of the global truncation error that is due 
only to the use of an approximate formula. We can do this by assuming at the nth step that the 
input data are accurate that is, that y, = @(t,). This error is known as the local truncation 
error; we will denote it by e,. 


Local Truncation Error for the Euler Method. Assume that the solution y = ¢(t) of the 
initial value problem (1), (2) has a continuous second derivative in the interval of interest. To 
ensure this, we can assume that f, f;, and f, are continuous. For if f has these properties and 
if é is a solution of the initial value problem (1), (2), then 


o(t) = f(t, $), 
and, by the chain rule, 
b(t) = fit, O(t)) + HC, PADA 
= fit, 01) + AG OM) fü, OC). (18) 


Since the right-hand side of this equation is continuous, $" is also continuous. 
Then, making use of a Taylor polynomial with a remainder to expand @ about f,, we 
obtain 


1 - 
Plta +h) = (t) + o'(t)h + 59 (GR, (19) 
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where f, is some point in f, < f, < tn + h. Then, noting that ó (t, +h) = (t)41) and 
b'(tn) = f (tr, 6(t,)), we can rewrite equation (19) as 


1 — 
Cnt) = Gn) FAP Cn, O(tn)) + 59 Gon". (20) 


Now let us use the Euler formula to calculate an approximation to $(f,,;) under the 
assumption that we know the correct value for y, at tn, namely y, = ó(t,). The result is 


Yn = (tn) T hf (tn, $ (15)). Q1) 


where the asterisk is used to designate this hypothetical approximate value for 9 (1,44). The 
difference between $(1,,,) and y; +1 is the local truncation error for the (n + 1)“ step in 
the Euler method, which we will denote by e,,. Thus, by subtracting equation (21) from 
equation (20), we find that 


1 Nes 
€n41 = @(tn41) aes Nu = 5? (5), Q2) 


since the remaining terms in equations (20) and (21) cancel. 

Thus the local truncation error for the Euler method is proportional to the square of the 
step size h, and the proportionality factor depends on the second derivative of the solution ¢. 
The expression given by equation (22) depends on n and, in general, is different for each step. 
A uniform bound, valid on an interval [a, b], is given by 


1 2 
len] < ; Mh, (23) 


where M is the maximum of ló"(t) | on the interval [a, b]. Since equation (23) is based on 
a consideration of the worst possible case—that is, the largest possible value of l") |—it 
may well be a considerable overestimate of the actual local truncation error in some parts of 
the interval [a, b]. 

One use of equation (23) is to choose a step size that will result in a local truncation error 
no greater than some given tolerance level. For example, if the local truncation error must be 
no greater than e, then from equation (23) we have 


1 2 2€ 
-Mh <e or h< 4| —. (24) 
2 M 


The primary difficulty in using equation (22), (23), or (24) lies in estimating |o" | or M. 
However, the central fact expressed by these equations is that the local truncation error is 
proportional to h?. For example, if a new value of h is used that is one-half of its original 
value, then the resulting error will be reduced to one-fourth of its previous value. 

More important than the local truncation error is the global truncation error E,. The 
analysis for estimating E, is much more difficult than that for e,. Nevertheless, it can be shown 
that the global truncation error in using the Euler method on a finite interval is no greater than 
a constant times A. Thus 


|En| < Kh (25) 


for some constant K ; see Problem 20 for more details. The Euler method is called a first-order 
method because its global truncation error is proportional to the first power of the step size. 

Because it is more accessible, we will hereafter use the local truncation error as our 
principal measure of the accuracy of a numerical method and for comparing different methods. 
If we have a priori information about the solution of the given initial value problem, we can 
use the result (22) to obtain more precise information about how the local truncation error 
varies with f. 

As an example, consider the illustrative problem 


y =1-t+4y, y(0)=1 (26) 


on the interval 0 < rt < 2. Let y = ó(t) be the solution of the initial value problem (26). 
Then, as noted previously, 


o(t) = a(t a: 19¢*") 
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and therefore 


o" (t) = 19e*. 
Equation (22) then states that 
19e*» n? = 
eni m My da <n < tn +h. (27) 


The appearance of the factor 19 and the rapid growth of e^ explain why the results in 
Table 8.1.1 are not very accurate. 
For instance, for A = 0.05 the error in the first step is 


|. 19e*^(0.0025) 


e =¢(t)- y = 2 , 0< ft < 0.05. 


It is clear that e; is positive, and since e^^ < e9? we have 


19e°7(0.0025) |. 
= 0.02901. (28) 


e| X 
2 


- 19 
Note also that ef% > 1; hence e) > 5 (0-0025) = 0.02375. The actual error is 


0.02542. It follows from equation (27) that the error becomes progressively worse with 
increasing f; this is also clearly shown by the results in Table 8.1.1. Similar computations 
for bounds for the local truncation error give 


19e3-8(0.002 19e4(0.002 
1.0617 = 1° oo 2 < em < OO) = 1.2967 (29) 


in going from 0.95 to 1.0 and 


19e75(0.0025 19e8(0.0025 
» l < e40 S DINE E ) = 70.80 (30) 


57.96 = 


in going from 1.95 to 2.0. 

These results indicate that for this problem, the local truncation error is about 2500 times 
larger near t = 2 than near t = 0. Thus, to reduce the local truncation error to an acceptable 
level throughout 0 < t < 2, we must choose a step size h based on an analysis near t = 2. 
Of course, this step size will be much smaller than necessary near t = 0. For example, to 
achieve a local truncation error of 0.01 for this problem, we need a step size of about 0.00059 
near £ = 2 and a step size of about 0.032 near t = 0. The use of a uniform step size that is 
smaller than necessary over much of the interval results in more calculations than necessary, 
more time consumed, and possibly more danger of unacceptable round-off errors. 

Another approach is to keep the local truncation error approximately constant throughout 
the interval by gradually reducing the step size as ¢ increases. In the example problem, we 
would need to reduce h by a factor of about 50 in going from t = 0 tot = 2. A method 
that provides for variations in the step size is called adaptive. All modern computer codes for 
solving differential equations have the capability of adjusting the step size as needed. We will 
return to this question in the next section. 


Problems 


@ 1. Complete the calculations leading to the entries in columns In each of Problems 3 through 7, find approximate values of the 
three and four of Table 8.1.1. solution of the initial value problem at t = 0.1, 0.2, 0.3, and 0.4. 
o 2. Complete the calculations leading to the entries in columns Oo a. Use the Euler method with h = 0.05. 
three and four of Table 8.1.2. Q b. Use the Euler method with h = 0.025. 
Q c. Use the backward Euler method with h = 0.05. 
Q d. Use the backward Euler method with h = 0.025. 


362  CHAPTER8 Numerical Methods 


3. y'=5t-3/y, y(0) =2 

4. y'’=2y-3t, y(0)=1 

5. y'-2tte, y(0) = 1 

6. y =(974+2ry)/(3407), y(0) =0.5 
7. y -(?-y5sny y(0 2-1 


In each of Problems 8 through 12, find approximate values of the 
solution of the initial value problem at t = 0.5, 1.0, 1.5, and 2.0. 

@ a. Use the Euler method with ^ = 0.025. 

[N | b. Use the Euler method with h = 0.0125. 

(9 c. Use the backward Euler method with h = 0.025. 

[N ] d. Use the backward Euler method with h = 0.0125. 


8. y'=0.5-14+2y, y(0)=1 
9. y 25: -3Jy, y(0) =2 
10. y'=2%+e%, (0) =1 


11. y'=(4-ty)/(1+y?), y02-2 
12. y'= (y? +2ty)/(3 +13, y(0) =0.5 


13. Using three terms in the Taylor series given in equation (12) and 
taking h = 0.1, determine approximate values of the solution of the 
illustrative example y' = 1 — t + 4y, y(0) = l att = 0.1 and 0.2. 
Compare the results with those using the Euler method and with the 
exact values. Hint: If y' = f(t, y), what is y”? 


In each of Problems 14 and 15, 
@ a. Estimate the local truncation error for the Euler method in 
terms of the solution y = ¢(ft). 
@ b. Obtain a bound for e, , , in terms of t and ¢ (1) that is valid 
on the interval 0 <r < 1. 
Q c. By using a formula for the solution, obtain a more accurate 
error bound for e,,,. 
Q d. For h = 0.1 compute a bound for e; and compare it with 
the actual error at t = 0.1. 
@ e. Compute a bound for the error e4 in the fourth step. 


14. y'22y-1, y(0) =1 
y(0) 21 


In each of Problems 16 through 18, obtain a formula for the local 
truncation error for the Euler method in terms of t and the exact 
solution y = ó(t). 


15. y 21-t42y, 


16. y'=5t-3yy, y(0) =2 
17. w= tty, yl) 23 
18. y'22t re, y(0) =1 


19. Consider the initial value problem 


y —cos(5zt), y(00-l. 


@ a. Determine approximate values of @(f) att = 0.2, 0.4, 
and 0.6 using the Euler method with h = 0.2. 

@ b. Determine the solution y = ¢(t), and draw a graph of 
y=¢(t) forO <t <1. 

Q c. Draw a broken-line graph for the approximate solution, 
and compare it with the graph of the exact solution. 

@ d. Repeat the computation of part a for 0 < t < 0.4, but take 
h — 0.1. 

Q e. Show by computing the local truncation error that neither 
of these step sizes is sufficiently small. 


O f. Determine a value of h to ensure that the local truncation 
error is less than 0.05 throughout the interval 0 < t < 1. That 
such a small value of h is required results from the fact that 
max |ó"(1)| is large. 


20. Inthis problem we discuss the global truncation error associated 
with the Euler method for the initial value problem y' = f(t, y), 
y(to) = yo. When the functions f and f, are continuous in a 
closed, bounded region R of the ry-plane that includes the point 
(to, yo), it can be shown that there exists a constant L such that 
|f. y - f(t, »| < L|y — y|, where (t, y) and (t, y) are any two 
points in R with the same ¢ coordinate (see Problem 14 of Section 
2.8). Further, we assume that f, is continuous, so the solution o has a 
continuous second derivative. 


a. Using equation (20), show that 
[Enl Ss lEn] + h| f (tn, O (t4)) — f tu. Yn) 


1 
+5h°16"in)| 


(31) 


€ a|E,| + Bh’, 


1 
where a = 1+ AL and B = max zo" On fy € f X fg. 


b. Assume that if Ey = 0, and if |E,| satisfies equation (31), 
then |E,| < Bh?(a” — 1)/(a — 1) fora z 1. Use this result to 
show that 


(1+AL)"-1 
L 


Equation (32) gives a bound for |E,| in terms of h, L, n, and 
B. Notice that for a fixed h, this error bound increases with 
increasing n; that is, the error bound increases with distance from 
the starting point fg. 

€. Show that (1 + AL)" < e””}; hence 


enl = 
Base. Bh 


[Enl < Bh. (32) 


If we select an ending point T greater than tọ and then choose 

the step size h so that n steps are required to traverse the interval 

[to. T], then nh = T — to, and 
(T-L _ 

|En| < Bh = Kh, 

which is equation (25). Note that K depends on the length T — to 

of the interval and on the constants L and 6 that are determined 

from the function f. 


21. Derive an expression analogous to equation (22) for the local 
truncation error for the backward Euler formula. Hint: Construct a 
suitable Taylor approximation to $ (t) about t = f, 4. 


22. Using a step size h — 0.05 and the Euler method, but retaining 
only three digits throughout the computations, determine approximate 
values of the solution at tf = 0.1, 0.2, 0.3, and 0.4 for each of the 
following initial value problems: 


Qa. y'=1-14+4y, yO) =1 
Ob. y =3+t-y, y(0)=1 
Qc. y =2y-3t, y(0)=1 


Compare the results of a with those obtained in Example 1 and in 
Problem 1 and the results of c with those obtained in Problem 4. The 
small differences between some of those results rounded to three digits 
and the present results are due to round-off error. The round-off error 
would become important if the computation required many steps. 


23. The following problem illustrates a danger that occurs because 
of round-off error when nearly equal numbers are subtracted and the 
difference is then multiplied by a large number. Evaluate the quantity 


6.010 
2.004 


18.04 


1909; | 6.000 


in the following ways: 
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O b. First round each entry in the determinant to three digits. 
@ c. Retain all four digits. Compare this value with the results 
in parts a and b. 


24. The distributive law a(b — c) = ab — ac does not hold, in 
general, if the products are rounded off to a smaller number of digits. 
To show this in a specific case, takea = 0.22, b = 3.19, andc = 2.17. 
After each multiplication, round off the last digit. 


@ a. First round each entry in the determinant to two digits. 


82 Improvements on the Euler Method 


For many problems the Euler method requires a very small step size to produce sufficiently 
accurate results. Much effort has been devoted to the development of more accurate methods. 
In the next three sections, we will discuss some of these methods. Consider the initial value 
problem 


y = f(t, y), y(t) = yo (1) 


and let y = @(t) denote its solution. Recall from equation (10) of Section 8.1 that by 
integrating the given differential equation from f, to f, ,,, we obtain 


[ESI 


P(t) = (th) + f(t, $ (1)) dt. Q) 


In 


The Euler formula 


Yn+1 = Yn + hf (th; Yn) (3) 


is obtained by replacing the integrand f(t, $(1)) in equation (2) by its approximate value 
f (tn, Yn) at the left endpoint of the interval of integration. Other approximations of the definite 
integral lead to other numerical solution methods for initial value problems. 


| 


3 


y'= f(t, e(t) 


fa, $ (55,1) 


--- 
5 (Fln, Olt) + 


ft, , Oln) 


f(t,, 0 (t,)) 


t 


n+l 


| FIGURE 8.2.1 | Derivation of the improved Euler method. 


Improved Euler Formula. A better approximate formula for the solution of initial value 
problem (1) can be obtained if the definite integral in equation (2) is approximated more 
accurately. One way to do this is to replace the integrand by the average of its values at the two 


1 
endpoints, namely, = tis th)) + fF (tuas. tn+1 . This is equivalent to approximatin, 
dpoi ly, 3 Uf (tn)) sis 0 (tn41)) ). This is equival pproximating 


the area under the curve in Figure 8.2.1 between t = t, and t = t„+1 by the area of the shaded 
trapezoid. Further, we replace @(t,) and $ (1,41) by their respective approximate values y;, 
and y,,,. In this way we obtain, from equation (2), 


fts, Yn) F fna Yn41) h 
2 R 


Yn+1 = Yn + (4) 
Since the unknown y,,, appears as one of the arguments of f on the right-hand side of 
equation (4), this equation defines y,,, implicitly rather than explicitly. Depending on the 
nature of the function f , it might be fairly difficult to solve equation (4) for y„+1. This difficulty 
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can be overcome by replacing y,; on the right-hand side of equation (4) by the value obtained 
using the Euler formula (3). Thus 


fs, Yn) + ft + h, Yn + hf (tn, Xn), 
2 


Yn+1 = Yn + 


c th th, n hf, 
a e REMMI THD, 


(5) 


where f,,,; has been replaced by t, + ^. 

Equation (5) gives an explicit formula for computing y,,,. the approximate value of 
$ (1,41). in terms of the data at ¢,,. This formula is known as the improved Euler formula or 
the Heun! formula. The improved Euler formula is an example of a two-stage method; that 
is, we first calculate y, + hf, from the Euler formula and then use this result to calculate y,, 
from equation (5). 

The improved Euler formula (5) is an improvement over the Euler formula (3) because 
the local truncation error in using equation (5) is proportional to /?, while for the Euler method 
it is proportional to h?. This error estimate for the improved Euler formula is established in 
Problem 12. It can also be shown that for a finite interval, the global truncation error for the 
improved Euler formula is bounded by a constant times h?, so this method is a second-order 
method. Note that this greater accuracy is achieved at the expense of more computational work, 
since it is now necessary to evaluate f(t, y) twice in order to go from f, to f, ,. 

If f(t, y) depends only on £ and not on y, then solving the differential equation 
y' = f(t, y) reduces to integrating f(r). In this case the improved Euler formula (5) becomes 


h 
Yn+1 = Jn EFS n) HSn +h)), (6) 


which is just the trapezoid rule for numerical integration. 


EXAMPLE 1 


Use the improved Euler formula (5) with step sizes of h = 0.025 and h = 0.01 to calculate 
approximate values of the solution of the initial value problem 


y-21-t-4y y(0)=1 (7) 
on the interval 0 < t < 2. 


Solution: 


To make clear exactly what computations are required, we show a couple of steps in detail. For this 
problem f(t, y) = 1 — t + Ay; hence 


fn = d — tn +4yn 

and 

fn + hy yn +h fn) = 1 — (tn +h) t AC yn + hfa). 
Further, to = 0, yg = 1, and fy = 1 — to + 4yo = 5. When h = 0.025, then 

f(to +h, yo +Afo) = 1 — 0.025 + 4(1 + (0.025)(5)) = 5.475. 

Then, from equation (5), 

yı = 1 + (0.5)(5 + 5.475) (0.025) = 1.1309375. (8) 

At the second step we must calculate 
fi = 1 — 0.025 + 4(1.1309375) = 5.49875, 
yı hf, = 1.1309375 + (0.025) (5.49875) = 1.26840625, 


'The formula is named for the German mathematician Karl Heun (1859-1929), who was a professor at Technische 
Hochschule Karlsruhe. 
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and 
Sf (ta, yı - hf) = 1 — 0.05 + 4(1.26840625) = 6.023625. 
Then, from equation (5), 
y2 = 1.1309375 + (0.5) (5.49875 + 6.023625) (0.025) = 1.2749671875. (9) 


Further results for 0 < t < 2 obtained by using the improved Euler method with h = 0.025 
and h = 0.01 are given in Table 8.2.1. To compare the results of the improved Euler method with 
those of the Euler method, note that the improved Euler method requires two evaluations of f at 
each step, while the Euler method requires only one. This is significant because typically, most of 
the computing time in each step is spent in evaluating f, so counting these evaluations is a reasonable 
way to estimate the total computing effort. Thus, for a given step size h, the improved Euler method 
requires twice as many evaluations of f as the Euler method. Alternatively, the improved Euler 
method for step size h requires the same number of evaluations of f as the Euler method for step 
size h/2. 

The approximate solutions shown in Table 8.2.1 confirm that the improved Euler method with 
h = 0.025 gives much better results than the Euler method with h = 0.01. Note that to reach t = 2 
with these step sizes, the improved Euler method requires 160 evaluations of f, while the Euler 
method requires 200. More noteworthy is that the improved Euler method with h = 0.025 is also 
slightly more accurate than the Euler method with h = 0.001 (2000 evaluations of f). In other words, 
with something like one-twelfth of the computing effort, the improved Euler method yields results 
for this problem that are comparable to, or a bit better than, those generated by the Euler method. This 
illustrates that, compared to the Euler method, the improved Euler method is clearly more efficient, 
yielding substantially better results or requiring much less total computing effort, or both. 

The percentage errors at t = 2 for the improved Euler method are 1.23% for h = 0.025 and 
0.21% for h = 0.01. 


A Comparison of Results Using the Euler and Improved Euler 


EL LL Methods for the Initial Value Problem y'—-1-—t-44y,y(0) =1 


Improved Euler 


h — 0.01 h — 0.001 h — 0.025 h — 0.01 


0.0 1.0000000 1.0000000 1.0000000 1.0000000 1.0000000 
0.1 1.5952901 1.6076289 1.6079462 1.6088585 1.6090418 
0.2 2.4644587 2.5011159 2.5020618 2.5047827 2.5053299 
0.3 3.7390345 3.8207130 3.8228282 3.8289146 3.8301388 
0.4 5.6137120 5.7754845 5.7796888 5.7917911 5.7942260 
0.5 8.3766865 8.6770692 8.6849039 8.7074637 8.7120041 
1.0 60.037126 64.382558 64.497931 64.830722 64.897803 
1.5 426.40818 473.55979 474.83402 478.51588 479.25919 
20029279. 3484.1608 3496.6702 3532.8789 3540.2001 


A computer program for the Euler method can be readily modified to implement the 
improved Euler method instead. All that is required is to replace Step 6 in the algorithm in 
Section 8.1 by the following: 


The Improved Euler Method 
Step 6.  kl= f(t, y) 
k2 = f(t+h,y +h xkl) 
y =y+(h/2) « (k1 + 2) 
t=t+h 
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Variation of Step Size. In Section 8.1 we mentioned the possibility of adjusting the step 
size as a calculation proceeds so as to maintain the local truncation error at a more or less 
constant level. The goal is to use no more steps than necessary and, at the same time, to 
keep some control over the accuracy of the approximation. Here, we will describe how this 
can be done. First, we choose the error tolerance e, which is the local truncation error that 
we are willing to accept. Suppose that after n steps we have reached the point (tn, Yn). We 
choose a step size h and calculate y,,.;. Next we need to estimate the error we have made in 
calculating y,,,. Not knowing the actual solution, the best that we can do is to use a more 
accurate method and repeat the calculation starting from (ta, Yn). For example, if we used 
the Euler method for the original calculation, we might repeat it with the improved Euler 
method. Then the difference between the two calculated values is an estimate B of the 
error in using the original method. If the estimated error is larger than the error tolerance e, 
then we adjust the step size and repeat the calculation. The key to making this adjustment 
efficiently is knowing how the local truncation error e,,; depends on the step size h. For 
the Euler method, the local truncation error is proportional to h°, so to bring the estimated 
error down (or up) to the tolerance level e, we must multiply the original step size by the 


factor V ele ME 


To illustrate this procedure, consider the example problem (7): 
y =1-t+4y, y(0) =1. 


Suppose that we choose the error tolerance € to be 0.05. You can verify that after one step 
with h = 0.1, we obtain the values 1.5 and 1.595 from the Euler method and the improved 
Euler method, respectively. Thus the estimated error in using the Euler method is 0.095. Since 
this is larger than the tolerance level of 0.05, we need to adjust the step size downward by 
the factor 4/0.05/0.095 = 0.73. Rounding downward to be conservative, let us choose the 
adjusted step size h = 0.07. Then, from the Euler formula, we obtain 


yı = 1+ (0.07) f(0, 1) = 1.35 = $ (0.07). 


Then, using the improved Euler method with ^ = 0.07, we obtain y; = 1.39655, so the 
estimated error in using the Euler formula is 0.04655, which is slightly less than the specified 
tolerance. The actual error, based on a comparison with the exact solution, is somewhat greater, 
namely, 0.05122. 

We can follow the same procedure at each step of the calculation, thereby keeping the 
local truncation error approximately constant throughout the entire numerical process. Modern 
adaptive codes for solving differential equations adjust the step size in very much this way as 
they proceed, although they usually use more accurate formulas than the Euler and improved 
Euler formulas. Consequently, they are able to achieve both efficiency and accuracy by using 
very small steps only where they are really needed. 


Problems 


@ 1. Complete the calculations leading to the entries in columns 5. y = (y? +2ty)/(3 +t, y(0) = 0.5 
four and five of Table 8.2.1. 6. 


In each of Problems 2 through 6, find approximate values of the 
solution of the given initial value problem at tf = 0.1, 0.2, 0.3, and 
0.4. Compare the results with those obtained by the Euler method and 
the backward Euler method in Section 8.1 and with the exact solution 


y'=(?—y?)siny, y(0) = —1 

In each of Problems 7 through 11, find approximate values of the 
solution of the initial value problem at t = 0.5, 1.0, 1.5, and 2.0. 

@ a. Use the improved Euler method with h = 0.025. 

O b. Use the improved Euler method with A = 0.0125. 


(if available). : 
@ a. Use the improved Euler method with ^ = 0.05. 7. y -05-tt2y y(0)=1 
O b. Use the improved Euler method with ^ = 0.025. 8. y'25:—-3J/y, y(022 
@ c. Use the improved Euler method with h = 0.0125. 9. y= J/r*y, x0) 23 
2 y'=3+t-y, y0-1 10. y'=2t+e-, y(0-1 
m ee MM H. y-0^-2))/G4:5), y0) =0.5 


4. y'=2t+e", y(0)21 


12. In this problem we establish that the local truncation error for 
the improved Euler formula is proportional to h°. If we assume that 
the solution ¢ of the initial value problem y' = f(t, y), y(to.) = yo 
has derivatives that are continuous through the third order (f has 
continuous second partial derivatives), then it follows that 
P” (t5) o” n) 

(tn +h) = Gln) + O'Cin)h + —Á, 7 MT 
where f, < tn < t, +h. Assume that y, = ¢ (tn). 

a. Show that, for y,,, as given by equation (5), 


, 


€n41 = Q4) — Yn 
= o'(tr)h — Cf Cth +h, yn + hf (th, Yn)) = ftu. Yn)) h 
2! 


à" (1,)h? 
35 7 
b. Use the facts that 9"(t) = F(t, (1) + fy (t 6(0)9 (1) 


and that the Taylor approximation with a remainder for a function 
F(t, y) of two variables is 


F(a+h,b+k) = F(a,b) + F,(a, b)h + Fy(a, b)k 


(10) 


1 
+5 Fu + 2hk Fry +k? Fy) ' 


t—é.y-m 


where € lies between a and a + h, and 7 lies between b and 
b + k, to show that the first term on the right-hand side of 
equation (10) is proportional to h? plus higher-order terms. This 
is the critical estimate needed to prove that the local truncation 
error is proportional to /?. 

C. Show that if f(t, y) is linear in t and y, then 


Cnt = go inh for some t, with th < f, < t,44. 
Hint: What are frr, fry, and fyy? 


13. Consider the improved Euler method for solving the illustrative 
initial value problem y’ = 1 — t + 4y, y(0) = 1. 
a. Using the result of Problem 12c and the exact solution of the 
initial value problem, determine e,, ; and a bound for the error at 
any step onO < t <2. 
b. Compare the error found in a with the one obtained in 
equation (27) of Section 8.1 using the Euler method. 
C. Alsoobtain a bound for e, for  — 0.05, and compare it with 
equation (28) of Section 8.1. 
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In each of Problems 14 and 15, 
a. Use the actual solution o (f) to determine e,,; and a bound 
for e, 4; at any step on O < t < 1 for the improved Euler method 
for the given initial value problem. 
b. Also obtain a bound for e; for h = 0.1, and compare it with 
the similar estimate for the Euler method and with the actual error 
for the improved Euler method. 

14. y’=2y-1, y(0-21 

15. y =0.5—t+2y, y(0)=1 

In each of Problems 16 through 19, carry out one step of the Euler 

method and of the improved Euler method, using the step size h = 0.1. 

Suppose that a local truncation error no greater than e = 0.0025 is 


required. Estimate the step size that is needed for the Euler method to 
satisfy this requirement at the first step. 


O 16. y =05-14+2y, y(0)=1 
O 17. »/-25:-3y»  y(0) =2 
O 18. y= /ftFy, y(0) =3 


© 19. y=)? 4 2ry)/(3 +t), y(0) =0.5 
20. The modified Euler formula for the initial value problem 
y' = f(t, y), y(to) = yo is given by 


1 1 
Yn+1 = Yn + hf (a +P z” Yn + ^f n, ») : 


Following the procedure outlined in Problem 12, show that the local 
truncation error in the modified Euler formula is proportional to h°. 


In each of Problems 21 through 24, use the modified Euler formula 
of Problem 20 with h = 0.05 to compute approximate values of the 
solution of the given initial value problem at t = 0.1, 0.2, 0.3, and 0.4. 


@ 21. y’=3+t-y, y(0) = 1 (Compare with Problem 2) 
@ 22. y =St—3/y, y(0) =2 

0 23. y'—2y —3t, y(0) = 1 (Compare with Problem 3) 

O 24. y/—2r e, (0) = 1 (Compare with Problem 4) 
25. Show that the modified Euler formula of Problem 20 is identical 


to the improved Euler formula of equation (5) for y' = f(t, y) if f is 
linear in both and y. 


The Euler formula, the backward Euler formula, and the improved Euler formula were 
introduced, in Sections 8.1 and 8.2, as ways to numerically approximate the solution of the 
initial value problem 


y =f, y), yu) = Yo. e 
The local truncation errors for these methods are proportional to h?, A?, and h?, respectively. 
The Euler and improved Euler methods belong to what is now called the Runge-Kutta? class 
of methods. 

In this section we discuss the method originally developed by Runge and Kutta. This 
method is now called the classic fourth-order four-stage Runge-Kutta method, but it is 
often referred to simply as the Runge-Kutta method, and we will follow this practice for 
?Carl David Runge (1856-1927), a German mathematician and physicist, worked for many years in spectroscopy. 
The analysis of data led him to consider problems in numerical computation, and the Runge-Kutta method originated 
in his paper on the numerical solution of differential equations in 1895. The method was extended to systems of 


equations in 1901 by Martin Wilhelm Kutta (1867-1944). Kutta was a German mathematician and aerodynamicist 
who is also well known for his important contributions to classical airfoil theory. 
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brevity. This method has a local truncation error that is proportional to hî. Thus it is two orders 
of magnitude more accurate than the improved Euler method and three orders of magnitude 
better than the Euler method. It is relatively simple to use and is sufficiently accurate to handle 
many problems efficiently. This is especially true of adaptive Runge-Kutta methods, in which 
provision is made to vary the step size as needed. We return to this issue at the end of the 
section. 

The Runge-Kutta formula involves a weighted average of values of f(t, y) at four 
different points in the interval f, < t < t,,,. Itis given by 


kni + 2kn2 + 2k,3 + kn 
rns = tA aes s), (2) 
6 
where 
kai = f Cta; Yn), 
ET EE E 
n2 — n 2 > Yn 2 nl J> 
1 1 (3) 
kn = tn zh, n zhk, > 
3 s( + 5h, yn + 5 ;) 


kn4 = ftu + h, Yn + hky3). 


1 
The sum gU + 2k,» + 2k,3 + k,4) can be interpreted as an average slope. Note that k,,; is 


the slope at the left end of the interval, k,» is the slope at the midpoint using the Euler formula 
to go from f, to t, + h/2, k,3 is a second approximation to the slope at the midpoint, and &,4 
is the slope at t, + ^ using the Euler formula and the slope k,,3 to go from f, to t, + h. 

Although in principle it is not difficult to show that equation (2) differs from the Taylor 
expansion of the solution ¢ by terms that are proportional to h°, the algebra is rather lengthy.? 
Thus we simply state without proof that the local truncation error in using equation (2) is 
proportional to hî and that for a finite interval the global truncation error is at most a constant 
times h*. The earlier description of this method as a fourth-order four-stage method reflects 
the facts that the global truncation error is of fourth order in the step size h and that there are 
four intermediate stages in the calculation (the calculation of kni, ... , &,4). 

It is obvious that the Runge-Kutta formula, equations (2) and (3), is more complicated 
than any of the formulas discussed previously. This is of relatively little significance, however, 
since it is not hard to write a computer program to implement this method. Such a program 
has the same structure as the algorithm for the Euler method outlined in Section 8.1. To be 
specific, the lines in Step 6 in the Euler algorithm must be replaced by the following: 


The Runge-Kutta Method 
kl = f(t, y) 
k2 = f(t +h/2, y - (h/2) x k1) 
k3 = f(t +h/2, y - (h/2) « k2) 


k4 = f(t +h, y +h x k3) 
y 2 y - (h/6) x (k1l-- 2 k2 -2 x k3 + k4) 
t=t+h 


Note that if f does not depend on y, then 
kni = f (ta), km = kn3 = rs F ;) kn4 = f (tn +h), (4) 
and equation (2) reduces to 
Ynt1 — Yn = = (Fem sar(ns =) fa em). (5) 


Equation (5) can be identified as Simpson's^ rule for the approximate evaluation of the integral 
of y = f(t). The fact that Simpson's rule has an error proportional to h’ is consistent with 
the local truncation error in the Runge-Kutta formula. 

3See, for example, Chapter 3 of the book by Henrici listed in the references. 


4Simpson’s rule is named for Thomas Simpson (1710-1761), an English mathematician and textbook author, who 
published it in 1743. 
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EXAMPLE 1 


Use the Runge-Kutta method to calculate approximate values of the solution y = ó (t) of the initial 
value problem 


y =1—t+4y, y(0) =1. (6) 
Solution: 
Taking h = 0.2, we have 
ky = f(0, 1) =5; hg, = 05, 
ko; = f(0 + 0.1, 1 + 0.5) = 6.9; hko = 0.69, 
kog = f(0 + 0.1, 1 + 0.69) = 7.66; hko = 1.532, 


ko, = f(0 + 0.2, 1 + 1.532) = 10.928. 


Thus 


0.2 
yı = 1+ AG 206.9) + 2(7.66) + 10.928) 


1 + 1.5016 = 2.5016. 


Further results using the Runge-Kutta method with h = 0.2, h = 0.1, and h = 0.05 are given 
in Table 8.3.1. Note that the Runge-Kutta method yields a value at t = 2 that differs from the exact 
solution by only 0.122% if the step size is h = 0.1, and by only 0.00903% if h = 0.05. In the latter 
case, the error is less than one part in 10,000, and the calculated value at t = 2 is correct to four digits. 

To compare the computational requirements of these methods, note that both the Runge-Kutta 
method with A = 0.05 and the improved Euler method with h = 0.025 require 160 evaluations of f 
to reach t = 2. The improved Euler method yields a result at t = 2 that is in error by 1.23%. Although 
this error may be acceptable for some purposes, it is more than 135 times the error yielded by the 
Runge-Kutta method with comparable computing effort. Note also that the Runge-Kutta method with 
h = 0.2, or 40 evaluations of f, produces a value at t = 2 with an error of 1.40%, which is only 
slightly greater than the error in the improved Euler method with h = 0.025, or 160 evaluations 
of f. Thus we see again that a more accurate algorithm is more efficient; it produces better results 
with similar effort, or similar results with less effort. 


A Comparison of Results for the Numerical Approximation of the 
Solution of the Initial Value Problem y’ = 1 — t + 4y, y (0) = 1 


TABLE 8.3.1 


Improved Euler Runge - Kutta 
h — 0.02 : h — 0.1 

0.0 1.0000000 1.0000000 1.0000000 1.0000000 1.0000000 
0.1 1.6079462 1.6089333 1.6090338 1.6090418 
On 2.5020618 2.5016000 2.5050062 2.5053060 2.5053299 
0.3 3.8228282 3.8294145 3.8300854 3.8301388 
0.4 5.7796888 5.7776358 57927859 5.7941197 5.7942260 
0.5 8.6849039 8.7093175 8.7118060 8.7120041 
1.0 64.497931 64.441579 64.858107 64.894875 64.897803 
15 474.83402 478.81928 479.22674 479.25919 
2.0 3496.6702 3490.5574 3535.8667 3539.8804 3540.2001 


The classic Runge-Kutta method suffers from the same shortcoming as other methods 
with a fixed step size for problems in which the local truncation error varies widely over the 
interval of interest. That is, a step size that is small enough to achieve satisfactory accuracy in 
some parts of the interval may be much smaller than necessary in other parts of the interval. 
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This has stimulated the development of adaptive Runge-Kutta methods that provide for 
modifying the step size automatically as the computation proceeds, so as to maintain the local 
truncation error near or below a specified tolerance level. As explained in Section 8.2, this 
requires the estimation of the local truncation error at each step. One way to do this is to repeat 
the computation with a fifth-order method—which has a local truncation error proportional 
to h°—and then to use the difference between the two results as an estimate of the error. If 
this is done in a straightforward manner, then the use of the fifth-order method requires at 
least five more evaluations of f at each step, in addition to those required originally by the 
fourth-order method. However, if we make an appropriate choice of the intermediate points 
and the weighting coefficients in the expressions for k,), ... , k,4 in a certain fourth-order 
Runge-Kutta method, then these expressions can be used again, together with one additional 
stage, in a corresponding fifth-order method. This results in a substantial gain in efficiency. It 
turns out that this can be done in more than one way. 

The first fourth- and fifth-order Runge-Kutta pair was developed by Erwin Fehlberg? 
in the late 1960s and is now called the Runge-Kutta-Fehlberg, or RKF,° method. The 
popularity of the RKF method was considerably enhanced by the appearance in 1977 of 
its Fortran implementation RKF45 by Lawrence F. Shampine and H. A. Watts. The RKF 
method and other adaptive Runge-Kutta methods are very powerful and efficient means of 
approximating numerically the solutions of an enormous class of initial value problems. 
Specific implementations of one or more of them are widely available in commercial software 


packages. 


Problems 


@ 1. Confirm the results in Table 8.3.1 by executing the 
indicated computations. 


In each of Problems 2 through 6, find approximate values of the 
solution of the given initial value problem at t = 0.1, 0.2, 0.3, and 
0.4. Compare the results with those obtained by using other methods 
and with the exact solution (if available). 

@ a. Use the Runge-Kutta method with h = 0.1. 

Q b. Use the Runge-Kutta method with ^ = 0.05. 


2. y'=34+r-y, yO=1 

3. y'25t-3Jy, y(0) =2 

4. y'=2t+e", yO) =1 

5. y = (y? + 2ty)/(3 +t, y(0) = 0.5 
6. y 2(2—y)5siny y(0-2-1 


In each of Problems 7 through 11, find approximate values of the 
solution of the given initial value problem at f = 0.5, 1.0, 1.5, and 
2.0. Compare the results with those obtained by other methods and 
with the exact solution (if available). 

@ a. Use the Runge-Kutta method with h = 0.1. 

Q b. Use the Runge-Kutta method with h = 0.05. 


Erwin Fehlberg (1911-1990) was born in Germany, received his doctorate 
from the Technical University of Berlin in 1942, emigrated to the United States 
after World War II, and was employed by NASA for many years. The Runge- 
Kutta-Fehlberg method was first published in a NASA Technical Report in 
1969. 


*'The details of the RKF method may be found, for example, in the books 
by Ascher and Petzold and by Mattheij and Molenaar that are listed in the 
References. 


7. y'=0.5-t+2y, y(0)=1 
8. y =5t-3/7, (0) =2 
9. y= tty, y(0) =3 
10. y'=2t+e", yO=1 


11. y-49^--X49/0404 90) =05 


12. Consider the initial value problem 


y = 317/(3y? — 4), y(0) =0. 


Let ty be the right-hand endpoint of the interval of existence of this 
solution. 
@ a. Draw a direction field for this equation. 
b. Use the direction field created in a to estimate ty. What 
happens at fy to prevent the solution from continuing farther? 
Q c. Use the Runge-Kutta method with various step sizes to 
determine an approximate value of ty. 
d. If you continue the Runge-Kutta computation for t > tm, you 
can continue to generate values of y. What significance, if any, 
do these values have? 
@ e. Suppose that the initial condition is changed to y(0) = 1. 
Repeat parts b and c for this problem. 


84 Multistep Methods 


In previous sections we have discussed numerical procedures for approximating the solution 
of the initial value problem 


y =f(t,y), y(to) = yo (1) 


in which data at the point t = t, are used to calculate an approximate value of the solution 
(tn41) at the next mesh point t = f,,,. In other words, the calculated value of the exact 
solution à at any mesh point depends only on the data at the preceding mesh point. Such 
methods are called one-step methods. However, once approximate values of the exact solution 
y = ó(t) have been obtained at a few points beyond tọ, it is natural to ask whether we can make 
use of more of this information — not just the value at the last point— to calculate the value of 
Q (t) at the next point. Specifically, if y; at fj, y2 at t5, ... , y, at t, are known, how can we 
use this information to determine y,,, at t,,;? Methods that use information at more than 
the last mesh point are referred to as multistep methods. In this section we will describe two 
types of multistep methods: Adams" methods and backward differentiation methods. Within 
each type, we can achieve various levels of accuracy, depending on the number of preceding 
data points that are used. For simplicity, we will assume throughout our discussion that the 
step size h is constant. 


Adams Methods. Recall that the solution o (1) of the initial value problem (1) satisfies 


Il 


$i) — (tn) = o'(t) dt. Q) 
tn 
The basic idea of an Adams method is to approximate $'(f) by a polynomial P(t) of degree 
k and to use the polynomial to evaluate the integral on the right-hand side of equation (2). The 
coefficients in P(t) are determined by using k + 1 previously calculated data points. 

For example, suppose that we wish to use a first-degree polynomial P,(f) = At + B. 
Then we need only the two data points (ty, Yn) and (1, 4, y, 1). For P, to interpolate ¢’ 
at both t = f, and t = f,_1, we require both that Pi(f,) = $'(t,) = f(t,, Yn) and that 
Pi(tn—-1) = (t) = fcis Yn-1). Recall that we denote f(t;, yj) by fj for an integer j. 
Thus A and B must satisfy the equations 


At, + B= das 
At, 1 + B= fua. 


(3) 


Solving for A and B, we obtain 


tn ! dni 


n—ln — Intn— 


h 
Replacing $'(t) by Pj(t) and evaluating the integral in equation (2), we find that 


tn A/2 
Onan) - ea) = [ Gre BY dt = S (ia) + Bea =. 


tr 
Finally, we replace @(t,41;) and @(t,) by Yn+ı and yp, respectively, and carry out some 
algebraic simplification. For a constant step size h, we obtain 


(4) 


3 1 
n = Yn Shifa — <hfn-1- 5 
Yn41 Yn + hf, zf- (5) 


Equation (5) is the second-order Adams-Bashforth® formula. It is an explicit formula for 
Yn+1 in terms of y, and y, , and has a local truncation error proportional to h?. 


"John Couch Adams (1819-1892), an English mathematician and astronomer, is most famous as codiscoverer, with 
Joseph Leverrier, of the planet Neptune in 1846. He was associated with Cambridge University for most of his life, as 
student (1839-1843), fellow, Lowdean Professor, and director of the Observatory. Adams was extremely skilled at 
computation; his procedure for numerical integration of differential equations appeared in 1883 in a book on capillary 
action written with Francis Bashforth. 

*Francis Bashforth (1819-1912), English mathematician and Anglican priest, was a classmate of J. C. Adams at 
Cambridge. He was particularly interested in ballistics and invented the Bashforth chronograph for measuring the 
velocity of artillery projectiles. 
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We note in passing that the first-order Adams-Bashforth formula, based on the 
polynomial Po(t) = fa of degree zero, is just the original Euler formula. (See Problem 11a.) 

More accurate Adams formulas can be obtained by following the procedure outlined 
above, but using a higher degree polynomial and correspondingly more data points. For 
example, suppose that a polynomial P3(t) of degree three is used. The coefficients 
are determined from the four points (ta, Yn), Cty. Yn—1)> (tn—2> Yn-2), and (1,5, Yn-3). 
Substituting this polynomial for $'(1) in equation (2), evaluating the integral, and simplifying 
the result, we eventually obtain the fourth-order Adams-Bashforth formula 


h 
Yn+1 = Yn + 24 9n — 59 f, a 37 f, o = 9 fn—3) - (6) 


The local truncation error of this fourth-order formula is proportional to h°. 

A variation on the derivation of the Adams-Bashforth formulas gives another set of 
formulas called the Adams-Moulton? formulas. To see the difference, let us again consider 
the second-order case. Again we use a first-degree polynomial Q;(t) = at + B, but we 
determine the coefficients by using the points (tn, Yn) and (ta+1, Yn+1). Thus o and 3 must 
satisfy 


Qty + B = Stns 
(7) 
At41+ DB = Taitis 
and it follows that 
Tati m fn ftia m fna 

= ——————; = —. 8 
a h B h (8) 

Substituting Q(t) for ó'(1) in equation (2) and simplifying, we obtain 

1 1 

Yn+1 = Yn + 5^» + 3^ f (nir, Yn41); (9) 


which is the second-order Adams-Moulton formula. We have written f (1,44. Yn+1) in the 
last term to emphasize that the Adams-Moulton formula is implicit, rather than explicit, since 
the unknown y,,, appears on both sides of the equation. The local truncation error for the 
second-order Adams-Moulton formula is proportional to h°. 

The first-order Adams-Moulton formula is just the backward Euler formula, as you 
might anticipate by analogy with the first-order Adams-Bashforth formula. (See Problem | 1b.) 

More accurate higher-order formulas can be obtained by using an approximating 
polynomial of higher degree. The fourth-order Adams-Moulton formula, with a local 
truncation error proportional to h°, is 


h 
Yn+1 = Ya + 240 fa +19 fam Sfr- + fi). (10) 


Observe that this is also an implicit formula, because y„+ı appears in f,+1. 

Although both the Adams-Bashforth and Adams-Moulton formulas of the same order 
have local truncation errors proportional to the same power of h, the Adams-Moulton formulas 
of moderate order are in fact considerably more accurate. For example, while the two 
fourth-order formulas (6) and (10) both have local truncation errors proportional to h>, the 


proportionality constant for the Adams-Moulton formula is less than 10 of the proportionality 


constant for the Adams-Bashforth formula. 

Is it better to use the explicit (and faster) Adams-Bashforth formula or the more accurate 
but implicit (and slower) Adams-Moulton formula? The answer depends on whether, by using 
the more accurate formula, you can increase the step size, and thereby reduce the number of 
steps enough to compensate for the additional computations required at each step. 

In fact, numerical analysts have attempted to achieve both simplicity and accuracy by 
combining the two formulas in what is called a predictor-corrector method. Once y,. 5. 
Yn-2» Ya—1, and y, are known, we can compute f,3, fn—2, fr—1, and fa, and then use the 


?Forest Ray Moulton (1872-1952), an American astronomer and administrator of science, was for many years 
professor of astronomy at the University of Chicago. During World War I, he was in charge of the Ballistics Branch 
of the U.S. Army at Aberdeen (MD) Proving Ground. In the course of calculating ballistics trajectories he devised 
substantial improvements in the Adams formula. 


Adams-Bashforth (predictor) formula (6) to obtain a first value for y,,,. Then we compute 
f»41 and use the Adams-Moulton (corrector) formula (10), which is no longer implicit, to 
obtain an improved value of y, ,,. We can, of course, continue to use the corrector formula 
(10) if the change in y,,, is too large. However, if it is necessary to use the corrector formula 
more than once or perhaps twice, it means that the step size h is too large and should be 
reduced. 

In order to use any of the multistep methods, it is necessary first to calculate a few y; by 
some other method. For example, the fourth-order Adams-Moulton method requires values 
for y, and y2, while the fourth-order Adams-Bashforth method also requires a value for y3. 
One way to proceed is to use a one-step method of comparable accuracy to calculate the 
necessary starting values. Thus, for a fourth-order multistep method, we might use the fourth- 
order Runge-Kutta method to calculate the starting values. This is the method used in the next 
example. 

Another approach is to use a low-order method with a very small h to calculate y;, and 
then to increase gradually both the order and the step size until enough starting values have 
been determined. 


EXAMPLE 1 


Consider again the initial value problem 


y -21-t-4y, y(0)=1. (11) 


With a step size of h = 0.1, determine an approximate value of the solution y = $(t) att = 0.4 
using the fourth-order Adams-Bashforth formula, the fourth-order Adams-Moulton formula, and the 
associated fourth-order predictor- corrector method. 


Solution: 


For starting data, we use the values of y1, y2, and y3 found by the Runge-Kutta method. These are 
tabulated in Table 8.3.1. Next, calculating the corresponding values of f(t, y), we obtain 


yo=l, fo =5, 

yı = 1.6089333, fi = 7.3357332, 
yy = 2.5050062, fə = 10.820025, 
ys = 3.8294145, fa = 16.017658. 


Then, from the Adams-Bashforth formula, (6), we find that y4 = 5.7836305. The exact value of the 
solution at t = 0.4, correct through eight digits, is 5.7942260, so the error is —0.0105955. 
The Adams-Moulton formula, (10), leads to the equation 


ya = 4.9251275 + 0.15y,, 


from which it follows that y4 — 5.7942676 with an error of only 0.0000416. 

Finally, using the result from the Adams-Bashforth formula as a predicted value of 9 (0.4), we 
can then use equation (10) as a corrector. Corresponding to the predicted value of y4, we find that 
fa = 23.734522. Hence, from equation (10), the corrected value of y4 is 5.7926721. This result is 
in error by —0.0015539. 

Observe that the Adams-Bashforth method is the simplest and fastest of these methods, since 
it involves only the evaluation of a single explicit formula. It is also the least accurate. Using 
the Adams-Moulton formula as a corrector increases the amount of calculation that is required, 
but the method is still explicit. In this problem the error in the corrected value of y4 is reduced 
by approximately a factor of 7 (14.7%) when compared to the error in the predicted value. The 


1 
Adams-Moulton method alone yields by far the best result, with an error that is about 30 as large 


(2.7%) as the error from the predictor- corrector method. 

Remember, however, that the Adams-Moulton method is implicit, which means that an 
equation must be solved at each step. In the problem considered here this equation is linear, so the 
solution is quickly found, but in nonlinear problems this part of the procedure may be much more 
time-consuming. 
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The Runge-Kutta method with h = 0.1 gives y4 = 5.7927853 with an error of —0.0014407; 
see Table 8.3.1. Thus, for this problem, the Runge-Kutta method is comparable in accuracy to the 
predictor- corrector method. 


Backward Differentiation Formulas. Another type of multistep method uses a polynomial 
P(t) to approximate the solution @(t) of the initial value problem (1) rather than its 
derivative @'(t), as in the Adams methods. We then differentiate P,(t) and set Pj(t,41) 
equal to f(ta+1, Yn+1) to obtain an implicit formula for y,,;. These are called backward 
differentiation formulas. These methods became widely used in the 1970s because of the 
work of C. William Gear!? on so-called stiff differential equations, whose solutions are very 
difficult to approximate by the methods discussed up to now; see Section 8.6. 

The simplest case uses a first-degree polynomial Pj(1) = At + B. The coefficients are 
chosen so that Pi(1,) and P;(t,+1) agree with the computed values of the solution y, and y,4+1, 
respectively: Pi(ta) = y, and Pj(t,4;) = Yn41- Thus A and B must satisfy 


At, + B = Yn 
(12) 
At +B= Yn+1- 
Solving the linear algebraic equations (12) for A and B yields 
n — Jn n+itn — Yatn 
A= fmi » and p clon (13) 


Since PIG) = A, the requirement that Pi(tn41) = f Gia, Yn41) is just A= f( Inl Yn41)- 
Equating this value of A and the value of A given in equation (13) and rearranging terms, we 
obtain the first-order backward differentiation formula 


Yn+1 = Yn + hf (t+, Yn41)- (14) 


Note that equation (14) is just the backward Euler formula that we first saw in Section 8.1. 

By using higher-order polynomials and correspondingly more data points, we can obtain 
backward differentiation formulas of any order. The second-order backward differentiation 
formula is 


1 
Yn+1 = 3 (45 — Yn-1 + 2hf (tuii, Ynt1))s (15) 


and the fourth-order backward differentiation formula is 
1 
Yri = 5s 48s = 36yn-1 F 16yn—2 ni 3Yn—3 F 12Af( Ina]; Js) s (16) 


These formulas have local truncation errors proportional to h? and /?, respectively. 


EXAMPLE 2 


Use the fourth-order backward differentiation formula with h = 0.1 and the data given in Example 


1 to determine an approximate value of the solution y = ¢(t) at t = 0.4 for the initial value 
problem (11). 

Solution: 

Using equation (16) with n = 3, h = 0.1, and with yo, ... , y3 given in Example 1, we obtain the 
equation 


y4 = 4.6837842 + 0.192y,. 


10C, William Gear (1935- ), born in London, England, received his undergraduate education at Cambridge University 
and his doctorate in 1960 from the University of Illinois. He was a member of the faculty at the University of Illinois for 
most of his career and made significant contributions to both computer design and numerical analysis. His influential 
book on numerical methods for differential equations is listed in the References. 


Thus 


ya = 5.7967626. 


Comparing the calculated value with the exact value (0.4) = 5.7942260, we find that the error is 
0.0025366. This is somewhat better than the result using the fourth-order Adams-Bashforth method, 
but it is not as good as the result using the fourth-order predictor—corrector method, and not nearly 
as good as the result using the fourth-order Adams-Moulton method. 


A comparison between one-step and multistep methods must take several factors into 
consideration. The fourth-order Runge-Kutta method requires four evaluations of f at each 
step, while the fourth-order Adams-Bashforth method (once past the starting values) requires 
only one, and the predictor—corrector method only two. Thus, for a given step size h, the latter 
two methods may well be considerably faster than Runge-Kutta. However, if Runge-Kutta is 
more accurate and therefore can use fewer steps, the difference in speed will be reduced and 
perhaps eliminated. 

The Adams-Moulton and backward differentiation formulas also require that the 
difficulty in solving the implicit equation at each step be taken into account. All multistep 
methods have the possible disadvantage that errors in earlier steps can feed back into later 
calculations with unfavorable consequences. On the other hand, the underlying polynomial 
approximations in multistep methods make it easy to approximate the solution at points 
between the mesh points, should this be desirable. Multistep methods have become popular 
largely because it is relatively easy to estimate the error at each step and to adjust the order 
or the step size to control it. For a further discussion of such questions as these, see the books 
listed at the end of this chapter; in particular, Shampine (1994) continues to be an authoritative 
source. 


Problems 
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In each of Problems | through 5, determine an approximate value of @ b. Use the fourth-order Adams-Moulton method with 


the solution at ? = 0.4 and t = 0.5 using the specified method. For h — 0.05. 


starting values, use the values given by the Runge-Kutta method; see (9 c. Use the fourth-order backward differentiation method 


Problems 2 through 6 of Section 8.3. Compare the results of the with h = 0.05. 
various methods with each other and with the actual solution (if 
available). 
Q a. Use the fourth-order predictor—corrector method with 
h = 0.1. Use the corrector formula once at each step. 
(9 b. Use the fourth-order Adams-Moulton method with 
h 20.1. 
Qc. Use the fourth-order backward differentiation method 


y-Xtty y(0) =3 


eo0 2325 


y-22te9, (0) =1 
10. y =(y? 4 2ty)/(3 427), 


y =05-t+2y, y(0)=1 
y =5t-3/y, y(0) =2 


y(0) = 0.5 


with h — 0.1. 11. a. Show that the first-order Adams-Bashforth method is the 


Eul thod. 
Éh y=3+t-y, hc uler metho 


E b. Show that the first-order Adams-Moulton method is the 
2. y «5t-3Jy, y(0) =2 backward Euler method. 
3. y'22tke", y(0)=1 12. Show that the third-order Adams-Bashforth formula is 


4. y 2 (y? -2ty)/(3- 2), y(0) =0.5 
5. y'=(t?—y*)siny, y(0) 2-1 


h 
Yn = Yn + Fz (BSa — 16 ft 55-2). 


In each of Problems 6 through 10, find approximate values of the 13. Show that the third-order Adams-Moulton formula is 


solution of the given initial value problem at t = 0.5, 1.0, 1.5, and 2.0, 


h 
using the specified method. For starting values, use the values given Yn+1 = Yn + 12 (5 fai T8f,— fn) 


by the Runge-Kutta method; see Problems 7 through 11 in Section 8.3. 


Compare the results of the various methods with each other and with 14. Derive the second-order backward differentiation formula given 


the actual solution (if available). by equation (15) in this section. 
Q a. Use the fourth-order predictor—corrector method with 
h = 0.05. Use the corrector formula once at each step. 
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85 Systems of First-Order Equations 


So far in this chapter we have discussed only numerical methods for approximating the 
solution of an initial value problem associated with a single first-order differential equation. 
These methods can also be applied to a system of first-order differential equations. Since a 
higher-order differential equation can always be reduced to a system of first-order differential 
equations, it is sufficient to deal with systems of first-order differential equations alone. For 
simplicity, we consider a system of two first-order differential equations 


X -—9JUx y Hee), (1) 
with the initial conditions 


X(to) — Xo,  y(to) = yo- Q) 
The functions f and g are assumed to satisfy the conditions of Theorem 7.1.1 so that the initial 
value problem (1) has a unique solution in some interval of the t-axis containing the point tọ. 
We wish to determine approximate values x1, X2, ... , Xn, ... and yj, yo, ..., Yn, ..- ofthe 
solution x = $(t), y = v (t) at the points t, = to + nh with n = 1,2, .... 
In vector notation, the initial value problem (1) can be written as 
x =f(t,x), x(t) = Xo, (3) 


where x is the vector with components x and y, f is the vector function with components f 
and g, and Xo is the vector with components xo and yo. One advantage of the vector notation 
is that, regardless of the number of equations in the system, the initial value problem always 
has the same form. All that changes is the number of components in the vectors x, f, and Xp. 

The methods of the previous sections can be readily generalized to handle systems of two 
(or more) equations. All that is needed (formally) is to replace the scalar variable y by the 
vector x and the scalar function f by the vector function f in the appropriate equations. For 
example, the Euler formula becomes 


Xnt+1 = Xn + hf,, (4) 


where f, = f(1,, Xn), or, in component form, 


ea = (3) i h e. ths Xn, 2 . (5) 
Yn41 Yn 8(tns Xn» Yn) 

The initial conditions are used to determine fọ = f( to, xy). The observation that f; = (15) 
means that fọ is the vector tangent to the graph of the solution x = (tf) at the initial point xo 
in the xy-plane. The approximate solution moves in the direction of this tangent vector for a 
time step h in order to find the next point x,. Then we calculate a new tangent vector f, at xj, 
move along it for a time step A to find x», and so forth. 


In a similar way, the Runge-Kutta method can be extended to a system. For the step from 
tn to t,4, we have 


h 
Xn+1 = Xn + 6 (ki + 2k, + 2kn3 + ka). (6) 
where 
k;i = f( ths Xn), 
h h 
k,2 =f tn + 2 X, + zén , 


(7) 


k, fit "m. dk 
n3 — n zo Xn An]: 
3 2 2 2 


ka = fru +h, X, + hk,,3) | 


The formulas for the fourth-order Adams-Moulton predictor-corrector method as it 
applies to the initial value problem (1) is given in Problem 8. 

Building on the previous observation that every system of first-order differential equations 
can be written in the form given in (3), equation (4) is the Euler formula and equations (6) 
and (7) are the fourth-order Runge-Kutta formulas for any system of first-order differential 
equations. All that changes is the number of components in the vectors. 
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EXAMPLE 1 


Determine approximate values of the solution x = @(t), y = w(t) of the initial value problem 
x =x— 4y, x(0-21 y=-x+y, y(0)=0 


at the point f = 0.2. Use the Euler method with A = 0.1 and the Runge-Kutta method with h = 0.2. 
Compare the results with the values of the exact solution: 


e 3t =t e 


+e B = 
(t) = E oe and w(t) = EN (8) 


Solution: 


Let us first use the Euler method. For this problem, f, = x, — 4y, and gy = —x, + yn; hence 
fo=1—(4)(0) —1 and go ——-140-—-1. 
Then, from the Euler formula (5), we obtain 
xı —1-(0.0(1) 21.1 and y; =0+4+(0.1)(—1) = —0.1. 
At the next step, 


fi-214-(4(-0.1 =1.5 and g;—-—1.14-(—0.1) = —12. 


Consequently, 

xı =1.14+(0.1)(1.5) 21.25 and yy = —0.1 + (0.1)( 1.2) = —0.22. 
The values of the exact solution, correct to eight digits, are 6(0.2) = 1.3204248 and (0.2) = 
—0.25084701. Thus the values calculated from the Euler method are in error by about 0.0704 and 
0.0308, respectively, corresponding to percentage errors of about 5.3% and 12.3%. 


Now let us use the Runge-Kutta method to approximate $ (0.2) and ~(0.2). To make use of 
the vector-valued equations (7), define 


_(* (fe Y _ (x-4y fi 
s- (Jao (853) -CEES nn Q) 


With h = 0.2 we obtain the following values from equations (7): 


«-()-) 
rn = e(o. (D) (D) (o C0) =(_ 
- (a5) 


( 2342 
~ (—-1.580/^ 


or 


1 
1 
1 1.5 1.15 
wo = e(o. (2) 401(_18)) eis. (138) 
1 1.63 1.326 
wa =t(02,($) +02(_48)) -efo ( 135 


Then, substituting these values in equation (6), we obtain 


1 PRU 1 42 1:5 42 1.63 P 
0 6 \\-1 -12 -1.27 
_ 1.3200667 
~ X —0.25066667 J ` 
These values of x, and y; are in error by about 0.000358 and 0.000180, respectively, with percentage 
errors much less than one-tenth of 1%. 
This example again illustrates the great gains in accuracy that are obtainable by using a more 
accurate approximation method, such as the Runge-Kutta method. In the calculations we have just 
outlined, the Runge-Kutta method requires only twice as many function evaluations as the Euler 


method, but the error in the Euler method is about 200 times greater than that in the Runge-Kutta 
method. 


X] 
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Problems 


In each of Problems | through 5, determine approximate values of 
the solution x = (ft), y = w(t) of the given initial value problem 
att = 0.2, 0.4, 0.6, 0.8, and 1.0. Compare the results obtained by 
different methods and different step sizes. 

@ a. Use the Euler method with h = 0.1. 

Q b. Use the Runge-Kutta method with h = 0.2. 

Q c. Use the Runge-Kutta method with h = 0.1. 


1. x’ =x+yt+t, y =4x—2y; x(0)=1, y(0 =0 

2. x =-tx-—y-—1, y =x; x(0)=1, y(0)=1 

3. x’ =x-—ytxy, y'23x-2y-xy; x(0)=0, y(0)=1 
4. x'—x(1—0.5x — 0.5y), y'= y(—0.25 + 0.5x); 


KOSA y(0) = 1 


5. x’ =exp(—x + y) —cosx, y'= sin(x — 3y); 

x(0) =1, y(0 =2 

@ 6. Consider the example problem x’ = x — 4y, y 2 —x + y 
with the initial conditions x(0) = 1 and y(0) = 0. Use the 
Runge-Kutta method to find approximate values of the solution of this 
problem on the interval 0 < ¢ < 1. Start with h = 0.2, and then 
repeat the calculation with step sizes h = 0.1, 0.05, ... , each half as 
long as in the preceding case. Continue the process until the first five 
digits of the solution at t = 1 are unchanged for successive step sizes. 
Determine whether these digits are accurate by comparing them with 
the exact solution given in equations (8) in the text. 


@ 7. Consider the initial value problem 


x" +x’ 4+3x=t, x(0 =1, x'(0) =2. 


Convert this problem to a system of two first-order equations, and 
determine approximate values of the solution at t = 0.5 and t = 1.0 
using the Runge-Kutta method with h = 0.1. 


@ 8. Consider the general initial value problem x’ = f(t, x, y) 
and y’ = g(t, x, y) with x(t9) = xo and y(19) = yo. The Adams- 
Moulton predictor-corrector method of Section 8.4 generalizes to 


1 
Xp] = Xn + 24! 09 fn = 39 fri F 37 fn-2 ES 9 fn—3)> 


] 
Yat = Yn + 24" 558 = 59g, ik 37g, 2-— 9g, 3) 
and 


1 
= Xn + 24" 0 fai + 19 fn E Sfn—1 + fea) 


Xn+1 


1 
Yn+1 = Yn + zg 8 + 19g; 2 584-1 + En-2). 


Determine an approximate value of the solution at t = 0.4 for the 
example initial value problem x’ = x — 4y, y' = —x + y with 
x(0) = 1, y(0) = 0. Take A = 0.1. Correct the predicted value 
once. For the values of x;, ... , y3 use the values of the exact solution 
rounded to six digits: x, = 1.12735, x = 1.32042, x3 = 1.60021, 
yı = —0.111255, y, = —0.250847, and y3 = —0.429696. 
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In Section 8.1 we discussed some ideas related to the errors that can occur in a numerical 
approximation of the solution of the initial value problem 


y = f(t, y), y(to) = yo. (1) 


In this section we continue that discussion and also point out some other difficulties that can 
arise. Some of the points that we wish to make are fairly difficult to treat in detail, so we will 
illustrate them by means of examples. 


Truncation and Round-Off Errors. 


Recall that, for the Euler method with equal time steps 


of size h, we showed the local truncation error is proportional to h? and, for a finite interval, 
the global truncation error is at most a constant times A. In general, for a method of order p, 
the local truncation error is proportional to h?*! and the global truncation error on a finite 
interval is bounded by a constant times A". For example, Euler's method is an order 1 method. 

To achieve high accuracy, we normally use a numerical procedure for which p is fairly 
large, perhaps 4 or higher. As p increases, the formula used in computing y,,; normally 
becomes more complicated, and hence more calculations are required at each step. However, 
this is usually not a serious problem unless f(t, y) is very complicated or the calculation must 


be repeated very many times. 


If the step size h is decreased, the global truncation error is decreased by the same factor 
raised to the power p. However, as we mentioned in Section 8.1, if h is very small, a great 
many steps will be required to cover a fixed interval, and the global round-off error may be 
larger than the global truncation error. The situation is shown schematically in Figure 8.6.1. We 
assume that the round-off error R, is proportional to the number of computations performed 


Error A 
|E, | + IR] 


|R, | 


hog h 
AUAA The dependence of truncation 


error | E, | (black), round-off error |R, | (blue), and 
total error | E,| + |R,| (red) on the step size A. 


and therefore is inversely proportional to the step size h. On the other hand, the truncation 
error E, is proportional to a positive power of h. From equation (17) of Section 8.1, we know 
that the total error is bounded by |F,,| + | S; |; hence we wish to choose h so as to minimize 
this quantity. The optimum value of / occurs when the rate of increase of the truncation error 
(as h increases) is balanced by the rate of decrease of the round-off error, as indicated in 
Figure 8.6.1. 


EXAMPLE 1 


Consider the example problem 


y =1—t+4y, y0)=1. (2) 


Using the Euler method with various step sizes, calculate approximate values for the solution 
o(t) att = 0.5 and t = 1. Try to determine the optimum step size. 


Solution: 


Table 8.6.1 shows the results of applying Euler's method for nine different values of h. The results 
were obtained using software configured to use only four significant digits. This was done on purpose 
to have round-off errors become significant for larger values of h than if more significant digits are 
used in floating-point operations. The first two columns are the step size h and the number of steps N 
required to traverse the interval 0  / < 1. Then yy» and yy are approximations to $ (0.5) = 8.712 
and 9 (1) = 64.90, respectively. These quantities appear in the third and fifth columns. The fourth 
and sixth columns display the differences between the calculated values and the actual value of the 
solution. 


Approximations to the Solution of the Initial Value Problem 
TABLE 8.6.1 y’ =1-—t-+4y,y(0) = 1 Using the Euler Method with 
Different Step Sizes 


0.01 100 8.390 —0.322 60.12 —4.78 
0.005 200 8.551 —0.161 62.51 = 2.39 
0.002 500 8.633 —0.079 63.75 Seals 
0.001 1000 8.656 —0.056 63.94 —0.96 
0.0008 1250 8.636 —0.076 63.78 ITO 
0.000625 1600 8.616 —0.096 64.35 —0.55 
0.0005 2000 8.772 0.060 64.00 —0.90 
0.0004 2500 8.507 0.205 63.40 —1.50 


0.00025 4000 8.231 0.481 56.77 =, Ils} 
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For relatively large step sizes, the round-off error is much less than the global truncation error. 
Consequently, the total error is approximately the same as the global truncation error, which for the 
Euler method is bounded by a constant times h. Thus, as the step size is reduced, the error is reduced 
proportionally. The first three lines in Table 8.6.1 show this type of behavior. For h = 0.001 the 
error has been further reduced, but much less than proportionally; this indicates that round-off error 
is becoming important. As h is reduced still more, the total error begins to fluctuate, and further 
improvements in accuracy become problematic. For values of h less than 0.0005, the total error is 
clearly increasing, which indicates that round-off error is now the dominant part of the error. 

These results can also be expressed in terms of the number of steps N. For N less than about 
1000, accuracy is improved by taking more steps, while for N greater than about 2000, using more 
steps has an adverse effect. Thus for this problem it is best to use an N somewhere between 1000 
and 2000. For the calculations shown in Table 8.6.1, the best result at t = 0.5 occurs for N = 1000, 
while at t — 1.0 the best result is for N — 1600. 


You should be careful not to read too much into the results shown in Example 1. The 
optimum ranges for h and N depend on the differential equation, the numerical method that 
is used, and the number of digits that are retained in floating-point calculations. Nevertheless, 
it is generally true that if too many steps are required in a calculation, then eventually round- 
off error is likely to accumulate to the point where it seriously degrades the accuracy of the 
procedure. For many problems this is not a concern: for them, any of the fourth-order methods 
we have discussed in Sections 8.3 and 8.4 will produce good results with a number of steps far 
less than the level at which round-off error becomes important. For some problems, however, 
round-off error does become vitally important. For such problems, the choice of method 
may be crucial. This is also one reason why modern codes provide a means of adjusting the 
step size as they go along, using a larger step size wherever possible and a very small step size 
only where necessary. 


Vertical Asymptotes. As a second example, consider the problem of approximating the 
solution y = ġ (t) of 
y=tty, y0)-1. 6) 

Since the differential equation is nonlinear, the existence and uniqueness theorem 
(Theorem 2.4.2) guarantees only that there is a solution in some interval about t = 0. Suppose 
that we try to compute an approximation to the solution of the initial value problem on the 
interval 0 <r < 1 using different numerical procedures. 

If we use the Euler method with h = 0.1, 0.05, and 0.01, we find the following 
approximate values at t — 1: 7.189548, 12.32093, and 90.75551, respectively. The large 
differences among the computed values are convincing evidence that we should use a 
more accurate numerical procedure—the Runge-Kutta method, for example. Using the 
Runge-Kutta method with h = 0.1, we find the approximate value 735.0991 at t = 1, which is 
quite different from those obtained using the Euler method. Repeating the calculations using 
step sizes of h = 0.05 and h = 0.01, we obtain the information shown in Table 8.6.2. 


Approximations to the Solution of the 
TABLE 8.6.2 Initial Value Problem y’ = t? + y?, y(0) = 1 
Using the Runge-Kutta Method 


h t — 0.90 t=1.0 
0.1 14.02182 735.0991 
0.05 14.27117 1.75863 x 10 
0.01 14.30478 2100130102527 
0.001 14.30486 


The values at t = 0.9 are reasonable, and we might well believe that the solution has a 
value of about 14.305 at t = 0.9. However, it is not clear what is happening between t = 0.9 
and t = 1.0. To help clarify this, let us turn to some analytical approximations to the solution 
of the initial value problem (3). Note that on O < t < 1, 


y<rPt+y<1t+y’. (4) 


This suggests that the solution y = $1(1) of 
y-1«y, y0-1 (5) 
and the solution y = @2(t) of 


y=y, y(0)=1 (6) 
are upper and lower bounds, respectively, for the solution y = ¢ (t) of the original problem, 
since all these solutions pass through the same initial point. Indeed, it can be shown (for 
example, by the iteration method of Section 2.8) that @2(t) < 9 (t) < ġı(t) as long as these 
functions exist. The important thing to note is that we can solve initial value problems (5) and 
(6) for ó, and $» by separation of variables. We find that 


T 1 
(t) = un; + i) (t) = m (7) 


Thus jo(f) — coast — l, and $j(f) > wast > Z = 0.785. These 


calculations show that the solution of the original initial value problem exists at least for 
T 
O<t< 4 and at most for 0 < t < 1. The solution of the problem (3) has a vertical 


io U . SLM 
asymptote for some ¢ in T <t < 1 and thus does not exist on the entire interval 0 < t < 1. 


Our numerical calculations, however, suggest that we can go beyond t = 2 and probably 


beyond ¢ = 0.9. Assuming that the solution of the initial value problem exists at t = 0.9 and 
has the value 14.305, we can obtain a more accurate estimate of what happens for larger t by 
considering the initial value problems (5) and (6) with y(0) — 1 replaced by y(0.9) — 14.305. 
Then we obtain 


o\(t) = tan(t + 0.60100), (t) = (8) 


0.96991 — t' 
where five decimal places have been kept in the calculation. Therefore, $1(f) — oo as 
t — ; — 0.60100 = 0.96980 and ġ2(t) — oo as t — 0.96991. We conclude that the 


asymptote of the solution of the initial value problem (3) lies between these two values. This 
example illustrates the sort of information that can be obtained by a judicious combination of 
analytical and numerical work. 


Stability. The concept of stability is associated with the possibility that small errors that are 
introduced in the course of a mathematical procedure may die out as the procedure continues. 
Conversely, instability occurs if small errors tend to increase, perhaps without bound. For 
example, in Section 2.5 we identified equilibrium solutions of a differential equation as 
(asymptotically) stable or unstable, depending on whether solutions that were initially near 
the equilibrium solution tended to approach it or to depart from it as t increased. Somewhat 
more generally, the solution of an initial value problem is asymptotically stable if initially 
nearby solutions tend to approach the given solution, and unstable if they tend to depart from 
it. Visually, in an asymptotically stable problem the graphs of solutions will come together, 
while in an unstable problem they will separate. 

If we are investigating an initial value problem numerically, the best that we can hope for 
is that the numerical approximation will mimic the behavior of the actual solution. We cannot 
make an unstable problem into a stable one merely by approximating its solution numerically. 
However, it may well happen that a numerical procedure will introduce instabilities that were 
not part of the original problem, and this can cause trouble in approximating the solution. 
Avoidance of such instabilities may require us to place restrictions on the step size h. 

To illustrate what can happen in the simplest possible context, consider the differential 
equation 

dy 

dto ry, (9) 
where r is a constant. Suppose that in approximating the solution of this equation, we have 
reached the point (tn, Yn). Let us compare the exact solution of equation (9) that passes through 
this point, namely, 


$t) = y, exp(r(t — ta)), (10) 
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with numerical approximations obtained from the Euler formula 


Yntt = Yn +hf (tn, Yn) (11) 
and from the backward Euler formula 
Yn = Yn + Af (tnsi, Yn). (12) 
From the Euler formula (11) we obtain 
Yn4A = Yn + hry, = Ya( 1 + rh). (13) 


Similarly, from the backward Euler formula (12) we have 


Yn+1 = Yn + hr Yn41, 


or 
Yn 2 
Ymi = LA = (1 +rh+ (rh) +) (14) 
]1—rh 
Finally, evaluating the solution (10) at t, +h, we find that 
(rh)? 
Q (141) = Yn exp(rh) = yn| 1 rh + 2 Spi ls (15) 


Comparing equations (13), (14), and (15), we see that the errors y,1; — (1,41) in both the 
Euler formula and the backward Euler formula are of order h?, as the theory predicts. 

Now suppose that we change the value y, to y, + ô. Think, if you wish, of 6 as the error 
that has accumulated by the time we reach t = f,. The question is whether this error increases 
or decreases in going one more step to f, ,. 

For the exact solution (15), the change in $(f,,;) due to the change 6 in y, is just 
ó exp(rh). This quantity is less than 6 if exp(rh) < 1, or in other words ifr < 0. This 
confirms our conclusion in Chapter 2 that equation (9) is asymptotically stable if r « 0, and 
is unstable if r > 0. 

For the backward Euler method, the change in y,,.; in equation (14) due to replacing y, 
by y, + 6 isó/(1 — rh). Forr < 0 the quantity 1/(1 — rA) is always nonnegative and less 
than 1. Thus, if the differential equation is stable, then so is the backward Euler method for an 
arbitrary step size h. 

On the other hand, for the Euler method, the change in y,,, in equation (13) due to 
replacing y, by y, + 6 is 6(1 + rh). Forr < 0 we can write 1 + rh as 1 — |r|h. Then 
the requirement that |1 + rh| < 1 is equivalent to 


-1« 1-|r|h « 1, or 0« [rh « 2. 


2 
Consequently, h must satisfy h < rl i Thus the Euler method is not stable for this problem 
r 


unless A is sufficiently small. 

The restriction on the step size / in using the Euler method in the preceding example 
is rather mild unless |r| is quite large. Nonetheless, the example illustrates that it may be 
necessary to restrict / in order to achieve stability in the numerical method, even though the 
initial value problem itself is stable for all values of h. Problems for which a much smaller 
step size is needed for stability than for accuracy are called stiff. The backward differentiation 
formulas described in Section 8.4 (of which the backward Euler formula is the lowest order 
example) are the most popular formulas for dealing with stiff problems. The following example 
illustrates the kind of instability that can occur when we try to approximate the solution of a 
stiff problem. 


EXAMPLE2 | AStiff Problem 


Consider the initial value problem 


y'= —100y + 100 +1, y(0) — I. (16) 


Find numerical approximations to the solution for 0 < t < 1 using the Euler, backward Euler, 
and Runge-Kutta methods. Compare the numerical results with the exact solution. 


Solution: 


Since the differential equation is linear, it is easy to solve, and the solution of the initial value problem 
(16) is 


y=o(t) 2e pr, (17) 


A graph of the solution is shown in Figure 8.6.2. There is a thin layer (sometimes called a boundary 
layer) to the right oft = 0 in which the exponential term is significant and the solution varies rapidly. 
Once past this layer, however, o (t) = t and the graph of the solution is essentially a straight line. 
The width of the boundary layer is somewhat arbitrary, but it is certainly small. At £ = 0.1, for 
example, exp( —1007) = 0.000045. 


The solution of the initial value problem y’ = —100y + 1007 + 1, y(0) — I. 


Some values of the solution $ (1), correct to six significant digits, are given in the second column 
of Table 8.6.3. 

If we plan to approximate the solution (17) numerically, we might intuitively expect that a small 
step size will be needed only in the boundary layer. To make this expectation a bit more precise, 
recall from Section 8.1 that the local truncation errors for the Euler and backward Euler methods 
are proportional to ó"(1). For this problem $"(1) = 10*e7!99'. which varies from a value of 10^ at 
t = 0 to nearly zero for t > 0.2. Thus a very small step size is needed for accuracy near t = 0, but 
a much larger step size is adequate once f is a little larger. 

On the other hand, the stability analysis in equations (9) through (15) also applies to this 
problem. Since r = —100 for equation (16), it follows that for stability we need h < 2/|r| = 0.02 
for the Euler method, but there is no corresponding restriction for the backward Euler method. 

Some results obtained from the Euler method are shown in columns three (h = 0.025) and four 
(h = 0.01666 ...) of Table 8.6.3. The values for h = 0.025 are worthless because of instability, 
while those for h = 0.01666 ... are reasonably accurate fort > 0.2. However, comparable accuracy 
for this range of t is obtained for h = 0.1 by using the backward Euler method, as shown by the 
results in column seven of the table. 


Numerical Approximations to the Solution of the 
Initial Value Problem y’ = —100y + 1007 + 1, y(0) = 1 


TABLE 8.6.3 


Backward 

Euler Euler Runge-Kutta Runge-Kutta Euler 

Exact h = 0.025 h=0.0166... h=0.0333... h=0.025 h=0.1 
0.0 1.000000 1.000000 1.000000 1.000000 1.000000 1.000000 

0.05 0.056738 2.300000 | —0.246296 0.470471 

0.1 0.100045 5.162500 0.187792 10.6527 0.276796 0.190909 
0.2 0.200000 25.8289 0.207707 111.559 0.231257 0.208264 
0.4 0.400000 657.241 0.400059 1.24 x 104 0.400977 0.400068 
0.6 0.600000 1.68 x 104 0.600000 1.38 x 10° — 0.600031 0.600001 
0.8 | 0.800000 4.31 x 10? 0.800000 1.54 x 108 0.800001 0.800000 


1.0 1.000000 1.11 x 107 1.000000 1.71 x 10!° 1.000000 1.000000 
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The situation is not improved by using, instead of the Euler method, a more accurate one, such 
as Runge-Kutta. For this problem the Runge-Kutta method is unstable for h = 0.033 ... but stable 
for h = 0.025, as shown by the results in columns five and six of Table 8.6.3. 

The results given in the table for £ = 0.05 and for £ = 0.1 show that in the boundary layer, 
a smaller step size is needed to obtain an accurate approximation. You are invited to explore this 
matter further in Problem 3. 


The following example illustrates some other difficulties that may be encountered in 
dealing with numerical approximations for unstable differential equations. 


EXAMPLE 3 


Consider the task of finding numerical approximations to two linearly independent solutions of the 
second-order linear differential equation 


y'—10x?y =0 (18) 


for t > 0. Note any difficulties that may arise. 


Solution: 


Since we contemplate a numerical approach to this problem, we first convert equation (18) into a 
system of two first-order differential equations so that we can use the methods of Section 8.5. Thus 
we let x, = y and x; = y’. In this way we obtain the system 


xi =x 3-100 x, 


: 0 1 as 
x= X. 
10r? 0 


The eigenvalues and eigenvectors of the coefficient matrix in equation (19) are 


1 1 
= (D : - YY 
r =y 10r, ESV = (o r2 = —4/ 10r, E = ( a (20) 


so two linearly independent solutions of the system (19) are 


1 1 
(1) = NATOS (2) = =y 0n. 21 
db (a) B (yo) P 


The corresponding solutions of the second-order differential equation (18) are the first components 
of x P (1) and x (t): (1) = ev OTt and y(t) = eV 0r, respectively. 
We also wish to consider another pair of linearly independent solutions obtained by forming 
linear combinations of x“) (t) and x(t): 
cosh (^ / irr) 


or, if x = (x1, x2)", 


ny Tau a WG = (22) 
2 2 /107 sinh (/1071) 
and 
sinh | \/107¢t 
xP) = x = OW - | Q3) 


V/10n cosh ( V/10n ) 


Although the expressions for x? (1) and x(t) are quite different, recall that for large t we have 
cosh (^ / irr) = Lev rt and sinh (4 / Ion) = Lev Ont, Thus if ¢ is large enough and only a 


fixed number of digits are retained, then numerically the two vector functions x(t) and x (t) 
look exactly the same. For instance, correct to eight digits, we have, for t = 1, 


sinh ( (107) = cosh( y 107) = 10,315.894. 


If we retain only eight digits, then the two solutions x(t) and x(? (1) are identical at t = 1 and 
indeed for all £ > 1. Even if we retain more digits, eventually the two solutions will appear 
(numerically) to be identical. This phenomenon is called numerical dependence. 


Which solution a numerical method produces depends on the initial condition. In particular, x“ is 
T T 
obtained with the initial condition x(0) = (1 547 107) . x? from x(0) = (1 ,-vV/ 107) , xO 


from x(0) = (1, 0)”, and x from x(0) = (0, 1)". 
For the system (19) we can avoid the issue of numerical dependence by calculating instead the 


solutions x‘ (r) and x (t). From equation (21) we know that x‘"(t) is proportional to eV Onr, 


while x‘?)(r) is proportional to e~V!"', so they behave very differently as t increases. Even so, 
we encounter difficulty in calculating x ? (1) correctly on a large interval. Recall that x? (1) is the 


solution of equation (19) subject to the initial condition 


1 
x(0) = ( m.) Q4) 


If we attempt to approximate the solution of the initial value problem (19), (24) numerically, then 
at each step of the calculation we introduce truncation and round-off errors. Thus, at any point tn, 
the data to be used in advancing to the next point are not precisely the values of the components of 
x) (1,). The solution of the initial value problem with these data at ¢, involves not only e- Tort 
but also ev rt . Because the error in the data at t, is small, the latter function appears with a very 
small coefficient. Nevertheless, since eri tends to zero and ev ort grows very rapidly, the latter 
eventually dominates, and ultimately the calculated solution is very far from x? (1). 

To be specific, suppose that we try to approximate the solution of the initial value problem (19), 
(24), the first component of which is the solution y2(t) = eV Ort of the second-order initial value 
problem 


y’—10r?2y=0, y(0)=1, y(0)—-4/10m. (25) 


Using the Runge-Kutta method with a step size h = 0.01 and keeping eight digits in the calculations, 
we obtain the results in Table 8.6.4. It is clear from these results that the numerical approximation 
begins to deviate significantly from the exact solution for tf > 0.5, and soon differs from it by 
many orders of magnitude. The reason is the presence, in the numerical approximation, of a small 
component of the exponentially growing quantity ev Ort, With eight-digit arithmetic we can expect 
a round-off error of the order of 107? at each step. Since ev l0nt grows by a factor of 3.7 x 10?! from 
t = 0 to t = 5, an error of order 107? near t = 0 can produce an error of order 10! at £ = 5 even 
if no further errors are introduced in the intervening calculations. The results given in Table 8.6.4 
demonstrate that this is exactly what happens. 


Exact Solution of y" — 107?y = 0, y(0) = 1, y'(0) = —4/10z 
TABLE 8.6.4 and Numerical Approximation Using the Runge-Kutta Method 
with h = 0.01 


Runge-Kutta (h = 0.001) 


0.0 1.0000 1.0000 

0.25 8.3439 x 10? 8.3438 x 10? 
0.5 6.9623 x 1073 6.9620 x 10? 
0.75 5.8409 x 1074 5.8089 x 1074 
1.0 8.6688 x 105 4.8469 x 1075 
1.5 5.4900 x 10? 3.3744 x 1077 
20) 1E8852 Ome 2.3492 x 10? 
2.5 1.1326 x 10? 1.6355 x 10-!! 
3.0 1.6268 x 10* 1.1386 x 10-? 
3.5 2.3368 x 106 7.9272 x 10-16 
4.0 3.3565 x 108 5.5189 x 10-15 
4.5 4.8211 x 10!9 3.8422 x 10-?9 
5.0 6.9249 x 1022 2.6749 x 10-2? 
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Problems 


You should bear in mind that the numerical values of the entries in the second column 
of Table 8.6.4 are extremely sensitive to slight variations in how the calculations are executed. 
Regardless of such details, however, the exponential growth of the approximation will be clearly 
evident. 

Equation (18) is highly unstable, and the behavior shown in this example is typical of unstable 
problems. We can track a solution accurately for a while, and the interval can be extended by using 
smaller step sizes or more accurate methods, but eventually the instability in the problem itself takes 
over and leads to large errors. 


Some Comments on the Selection of a Numerical Method. In this chapter we 
have introduced several numerical methods for approximating the solution of an initial 
value problem. We have tried to emphasize some important ideas while limiting the level 
of complexity. For one thing, except for the comments at the end of Section 8.2, we 
have always used a uniform step size, whereas production codes that are currently in use 
provide for varying the step size as the calculation proceeds. 

There are several considerations that must be taken into account in choosing step sizes. 
Of course, one is accuracy; too large a step size leads to an inaccurate result. Normally, an 
error tolerance is prescribed in advance, and the step size at each step must be consistent with 
this requirement. As we have seen, the step size must also be chosen so that the method is 
stable. Otherwise, small errors could grow and render the subsequent computations worthless. 
Finally, for implicit methods an equation must be solved at each step, and the method used to 
solve the equation may impose additional restrictions on the step size. 

In choosing a method, one must also balance the considerations of accuracy and stability 
against the amount of time required to execute each step. An implicit method, such as the 
Adams-Moulton method, requires more calculations for each step, but if its accuracy and 
stability permit a larger step size (and consequently fewer steps), then this may more than 
compensate for the additional calculations. The backward differentiation formulas of moderate 
order (say, four) are highly stable and are therefore indicated for stiff problems, for which 
stability is the controlling factor. 

Some current production codes also permit the order of the method to be varied, as well as 
the step size, as the calculation proceeds. The error is estimated at each step, and the order and 
step size are chosen to satisfy the prescribed error tolerance. In practice, Adams methods up to 
order twelve and backward differentiation formulas up to order five are in use. Higher-order 
backward differentiation formulas are unsuitable because of a lack of stability. 

Finally, we note that the smoothness of the function f — that is, the number of continuous 
derivatives that it possesses— is a factor in choosing the order of the method to be used. High- 
order methods lose some of their accuracy if f is not smooth to a corresponding order. 

A numerical analysis course is likely to provide a more in-depth investigation of errors, 
stability, and efficiency. Similar information can be found in the References at the end of this 
chapter. 


1. To obtain some idea of the possible dangers of small errors in the Using the Runge-Kutta method with step size h, we obtain the results 
initial conditions, such as those due to round-off, consider the initial ^ in Table 8.6.5. These results suggest that the solution has a vertical 


value problem 


asymptote between t = 0.9 and t = 1.0. 


y'=t+y—3, y(0) =2. Approximations to the Solution of the 
uu TABLE 8.6.5 Initial Value Problem y’ = t? + e", 
a. Show that the solution is y = @,(t) —2— t. y(0) = 0 Using the Runge-Kutta Method 


b. Suppose that in the initial condition a mistake is made, and 
2.001 is used instead of 2. Determine the solution y = ¢2(f) in 
this case, and compare the difference ¢2(t) — ġı(t) att = 1 and 


ast > oo. 


2. Consider the initial value problem 


y=rte’, 


0.02 3.42985 > 10 


0.01 3.42982 > 1038 
y(0) = 0. (26) 


a. Let y = ó(t) be the solution of initial value problem (27). 
Further, let y = $,(f) be the solution of 


y =1+e, y(0) =0, (27) 


and let y = ġ2(t) be the solution of 


y(0) =0. (28) 


| 
y SE, 


Show that 


Qt) € o(t) € t) (29) 


on some interval, contained in O < ż < 1, where all three 
solutions exist. 

b. Determine $ (1) and $5(1). Then show that 6 (f) — oo for 
some f£ between t = In 2 = 0.69315 and t = 1. 

C. Solve the differential equations y’ = e” and y' = 1 + e”, 
respectively, with the initial condition y(0.9) — 3.4298. Use the 
results to show that @(t) — oo when t = 0.932. 


3. Consider again the initial value problem (16) from Example 2. 
Investigate how small a step size h must be chosen to ensure that the 
error at t = 0.05 and at t = 0.1 is less than 0.0005. 

o a. Use the Euler method. 
o b. Use the backward Euler method. 
Q c. Use the Runge-Kutta method. 
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4. Consider the initial value problem 


y = —10y -2.5:? -0.5t, (0) = 4. 


a. Find the solution y = ¢ (t) and draw its graph forO < t < 5. 
(9 b. The stability analysis in the text suggests that for this 
problem, the Euler method is stable only for h < 0.2. Confirm 
that this is true by applying the Euler method to this problem for 
0 x t <5 with step sizes near 0.2. 

@ c. Apply the Runge-Kutta method with various step sizes to 
this problem for 0 < t < 5. What can you conclude about the 
stability of this method? 

@ d. Apply the backward Euler method with various step sizes 
to this problem for 0 < t < 5. What step size is needed to ensure 
that the error at t = 5 is less than 0.01? 


In each of Problems 5 and 6: 
a. Find a formula for the solution of the initial value problem, 
and note that it is independent of A. 
@ b. Use the Runge-Kutta method with h = 0.01 to compute 
approximate values of the solution for O < t£ < 1 for various 
values of À such as À = 1, 10, 20, and 50. 
c. Explain the differences, if any, between the exact solution 
and the numerical approximations. 


5. y'—dy=1-At, (0) =0 
6. y —Ay 2t - AO, y(0) 20 
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There are many differential equations, especially nonlinear ones, that are not susceptible to 
analytical solution in any reasonably convenient manner. Numerical approximation methods, 
such as those discussed in Chapter 8, provide one means of dealing with these equations. 
Another approach, presented in this chapter, is geometric in character and leads to a qualitative 
understanding of the behavior of solutions rather than to detailed quantitative information. 


9.1 The Phase Plane: Linear Systems 


Since many differential equations cannot be solved conveniently by analytical methods, it 
is important to consider what qualitative! information can be obtained about their solutions 
without actually solving the equations. The questions that we consider in this chapter are 
associated with the idea of stability of a solution, and the methods that we employ are basically 
geometric. Both the concept of stability and the use of geometric analysis were introduced in 
Chapter 1 and used in Section 2.5 for first-order autonomous equations 


— = f(y). (1) 


In this chapter we refine the ideas and extend the discussion to autonomous systems of 
equations. We are particularly interested in nonlinear systems because they typically cannot 
be solved in terms of elementary functions. Further, we consider primarily systems of two 
equations because they lend themselves to geometric analysis in a plane, rather than in a higher- 
dimensional space. 

However, before taking up nonlinear systems, we want to summarize some of the results 
that we obtained in Chapter 7 for two-dimensional systems of first-order linear homogeneous 
equations with constant coefficients. Such a system has the form 

dx 

a5 Ax, (2) 
where A is a2 x 2 constant matrix and x is a2 x 1 vector. In Sections 7.5 through 7.8, we found 
that we could solve such systems by seeking solutions of the form x = e”. By substituting 
for x in equation (2), we find that 


(A — rD£ — 0. (3) 


! The qualitative theory of differential equations was created by Henri Poincaré (1854-1912) in several major papers 
between 1880 and 1886. Poincaré was a professor at the University of Paris and is generally considered the leading 
mathematician of his time. He made fundamental discoveries in several different areas of mathematics, including 
complex function theory, partial differential equations, and celestial mechanics. In a series of papers beginning in 
1894, he initiated the use of modern methods in topology. In differential equations he was a pioneer in the use of 
asymptotic series, one of the most powerful tools of contemporary applied mathematics. Among other things, he used 
asymptotic expansions to obtain solutions about irregular singular points, thereby extending the work of Fuchs and 
Frobenius discussed in Chapter 5. 
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Thus r must be an eigenvalue and € a corresponding eigenvector of the coefficient matrix A. 
The eigenvalues are the roots of the polynomial equation 


det(A — rI) = 0, (4) 


and the eigenvectors are determined from equation (3) up to an arbitrary multiplicative 
constant. Although we presented graphs of solutions of equations of the form (2) in Sections 
7.5, 7.6, and 7.8, our main emphasis there was on finding a convenient expression for the 
general solution. Our goal in this section is to bring together the geometric information for 
linear systems in one place and to develop the ability to sketch trajectories in the phase plane 
by hand. We use this information in the rest of the chapter to introduce comparable qualitative 
methods that can be applied to much more difficult nonlinear systems. 

In Section 2.5 we found that points where the right-hand side of equation (1) is zero are of 
special importance. Such points correspond to constant solutions, or equilibrium solutions, 
of equation (1) and are often called critical points. Similarly, for the system (2), points where 
Ax = 0 correspond to equilibrium (constant) solutions, and again they are called critical 
points. We will assume that A is nonsingular, or that detA + O. It follows that x = 0 is 
the only critical point of the system (2). 

Recall that a solution of equation (2) is a vector function x = x(t) that satisfies the 
differential equation. Such a function can be viewed as a parametric representation for a curve 
in the x,x -plane. It is often useful to regard this curve as the path, or trajectory, traversed 
by a moving particle whose velocity dx/dt is specified by the differential equation. The x,x»- 
plane itself is called the phase plane, and a representative set of trajectories is referred to as a 
phase portrait. 

In analyzing the system (2), we must consider several different cases, depending on the 
nature of the eigenvalues of A. We will characterize the differential equation according to 
the geometric pattern formed by its trajectories in a phase portrait. In each case we discuss the 
behavior of the trajectories in general and illustrate it with an example. With a little practice 
it is easy to visualize or to sketch a qualitatively correct phase portrait once you know the 
eigenvalues and eigenvectors of the system. It is important that you become familiar with 
the types of behavior that the trajectories have for each case, because these are the basic 
ingredients of the qualitative theory of linear and nonlinear differential equations. 


CASE 1 Real, Unequal Eigenvalues of the Same Sign. In this case, the general solution of 
equation (2) is 


x= ci £O et! 4 cog Oen, (5) 


where r; and r, are either both positive or both negative. Suppose first that 
rj < r3 < Oand that the eigenvectors €“) and €) are as shown in Figure 9.1.1. 
It follows from equation (5) that x — 0 as t > co regardless of the values of c; 
and c2; in other words, all solutions approach the critical point at the origin as 
t — oo. 

If the solution starts at an initial point on the line through €“), then c; = 0. 
This solution remains on the line through £(" for all t and approaches the origin as 
t — oo. Similarly, if the initial point is on the line through €™, then the solution 
approaches the origin along that line. 

In the general situation, it is helpful to rewrite equation (5) in the form 


x= e! (eigen 4 c£? ). (6) 


Observe that rı — r2 < 0. Therefore, as long as cp + 0, the term 
ci & P exp((r1 — r2)t) is of negligible magnitude compared to that of c.é for t 
sufficiently large. Thus, as t — oo, the trajectory not only approaches the origin but 
also tends toward the line through £(? . Hence all solutions are tangent to £? at the 
critical point except for those solutions that start exactly on the line through €”. 
Several trajectories are sketched in Figure 9.1.1. This type of critical point is called 
a node or a nodal sink. 
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CASE 2 


Trajectories in the phase 


plane when the origin is a node with 
rj <1ry <0. The solid black and dashed 
black curves show the fundamental 
solutions £ P e"! and £O? e"? , respectively. 


Let us now look backward in time and inquire what happens as t — — oo. Still 
assuming that rj < r5 < 0, we note that if c} + 0, then the dominant term in 
equation (5) as  — —oo is the term involving e^". Thus, except for the trajectories 
lying along the line through £C, for large negative t the trajectories have slopes that 
are very nearly that of the eigenvector £(!. This is also indicated in Figure 9.1.1. 

If rı and r, are both positive, and 0 < rz < rı, then the trajectories have the 
same pattern as in Figure 9.1.1, but the direction of motion is away from, rather than 
toward, the critical point at the origin. In this case x; and x? grow exponentially as 
functions of t. Again, the critical point is called a node or a nodal source. 

Another example of a node occurs in Example 3 of Section 7.5, and its 
trajectories and component plots are shown in Figure 7.5.4. 


Real Eigenvalues of Opposite Sign. The general solution of equation (2) is 
x — c, £U e! 4 cg e, (7) 


where r; > 0 and rz < 0. Suppose that the eigenvectors £(? and € are as shown 
by the solid and dashed black curves in Figure 9.1.2. If the solution starts at an initial 
point on the line through €“”, then it follows that c; = 0. Consequently, the solution 
remains on the line through €“) for all t, and since rı > 0, ||x|| > oo ast > oo. 

If the solution starts at an initial point on the line through €, then it always 
remains on this line and ||x|| — 0 as t — oo because r2 < 0. For solutions starting 
at other initial points the positive exponential is the dominant term in equation (7) 
for large t, so eventually all these solutions approach infinity asymptotic to the 
line determined by the eigenvector £(? corresponding to the positive eigenvalue 
rj. The only solutions that approach the critical point at the origin are those that 
start precisely on the line determined by £O. 

For large negative f, the dominant term in equation (7) is the negative 
exponential, so a typical solution is asymptotic to the line through the eigenvector 
¿© as t — —oo. The exceptions are those solutions that lie exactly on the line 
through the eigenvector £(' ; these solutions approach the origin as t > —oo. The 
phase portrait shown in Figure 9.1.2 is typical of the case in which the eigenvalues 
are real and of opposite sign. The origin is called a saddle point in this case. 

Another example of a saddle point occurs in Example 2 of Section 7.5, and its 
trajectories and component plots are shown in Figure 7.5.2. 
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Trajectories in the phase 
plane when the origin is a saddle point with 
rj > 0,r < 0. The solid black and dashed 
black curves show the fundamental solutions 
EYe"l and £ e"? , respectively. 


CASE 3 Equal Eigenvalues. We now suppose that r; = r2 = r. We consider the case in 
which the eigenvalues are negative; if they are positive, the trajectories are similar 
but the direction of motion is reversed. There are two subcases, depending on 
whether the repeated eigenvalue has two independent eigenvectors or only one. 

(a) Two independent eigenvectors. The general solution of equation (2) is 


x= CE e + EMC", (8) 


where €“!) and £C) are the independent eigenvectors. The ratio x» / x, is independent 
of t but depends on the components of ¿£ and €) and on the arbitrary constants 
c, and c2. Thus every trajectory lies on a straight line through the origin, as shown 
in Figure 9.1.3a and the graphs of the x; and x? components are exponential curves, 
as shown in Figures 9.1.35 and 9.1.3c, respectively. The critical point is called a 
proper node, or sometimes a star point. 

(b) One independent eigenvector. As shown in Section 7.8, the general 
solution of equation (2) in this case is 


x = cj £e" + c( £e"! + ne"), (9) 


where £ is an eigenvector and 7 is a generalized eigenvector associated with the 
repeated eigenvalue, that is, (A — rI) = 0 and (A —rI)n = €. For large t 
the dominant term in equation (9) is c5£te"'. Thus, as £ — co, every trajectory 
approaches the origin tangent to the line through the eigenvector. This is true even 
if c; = 0, for then the solution x = c,£e"' lies on this line. Similarly, for large 
negative ¢ the term c5£te"' is again the dominant one, so as £ — —oo, the slope of 
each trajectory approaches the slope of the eigenvector £. 

The orientation of the trajectories depends on the relative positions of £ and 
7. One possible situation is shown in Figure 9.1.4a. To locate the trajectories, it is 
helpful to write the solution (9) in the form 


x = ((ei£ + en) + e&t)e" = ye", (10) 


where y = (c1Ẹ + co) + c2Ẹt. Observe that the vector y determines the direction 
of x, whereas the scalar quantity e"' affects only the magnitude of x. Also note that 
for fixed values of c, and c3, the expression for y is a vector equation of the straight 
line through the point c,€ + c27 and parallel to €. 

To sketch the trajectory corresponding to a given pair of values of c, and cz, you 
can proceed in the following way. First, draw the line given by (c1 + c27) + c2&t 
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Xi A Xo A 
3 
2 = 
2 
3 t 
3 t 
-1 
xp 
-2L 
(b) (c) 
(a) Trajectories in the phase plane when the origin is a proper node with r; = r2 < 0. (b) and (c) 


show the corresponding component plots for x, vs. t and x» vs. t, respectively. The solid black and dashed black 
curves show the fundamental solutions £ ? e"!' and € e", respectively. The purple curve is for the solution that 
passes through ( —1, 1), orange through ( —1/2, —1), green ( 23/2, 3), red (1, 1/2), brown (1/2, —1/4), and gold 
(0, 2). 


and note the direction of increasing t on this line. Two such lines are shown in 
Figure 9.1.4a, one for c; > 0 and the other for c; < 0. Next, note that the given 
trajectory passes through the point c; -- c51] when t = 0. Further, as t increases, the 
direction of the vector x given by equation (10) follows the direction of increasing t 
on the line, but the magnitude of x rapidly decreases and approaches zero because of 
the decaying exponential factor e"'. Finally, as t decreases toward —oo, the direction 
of x is determined by points on the corresponding part of the line, and the magnitude 
of x approaches infinity. In this way we obtain the black trajectories in Figure 9.1.4a. 
A few other trajectories are sketched as well, to help complete the diagram. 

The other possible situation is shown in Figure 9.1.45, where the relative 
orientation of € and 77 is reversed. This reverses the orientation of the trajectories, 
as can be seen in Figure 9.1.4b. 

If r, = rj > 0, you can sketch the trajectories by following the same 
procedure. In this event the trajectories are traversed in the outward direction, and 
the orientation of the trajectories with respect to that of £ and 77 is also reversed. 

When a double eigenvalue has only a single independent eigenvector, the 
critical point is called an improper or degenerate node. A specific example of 
this case is Example 2 in Section 7.8; the trajectories are shown in Figure 7.8.2. 


9.1 The Phase Plane: Linear Systems 


N 
hs 
Increasing t N t 


(a) (b) 


(a) The phase plane for an improper node with eigenvalues r; = r; < 0 
and one independent eigenvector £. (b) The phase plane for a system with the same 
eigenvalues rj = r; < 0 and eigenvector € but a different generalized eigenvector n. 


CASE 4 Complex Eigenvalues with Nonzero Real Part. Suppose that the eigenvalues are 
A + ip, where A and u are real, A zz 0, and jj > O. In Section 7.6 it was 
shown that it is possible to write the general solution in terms of the eigenvalues 
and eigenvectors. However, we proceed in a different way. 

Let us consider the system 


is s (11) 


xi = Àx + Mx, xX% = —pxy + Ax. (12) 


whose scalar form is 


It is sufficient to examine the system (11) since every 2 x 2 system with eigenvalues 
À + ip can be converted into the form (11) by a linear transformation (an example 
of how this can be done is found in Problem 19). We introduce the polar coordinates 
r, 0 given by 


2 2 x2 
Prax +x, tand=—. 
1 2 X] 


By differentiating these equations, we obtain 
X4X5 — Xx! 

rr! = xix, + x2x5, (sec^8)0! = ERN (13) 

x 
1 
Substituting from equations (12) in the first of equations (13), we find that 
P eR (14) 

and hence 


r=ce™, (15) 


where c is a constant. Similarly, substituting from equations (12) in the second of 
equations (13), and using the fact that sec’) = rx; we have 


0' = —y. (16) 
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(a) 


Trajectories in the phase plane for a linear system with eigenvalues \ + ip. 
(a) A spiral sink, A < 0, and (b) a spiral source, A > 0. 


Hence 
0 = —ut + 06, (17) 


where 09 is the value of 0 when t = 0. 

Equations (15) and (17) are parametric equations in polar coordinates of the 
trajectories of the system (11). Since u > O, it follows from equation (17) that 
0 decreases as t increases, so the direction of motion on a trajectory is clockwise. 
As t — co, we see from equation (15) that r — 0 if À < 0 and r — oo if A > 0. 
Thus the trajectories are spirals, which approach or recede from the origin, 
depending on the sign of A. The case when A < 0 is shown in Figure 9.1.5a. 
Figure 9.1.5b shows the corresponding graph when A > 0. The critical point is called 
a spiral point in these cases. Frequently, the terms spiral sink and spiral source, 
respectively, are used to refer to spiral points whose trajectories approach (A « 0), 
or depart from (A > 0), the critical point. 

More generally, the trajectories are spirals for any system with complex 
eigenvalues A + i u, where A + 0. The spirals are traversed in the inward or outward 
direction, depending on whether A is negative or positive. They may be elongated 
and skewed with respect to the coordinate axes, and the direction of motion may be 
either clockwise or counterclockwise. Further, it is easy to obtain a general idea of 
the orientation of the trajectories directly from the differential equations. Suppose 
that 


dx 


dt E a b C) T 
dy [ s) y a 


dt 


has complex eigenvalues A + ių, and look at the point (0, 1) on the positive 
y-axis. At this point it follows from equations (18) that dx /dt = b and dy /dt = d. 
Depending on the signs of b and d, we can infer the direction of motion and the 
approximate orientation of the trajectories. For instance, if b is positive and d is 
negative, then the trajectories cross the positive y-axis so as to move up and into the 
second quadrant. If A < Oalso, then the trajectories must be inward-pointing spirals 
such as those in Figure 9.1.5(a); when A > 0 the trajectories spiral away from the 
origin as in Figure 9.1.5(b). Another case was given in Example 1 of Section 7.6, 
whose trajectories and component plots are shown in Figure 7.6.2. 
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CASES Pure Imaginary Eigenvalues. In this case A = 0, and the system (11) reduces to 


ges e. ac (19) 


with eigenvalues + i u. Using the same argument as in Case 4, we find that 


r-0, 0'—-u, (20) 
and consequently, 
r=c, 0 =-—ut+ bo, (21) 


where c and 0, are constants. Thus the trajectories are circles, with center at the 
origin, that are traversed clockwise if u > 0 and counterclockwise if uj < 0. A 
complete circuit about the origin is made in a time interval of length 27 / u, so all 
solutions are periodic with period 27 / u. The critical point is called a center. 

In general, when the eigenvalues are pure imaginary, it is possible to show 
(see Problem 16) that the trajectories are ellipses centered at the origin. A typical 
situation is shown in Figure 9.1.6, which also includes some typical graphs of x; 
versus f and x» versus t. See also Example 3 in Section 7.6, especially Figures 7.6.3 
and 7.6.4. 


X2 


(b) (c) 


(a) Trajectories in the phase plane when the linear system has eigenvalues + i p. (b) and (c) 


show the component plots for x, vs. f and x2 vs. t, respectively. The orange curves pass through (1/2, 0), 
red through (1, 0), and the green through (2, 0). 
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Reflecting on these five cases and examining the corresponding figures, we can 
make several observations: 


1. After a long time, each individual trajectory exhibits one of only three types of 
behavior. As t — oo, each trajectory approaches the critical point x = 0, repeatedly 
traverses a closed curve (corresponding to a periodic solution) that surrounds the 
critical point, or becomes unbounded. 

2. Viewed as a whole, the pattern of trajectories in each case is relatively simple. 
To be more specific, through each point (xo, yo) in the phase plane there is only 
one trajectory; thus trajectories do not cross each other. Do not be misled by the 
figures, in which it sometimes appears that many trajectories pass through the critical 
point x — 0. In fact, the only solution passing through the origin is the equilibrium 
solution x — 0. The other solutions that appear to pass through the origin actually 
only approach this point as  — oo ort > —oo. 


3. In each case, the set of all trajectories is such that one of three situations occurs. 


(a) All trajectories approach the critical point x = 0 as t — oo. This is the case if 
the eigenvalues are real and negative or complex with negative real part. The 
origin is either a nodal sink or a spiral sink. 

(b) All trajectories remain bounded but do not approach the origin as t — oo. This 
is the case if the eigenvalues are pure imaginary. The origin is a center. 

(c) Some trajectories, and possibly all trajectories except x = 0, become unbounded 
as t — oo. This is the case if at least one of the eigenvalues is positive or if the 
eigenvalues have positive real part. The origin is a nodal source, a spiral source, 
or a saddle point. 


The situations described in 3(a), (b), and (c) above illustrate the concepts 
of asymptotic stability, stability, and instability, respectively, of the equilibrium 
solution x — 0 of the system (2). The precise definitions of these terms will be 
given in Section 9.2, but their basic meaning should be clear from the geometric 
discussion in this section. The information that we have obtained about the system 
(2) is summarized in Table 9.1.1. Also see Problems 17 and 18. 


Stability Properties of Linear Systems x' — Ax with det(A — rI) — 0 


TABLE 9.1.1 


and det A 4 0 
Eigenvalues Type of Critical Point Stability 

Tj E50 Node Unstable 
ri<r <0 Node Asymptotically stable 
ro «rm Saddle point Unstable 
ry—rj,»0 Proper or improper node Unstable 
VA EU «e U Proper or improper node Asymptotically stable 
r,rj2—A-iu 

A20 Spiral point Unstable 

A«0 Spiral point Asymptotically stable 

À-0 Center Stable 


The analysis in this section applies only to second-order systems x’ = Ax with 
det A z 0 whose solutions can be visualized as curves in the phase plane. A similar, 
though more complicated, analysis can be carried out for ann" order system, with an 
n x n coefficient matrix A, whose solutions are represented geometrically by curves 
in an n-dimensional phase space. The cases that can occur in higher-order systems 
are essentially combinations of those we have seen in two dimensions. For instance, 
in a third-order system with a three-dimensional phase space, one possibility is that 
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solutions in a certain plane may spiral toward the origin, while other solutions may 
tend to infinity along a line transverse to this plane. This will be the case if the 
coefficient matrix has two complex eigenvalues with negative real part and one 
positive real eigenvalue. However, because of their complexity, we will not discuss 


systems of higher than second order. 


Problems 


For each of the systems in Problems 1 through 10: 
a. Find the eigenvalues and eigenvectors. 
b. Classify the critical point (0, 0) as to type, and determine 
whether it is stable, asymptotically stable, or unstable. 
c. Sketch, by hand (without using any graphing device), several 
trajectories in the phase plane. 
@ d. Use an appropriate graphing device to plot accurately 
several trajectories in the phase plane and the corresponding 
graphs of x, versus f and x» versus f. 


& 2 


TE 
x [. z) 
«(i i) 
«(i 3) 
«(i 3) 
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In each of Problems 11 through 13, determine the critical point x = x°, 


and then classify its type and examine its stability by making the 
transformation x = x? + u. 


" (i a)-) 
1 —1 0 
eb ei ai 
n2. e 3) 5) 
0 = 
13. D )* G5) a, B,y,6 > 0 


14. The equation of motion of a spring—mass system with damping 
(see Section 3.7) is 


where m, c, and k are positive. Write this second-order equation as a 
system of two first-order equations for x = u, y = du / dt. Show that 
x =0, y =0 is a critical point, and analyze the nature and stability 
of the critical point as a function of the parameters m, c, and k. Note 
that the same analysis can be applied to the electric circuit equation 


(see Section 3.7) 


po yR” + 5 0 
dt? dt € 
15. Consider the system x' — Ax, and suppose that A has one zero 


eigenvalue. 
a. Show that det A = 0. 
b. Show that x = 0 is a critical point and that, in addition, every 
point on a certain straight line through the origin is also a critical 
point. 
c. Letr; = Oand r; Æ 0, and let ¿£ and €™ be corresponding 
eigenvectors. Show that the trajectories are as indicated in Figure 
9.1.7. 


AAU: IAEA Trajectories for a linear system 


with nonisolated critical points, that is r; = 0 and 
r2 X 0. Every point on the line through £V! is a 
critical point. 


What is the direction of motion on the trajectories? What 
determines the direction of motion along a trajectory? 


16. Inthis problem we indicate how to show that the trajectories are 
ellipses when the eigenvalues are pure imaginary. Consider the system 


à 
x a 
y an 422) NY 
a. Show that the eigenvalues of the coefficient matrix are pure 


imaginary if and only if 


a1 +422 =0, anan — apan > 0. (23) 


b. The trajectories of the system (22) can be found by converting 
equations (22) into the single equation 
dy = dy /dt = 51X + any 
dx — dx/dt ax tay" 


(24) 


Use the first of equations (23) to show that equation (24) is an 
exact differential equation. (See Section 2.6.). 
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C. Solve equation (24) as an exact differential equation and 
show that 


anx? + 2Zayxy — ayy? =k, (25) 


where k is a constant. Use equations (23) to conclude that the 
graph of equation (25) is always an ellipse. Hint: What is the 
discriminant of the quadratic form in equation (25)? 


17. Consider the linear system 


dx d 
ae + ayy, E 
where a1, 412, 421, and az are real-valued constants. Let 
p = 4), 45,4 = 41165; — à1245,, and A = p? — 4q. Observe that 
p and q are the trace and determinant, respectively, of the coefficient 
matrix of the given system. Show that the critical point (0, 0) is a 

a. Nodeifq > O0 and A > 0; 

b. Saddle pointifq < 0; 

C. Spiral point if p # 0 and A < 0; 

d. Center if p = 0 and q > 0. 
Hint: These conclusions can be reached by studying the eigenvalues r, 
and r2. It may also be helpful to establish, and then to use, the relations 
rır =q and rı +r. = p. 


= aX + any, 


18. Continuing Problem 17, show that the critical point (0, 0) is 

a. Asymptotically stable if q > 0 and p < 0; 

b. Stable if g > Oand p = 0; 

C. Unstable ifg < O or p > 0. 
The results of Problems 17 and 18 are summarized visually in 
Figure 9.1.8. 


q 
Asymp. stable, spiral point Unstable, spiral point 
Stable 


Asymp. stable 


Unstable, saddle point 


A=p?-4q>0 | 2X 


LAUETA Stability and classification of critical point (0, 0) 
as a function of p = a, + a»; and q = 411472 — 412421. 


19. In this problem we illustrate how a 2 x 2 system with eigen- 
values À + iji can be transformed into the system (11). 


2 —2.5 
x= x = Ax. (26) 
1.8 —1 


a. Show that the eigenvalues of this system are r; 2 = 0.541.5i. 
b. Show that the eigenvector corresponding to r; is 


wal 5 \_(\,,f ° 
ex)" e 


C. LetP bethe matrix whose columns are the real and imaginary 


parts of £(?. Thus 
P : i (28) 
(wv -3 


Let x — Py and substitute for x in equation (26). Show that 
y = (PAP). Q9) 


d. Find P~! and show that 


B 05 L5 
P'AP = ! (30) 
-L5 0.5 


Thus equation (30) has the form of equation (11). 
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In this section we begin to draw together, and to expand on, the geometric ideas introduced in 
Section 2.5 for certain first-order equations and in Section 9.1 for systems of two first-order 
linear homogeneous equations with constant coefficients. These ideas concern the qualitative 
study of differential equations and the concept of stability, an idea that will be defined precisely 
later in this section. 


Autonomous Systems. 
equations of the form 


We are concerned with systems of two simultaneous differential 


n ora, Seq» (1) 
== x, , = , s 
dt pe di m 


Assume that the functions F and G are continuous and have continuous partial derivatives in 
some domain D of the x y-plane. If (xo, yo) is a point in this domain, then by Theorem 7.1.1 


9.2 Autonomous Systems and Stability 


there exists a unique solution x = x(t), y = y(t) of the system (1) satisfying the initial 
conditions 


X(to) = Xo, y(to) = yo. (2) 


The solution is defined in some time interval / that contains the point tọ. 
Frequently, we will write the initial value problem (1), (2) in the vector form 


= =f), x(t) =x’, (3) 
where x = (x, y)", f(x) = (F(x, y), G(x, y))", andx? = (xo, yo)”. In this case the solution 
is expressed as x = (x(t), y(1))7. As usual, we interpret a solution x = x(t) as a curve traced 
by a moving point in the xy-plane, the phase plane. 

Observe that the functions F and G in equations (1) do not depend on the independent 
variable t, but only on the dependent variables x and y. A system with this property is said to 
be autonomous. The system 


x’ = Ax, (4) 


where A is a constant 2 x 2 matrix, is a simple example of a two-dimensional autonomous 
system. On the other hand, if one or more of the elements of the coefficient matrix A is a 
function of the independent variable t, then the system is nonautonomous. The distinction 
between autonomous and nonautonomous systems is important, because the geometric 
qualitative analysis in Section 9.1 can be effectively extended to two-dimensional autonomous 
systems in general but is not nearly as useful for nonautonomous systems. 

In particular, the autonomous system (1) has an associated direction field that is 
independent of time. Consequently, there is only one trajectory passing through each point 
(Xo, yo) in the phase plane. In other words, all solutions that satisfy an initial condition of 
the form (2) lie on the same trajectory, regardless of the time tọ at which they pass through 
(xo, yo). Thus, just as for the constant coefficient linear system (4), a single phase portrait 
simultaneously displays important qualitative information about all solutions of the system 
(1). We will see this fact confirmed repeatedly in this chapter. 

Autonomous systems occur frequently in applications. Physically, an autonomous system 
is one whose configuration, including physical parameters and external forces or effects, is 
independent of time. The response of the system to given initial conditions is then independent 
of the time at which the conditions are imposed. 


Stability and Instability. The concepts of stability, asymptotic stability, and instability 
have already been mentioned several times in this book. It is now time to give a precise 
mathematical definition of these concepts, at least for autonomous systems of the form 


x’ = f(x). (5) 


In the following definitions, and elsewhere, we use the notation ||x|| to designate the length, 
or magnitude, of the vector x. 

The points, if any, where f(x) = 0 are called critical points of the autonomous system (5). 
At such points, x’ = 0 also, so critical points correspond to constant, or equilibrium, solutions 
of the system of differential equations. A critical point x° of the system (5) is said to be stable 
if, given any e > 0, there isaé > 0 such that every solution x = x(t) of the system (1) which 
at f = O satisfies 


[xo -xl < 5 © 
both exists for all positive t and satisfies 
x) - xl] < « (7) 


for allt > 0. This is illustrated geometrically in Figures 9.2. 1a and 9.2.15. These mathematical 
statements say that all solutions that start “sufficiently close” (i.e., within the distance 6) to 
x? stay “close” (within the distance c) to x. Note that in Figure 9.2. la the trajectory is within 
the circle [x — x? | = ó at t = 0 and, although it soon passes outside of this circle, it remains 


within the circle |x — x? | = e for all t > 0. However, the trajectory of the solution does not 


have to approach the critical point x° as £ — oo; it only has to remain within the circle of 
radius e, as illustrated in Figure 9.2.15. A critical point that is not stable is said to be unstable. 
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(a) (b) 
Graphical representation of trajectories that exhibit 


(a) asymptotic stability and (5) stability. 


A critical point x? is said to be asymptotically stable if, in addition to being stable, there 
exists a óo (69 > 0) such that if a solution x = x(t) satisfies 


Ix(0) — x? || < ôo, (8) 
then 
lim x(t) = x. (9) 
t— oo 


Thus trajectories that start “sufficiently close" to x? not only must stay "close" to x? but also 
must eventually approach x? as t — oo. This is the case for the trajectory in Figure 9.2.1a, 
but not for the one in Figure 9.2.15. Note that asymptotic stability is a stronger property than 
stability, since a critical point must be stable before we can even consider whether it might be 
asymptotically stable. On the other hand, the limit condition (9), which is an essential feature 
of asymptotic stability, does not by itself imply even ordinary stability. Indeed, examples can 
be constructed in which all the trajectories approach x? as t — oo, but for which x is not a 
stable critical point. Geometrically, all that is needed is a family of trajectories having members 
that start arbitrarily close to x? and then depart an arbitrarily large distance before eventually 
approaching x^ as t — oo. 

In this chapter we are concentrating on systems of two equations, but the definitions just 
given are independent of the size of the system. If you interpret the vectors in equations (5) 
through (9) as n-dimensional, then the definitions of stability, asymptotic stability, and 
instability apply also to systems of n equations. The concepts expressed in these definitions 
can be seen more clearly by interpreting them in terms of a specific physical problem. 


The Oscillating Pendulum. The concepts of asymptotic stability, stability, and instability 
can be easily visualized in terms of an oscillating pendulum. Consider the configuration shown 
in Figure 9.2.2, in which a mass m is attached to one end of a rigid, but weightless, rod of length 
L. The other end of the rod is supported at the origin O, and the rod is free to rotate in the plane 
of the paper. The position of the pendulum is described by the angle 0 between the rod and 
the downward vertical direction, with the counterclockwise direction taken as positive. The 
gravitational force mg acts downward, while the damping force c|d0/dt|, where c is positive, is 
always opposite to the direction of motion. We assume that both 0 and d0/dt are positive. The 
equation of motion can be quickly derived from the principle of angular momentum, which 
states that the time rate of change of angular momentum about any point is equal to the moment 
of the resultant force about that point. The angular momentum about the origin is mL7(d0/dt), 
so the governing equation is 


ga dé . 
mL dà = B mgL sin. (10) 


The factors L and L sin 0 on the right-hand side of equation (10) are the moment arms of 
the resistive force and of the gravitational force, respectively; the minus signs are due to the 
fact that the two forces tend to make the pendulum rotate in the clockwise (negative) direction. 
You should verify, as an exercise, that the same equation is obtained for the other three possible 
sign combinations of 0 and d0/dt. 
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L im 0) 


| 


| FIGURE 9.2.2 | An oscillating pendulum. 


By straightforward algebraic operations, we can write equation (10) in the standard form 


do c d) g 
inf = 0, 11 
aj wae rp gu 


or 


d?0 d0 


qu ug; sno =0, (12) 


c : : ; 
where y = — and w? = > To convert equation (12) to a system of two first-order equations, 
m 


we let x = 0 and y = d0/dt; then 


d d 
: ay = —w?° sinx — yy. (13) 


dt 7 ow 


Since y and w ? are constants, the system (13) is an autonomous system of the form (1). 
The critical points of equations (13) are found by solving the equations 


y=0, —w’sinx — yy — 0. 


We obtain y = Oand x = n7 , where n is an integer. These points correspond to two physical 
equilibrium positions, one with the mass directly below the point of support (0 = 0) and the 
other with the mass directly above the point of support (0 = m). Our intuition suggests that 
the first is stable and the second is unstable. 

More precisely, if the mass is slightly displaced from the lower equilibrium position, it 
will oscillate back and forth with gradually decreasing amplitude, eventually converging to 
the equilibrium position as the initial potential energy is dissipated by the damping force. This 
type of motion illustrates asymptotic stability and is shown in Figure 9.2.3a. 


(b) (c) 


Qualitative motion of a pendulum. 
(a) With air resistance. (b) With or without air resistance. 
(c) Without air resistance. 
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On the other hand, if the mass is slightly displaced from the upper equilibrium position, 
it will rapidly fall, under the influence of gravity, and will ultimately converge to the lower 
equilibrium position in this case also. This type of motion illustrates instability. See Figure 
9.2.3b. In practice, it is impossible to maintain the pendulum in its upward equilibrium position 
for any extended length of time without an external constraint mechanism, since the slightest 
perturbation will cause the mass to fall. 

Finally, consider the ideal situation in which the damping coefficient c (or y) is zero. 
In this case, if the mass is displaced slightly from its lower equilibrium position, it will 
oscillate indefinitely with constant amplitude about the equilibrium position. Since there is 
no dissipation in the system, the mass will remain near the equilibrium position but will not 
approach it asymptotically. This type of motion is stable but not asymptotically stable, as 
indicated in Figure 9.2.3c. In general, this motion is impossible to achieve experimentally, 
because the slightest degree of air resistance or friction at the point of support will eventually 
cause the pendulum to converge to its rest position. 

Solutions of the pendulum equations are discussed in more detail in the next section. 


The Importance of Critical Points. Critical points correspond to equilibrium solutions — 
that is, solutions for which x(t) and y(t) are constant. For such a solution, the system 
described by x and y is not changing; it remains in its initial state forever. It might 
seem reasonable to conclude that such points are not very interesting. Recall, however, that 
for linear homogeneous systems with constant coefficients, x' — Ax, the nature of the critical 
point at the origin determines to a large extent the behavior of trajectories throughout the 
xy-plane. 

For nonlinear autonomous systems this is no longer true, for at least two reasons. First, 
there may be several, or many, critical points that are competing, so to speak, for influence on 
the trajectories. Second, the nonlinearities in the system are also of great importance, especially 
far away from the critical points. Nevertheless, critical points of nonlinear autonomous systems 
can be classified just as for linear systems. We will discuss this in detail in Section 9.3. 
Here we illustrate how it can be done graphically, assuming that you have software that 
can construct direction fields and perhaps plot good numerical approximations to a few 
trajectories. 


EXAMPLE 1 


Consider the system 


Spee ko 
= X y X y). di 


=x(2 + y). (14) 


Find the critical points for this system, and draw direction fields on rectangles containing the critical 
points. By inspecting the direction fields, classify each critical point as to type, and state whether it 
is asymptotically stable, stable, or unstable. 


Solution: 


The critical points are found by solving the algebraic equations 
(x-—y)(l-x-y)=0, x(2+y) =0. (15) 


One way to satisfy the second equation is by choosing x = 0. Then the first equation becomes 
y(1— y) = 0, so y = Oor y = 1. More solutions can be found by choosing y = —2 in the second 
equation. Then the first equation becomes (x + 2)(3 — x) = 0, so x = —2 or x = 3. Thus we have 
obtained the four critical points (0, 0), (0, 1), ( —2, —2), and (3, —2). 

Figure 9.2.4 shows a direction field containing the first two of the critical points. Comparing 
this figure with those in Section 9.1 and in Chapter 7 should make it clear that the origin is a saddle 
point and that (0, 1) is a spiral point (actually, a spiral sink). Of course, the saddle point is unstable. 
The trajectories near the spiral point appear to be approaching this point, so we conclude that it is 
asymptotically stable. 
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Direction field for the system 
(14) containing the critical points (0,0) and 
(0, 1); the former is a saddle point and the latter 
is a spiral sink. 


Direction field for the system 
(14) containing the critical points (—2, —2) and 
(3, —2); the former is a nodal sink and the latter 
is a nodal source. 


A direction field for the other two critical points is shown in Figure 9.2.5. Each of these points 
is a node. The arrows point toward the point (—2, —2) and away from the point (3, —2); thus the 
former is asymptotically stable (a sink) and the latter is unstable (a source). 


For a two-dimensional autonomous system with at least one asymptotically stable critical 
point, it is often of interest to determine where in the phase plane the trajectories lie that 
ultimately approach a given critical point. Let P be a point in the x y-plane with the property 
that a trajectory passing through P ultimately approaches the critical point as £ — oo. Then 
this trajectory is said to be attracted by the critical point. Further, the set of all such points P 
is called the basin of attraction or the region of asymptotic stability of the critical point. 
A trajectory that bounds a basin of attraction is called a separatrix because it separates 
trajectories that approach a particular critical point from other trajectories that do not do so. 
Determination of basins of attraction is important in understanding the large-scale behavior of 
the solutions of an autonomous system. 


EXAMPLE 2 


Consider again the system (14) from Example 1. Describe the basin of attraction for each of the 
asymptotically stable critical points. 


Solution: 


Figure 9.2.6 shows a phase portrait for this system with a direction field in the background. Observe 
that there are two trajectories that approach the saddle point at the origin as t — oo. One of these 
lies in the fourth quadrant and is almost a straight line from the unstable node at (3, —2). The other 
approaches the saddle point from the second quadrant, and if we follow it backward in time we 
find that it loops around the spiral point and ultimately approaches the unstable node (3, —2) as 
t — —oo. These two trajectories are separatrices; the region between them (but not including the 
separatrices themselves) is the basin of attraction for the spiral point at (0, 1). This region is shaded 
in Figure 9.2.6. 
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Direction field, trajectories, and critical 


points of the system (14). The separatrices are shown in 
black. The basis of attraction for the spiral point (0, 1) is 
shaded. 


The basin of attraction for the asymptotically stable node at (—2, —2) consists of the rest of the 
xy-plane, with only a handful of exceptions. The separatrices approach the saddle point, as we have 
noted already, rather than the node. The saddle point itself and the unstable node are equilibrium 
solutions and thus remain fixed for all time. Finally, there is a trajectory lying on the line y = —2 for 
x > 3 on which the direction of motion is always to the right; this trajectory also does not approach 
the point (—2, —2). 


Figures 9.2.4, 9.2.5, and 9.2.6 show that in the immediate vicinity of a critical point, the 
direction field and pattern of trajectories resemble those for a linear system with constant 
coefficients. This becomes even more unmistakable if you use a graphing utility to zoom in 
closer and closer to a critical point. Thus we have visual evidence that a nonlinear system 
behaves very much like a linear system, at least in the neighborhood of a critical point. We 
will pursue this idea in the next section. 


Determination of Trajectories. The trajectories of a two-dimensional autonomous 
system 


x dy 
— = F » T — = G 7 16 
di (x, y) di (x, y) (16) 
can sometimes be found by solving a related first-order differential equation. From 
equations (16) we have 
dy  dy/dt _ G(x, y) 
dx  dx/dt | F(x,y) 


(17) 


which is a first-order differential equation in the variables x and y. Observe that such a 
reduction is not usually possible if F and G depend also on f. If equation (17) can be solved 
by any of the methods of Chapter 2, and if we write solutions (implicitly) in the form 


H(x, y) =c, (18) 


then equation (18) is an equation for the family of trajectories of the system (16). In other 
words, each trajectory lies on a level curve of H(x, y). Keep in mind that there is no general 
way of solving equation (17) to obtain the function H, so this approach is applicable only in 
special cases. 
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EXAMPLE 3 


Find the trajectories of the autonomous system 


dx dy 
— = y, — — Me (19) 
dt dt 
Solution: 
In this case, equation (17) becomes 
d x 
es (20) 
dx y 
This equation is separable since it can be written as 
ydy —xdx, 
and its solutions are given by 
H(x,y) =y - x? =c, (21) 


where c is arbitrary. Therefore, the trajectories of the system (19) are the hyperbolas shown in 
Figure 9.2.7. The two black trajectories correspond to the two solutions with H(x, y) = 0, that 
is, y = x (solid) and y = —x (dashed). The direction of motion on the trajectories can be inferred 
from the fact that both dx /dt and dy /dt are positive in the first quadrant. The only critical point is 
the saddle point at the origin. 
Another way to obtain the trajectories is to solve the system (19) by the methods of Section 7.5. 
We omit the details, but the result is 
x-—ce toe, y-ce -oe*. 


Eliminating t between these two equations again leads to equation (21). 


AUIII Direction field and trajectories of the system (19); the origin is a 


saddle point. 
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EXAMPLE 4 


Find the trajectories of the system 


dx dy 2 
q 4-29 = 12 — 3x", (22) 


Solution: 
From the equations 
4—2y20, 12-3x?20 


we find that the critical points of the system (22) are the points ( —2, 2) and (2, 2). To determine the 
trajectories, note that for this system, equation (17) becomes 


dy 12—-3x? 
— = ————. Q3) 
dx 4 —2y 
Separating the variables in equation (23) and integrating, we find that solutions satisfy 
H(x, y) 24y - y? - 12x +x? =c, Q4) 


where c is an arbitrary constant. A computer plotting routine is helpful in displaying the level curves 
of H(x, y), some of which are shown in Figure 9.2.8. The direction of motion on the trajectories can 
be determined by drawing a direction field for the system (22), or by evaluating dx /dt and dy /dt 
at one or two selected points. From Figure 9.2.8 you can see that the critical point (2, 2) is a saddle 
point and the critical point ( —2, 2) is a center. Observe that there is a separatrix (shown in black) 
that leaves the saddle point (as £ — —oo), loops around the center, and returns to the saddle point 
(as t — +00). Within the separatrix are closed trajectories, or periodic solutions, that surround the 
center. Outside the separatrix, trajectories become unbounded, except for the (red) trajectory that 
enters the saddle point from the right. 


Trajectories of the system (22). The point ( —2, 2) isa center, 
and the point (2, 2) is a saddle point. The black curve is a separatrix. 


Problems 


In each of Problems 1 through 3, use an appropriate graphing device Q 2. dx/dt 2 —x, dy/dt 22y; 
to draw the direction field and sketch the trajectories corresponding to x(0) =4, y(0) =2 and x(0) 24, y(0) 20 
the solution satisfying the specified initial conditions, and indicate the Ü 3. dxjdt=ay, dyjdt=-bx, a>0, 2949 
direction of motion for increasing t. 
x(0) = Ja, y(0) 20 


Q 1. dx/adt2 —, dy/dt 2—2y; x(0)=4, y(0 22 


For each of the systems in Problems 4 through 13: 


a. Find all the critical points (equilibrium solutions). 

Q b. Use an appropriate graphing device to draw a direction 
field and phase portrait for the system. 

c. From the plot(s) in part b, determine whether each critical 
point is asymptotically stable, stable, or unstable, and classify it 
as to type. 


d. Describe the basin of attraction for each asymptotically stable 
critical point. 


dx/dt =x — xy, dy/dt = y + 2xy 

dx/dt =1+2y, dy/dt = 1 -— 3x? 

dx/dt 22x — x? — xy, dy/dt = 3y — 2y? — 3xy 
dx/dt = =(2+ y)\(x +y), dy/dt = —y(1—x) 

dx/dt = y(2— x — y), dy/dt = —x — y —2xy 

dx /dt =(2+x)(y—x), dy/dt = y(2 +x — x?) 
10. dx/dt =(24+x)(y—x), dy/dt =(4—x)(y +x) 
11. dx/dt 2(2—x)(y- x), dy/dt = y(2—x — x?) 
12. dx/dt 2x(2—x-—y), dy/dt = —x + 3y —2xy 
13. dx/dt 2x(2—x —y), dy/dt 2(1— y) (24x) 

In each of Problems 14 through 20: 


a. Find an equation of the form H(x, y) = c satisfied by the 
trajectories. 
Ga b. Plot several level curves of the function H. These are 


trajectories of the given system. Indicate the direction of motion 
on each trajectory. 


een ans 


14. dx/dt=2y, dy/dt = 8x 
15. dx/dt=2y, dy/dt = —8x 
16. dx/dt=y, dy/dt=2x+y 


17. dx/dt=—-x+y, dy/dt = —x — y 
18. dx/dt 22x?y — 3x? — 4y, dy/dt = —2xy? + 6xy 


19. dx/dt =y,  dy/dt — —sinx | (undamped pendulum) 
d. d j 
20. = =y, = =—x + (=) (Duffing's? equation) 


?Georg Duffing (1861-1944), a German engineer, was a pioneer in the study 
of the oscillations of nonlinear mechanical systems. His most important work 
was the influential monograph Erzwungene Schwingungen bei verdnderlicher 
Eigenfrequenz und ihre technische Bedeutung [Forced Oscillations with 
Variable Natural Frequency and Their Technical Meaning], published in 1918. 


93 Locally Linear Systems 


9.3 Locally Linear Systems 407 


21. Given that x = $(1t), y = v (t) isa solution of the autonomous 
system 
dx F( ) 
— = F(x, y), 
dt i 


fora < t < ß, show that 


x= O(t)=¢(t-s), 


d 
= = G(x, y) 


yc W(t)—wv(t—s) 


is a solution fora +s < t < 8 + s for any real number s. 
22. Prove that for the system 


dx d 
D F(x,y), - 


dt OP 


dt 
there is at most one trajectory passing through a given point ( xo, yo). 
Hint: Let Co be the trajectory generated by the solution x = ġọ(t), 
y = wWo(t), with do(to) = xo. Wo(to) = yo. and let C, be the 
trajectory generated by the solution x = @,(t), y = Y(t), with 
Q1(1j) = xo, V1 (t1) = yo. Use the fact that the system is autonomous, 
and also the existence and uniqueness theorem, to show that C, and 
C; are the same. 


23. Prove that if a trajectory starts at a noncritical point of the system 


dx F dy G 

ar (x, y), a (x, y), 
then it cannot reach a critical point (xo, yo) in a finite length of time. 
Hint: Assume the contrary; that is, assume that the solution x = ¢ (t), 
y = W(t) satisfies (a) = x9, v(a) = yo. Then use the fact that 
X = xg, y = yo İs a solution of the given system satisfying the initial 
condition x = xo, y = yo at t =a. 


24. Assuming that the trajectory corresponding to a solution 
x=¢ġ(t), y = W(t), —oo < t < oo, of an autonomous system 
is closed, show that the solution is periodic. 

Hint: Since the trajectory is closed, there exists at least one point 
(xo. yo) such that (to) = xo, v(tg) = yo and a number 
T > 0 such that ó(to + T) = xo, v(to + T) = yo. Show that 
x — b(ft) 2 ó(t --T) and y = W(t) = w(t + T) is a solution, 
and then use the existence and uniqueness theorem to show that 
(t) = (t) and W(t) = v(t) for all t. 


In Section 9.1 we described the stability properties of the equilibrium solution x — 0 of the 


two-dimensional linear system 


x = Ax. 


(1) 


The results are summarized in Table 9.1.1. Recall that we required that det A Z 0, so x = 0 is 
the only critical point of the system (1). Now that we have defined the concepts of asymptotic 
stability, stability, and instability more precisely, we can restate these results in the following 


theorem. 
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Theorem 9.3.1 


The critical point x = 0 of the linear system (1) 


1. is asymptotically stable if the eigenvalues r;, rz are real and negative or have negative real part; 
2. is stable, but not asymptotically stable, if rı and rz are pure imaginary; and 


3. is unstable if rı and r3 are real and either is positive, or if they have positive real part. 


Effect of Small Perturbations. It is apparent from this theorem or from Table 9.1.1 that the 
eigenvalues rı and r, of the coefficient matrix A determine the type of critical point at x = 0 
and its stability characteristics. In turn, the values of r; and r depend on the coefficients in 
the system (1). When such a system arises in some applied field, the coefficients usually result 
from the measurements of certain physical quantities. Such measurements are often subject to 
small uncertainties, so it is of interest to investigate whether small changes (perturbations) in 
the coefficients can affect the stability or instability of a critical point and/or significantly alter 
the pattern of trajectories. 
Recall that the eigenvalues rı and rz are the roots of the polynomial equation 


det(A — rI) = 0. (2) 


It is possible to show that small perturbations in some or all of the coefficients are reflected in 
small perturbations in the eigenvalues. The most sensitive situation occurs when r; = iu and 
ry = —i u. Then the critical point is a center and the trajectories are closed curves (ellipses) 
surrounding it. If a slight change is made in the coefficients, then the eigenvalues rı and r2 
will take on new values r; = A' + ip’ and r) = A' — ip’, where A' is small in magnitude and 
u’ = u (see Figure 9.3.1). If A” 4 0, which almost always occurs, then the trajectories of the 
perturbed system are spirals rather than ellipses. The system is asymptotically stable if A’ < 0 
but unstable if \’ > 0. Thus, in the case of a center, small perturbations in the coefficients 
may well change a stable system into an unstable one and, in any case, may be expected to 
change the trajectories from ellipses to spirals (see Problem 24). 

Another slightly less sensitive case occurs if the eigenvalues r; and rz are equal; in 
this case the critical point is a node. Small perturbations in the coefficients will normally 
cause the two equal roots to separate (bifurcate). If the separated roots are real, then the 
critical point of the perturbed system remains a node, but if the separated roots are complex 
conjugates, then the critical point becomes a spiral point. These two possibilities are shown 
schematically in Figure 9.3.2. In this case the stability or instability of the system is not affected 
by small perturbations in the coefficients, but the type of the critical point may be changed (see 
Problem 25). 

In all other cases the stability or instability of the system is not changed, nor is the type of 
critical point altered, by sufficiently small perturbations in the coefficients of the system. For 
example, if r; and r, are real, negative, and unequal, then a small change in the coefficients 
will neither change the sign of rı and rz nor allow them to coalesce. Thus the critical point 
remains an asymptotically stable node. 


| FIGURE 9.3.1 | Schematic perturbation of ri =i p, r; = —ip 


into complex conjugate eigenvalues with (a) positive real part 
or (b) negative real part. 


Im(r) I(r) 
A+iw 
rı =r2 
2p 49x m L 
ri-À-iu 


Schematic perturbation of rı =r, into (a) a pair of 
different real eigenvalues or (b) complex conjugate eigenvalues. 


Linear Approximations to Nonlinear Systems. Now let us consider a nonlinear 
autonomous two-dimensional system 


x’ = f(x). (3) 


Our main object is to investigate the behavior of trajectories of the system (3) near a critical 
point x°. Recall that in Example 1 in Section 9.2 we noted that near each critical point of that 
nonlinear system the pattern of trajectories resembles the trajectories of a certain linear system. 
This suggests that near a critical point we may be able to approximate the nonlinear system (3) 
by an appropriate linear system, whose trajectories are easy to describe. The crucial question 
is whether and how we can find an approximating linear system whose trajectories closely 
match those of the nonlinear system near the critical point. 

It is convenient to choose the critical point to be the origin. This involves no loss 
of generality, since if x? + 0, it is always possible to make the substitution u = x — x? 
in equation (3). Then u will satisfy an autonomous system with a critical point at the origin. 

First, let us consider what it means for a nonlinear system (3) to be “close” to a linear 
system (1). Accordingly, suppose that 


X = Ax + g(x) (4) 


and that x = 0 is an isolated critical point of the system (4). This means that there is some 
circle about the origin within which there are no other critical points. In addition, we assume 
that det A Z 0, so that x = 0 is also an isolated critical point of the linear system x’ = Ax. For 
the nonlinear system (4) to be close to the linear system x’ = Ax, we must assume that g(x) 
is small. More precisely, we assume that the components of g have continuous first partial 
derivatives and satisfy the limit condition 


Igo Il 


—^0 as x—> 0; (5) 
IIx|| 


that is, ||g( X) | is small in comparison to ||x|| itself near the origin. Such a system is called a 
locally linear system in the neighborhood of the critical point x = 0. 
It may be helpful to express the condition (5) in scalar form using polar coordinates. If we 
let x = (x, y)”, then [xl = (x? + y?)!/? = r. Similarly, if g(x) = (g1(x, y), go(x, y))7, 
1/2 
then [|g(x) || = (sit. y) + g(x, )) . Then it follows that condition (5) is satisfied if and 
only if 


i stud brane 

EA COSC SEY og, SSO ON) oor oe pO Pe m. 6 
r r 

EXAMPLE 1 


Determine whether the system 


x a 1 0 x + =x? — xy (7) 
y) o 0.5) \y —0.75xy — 0.25y? 


is locally linear in the neighborhood of the origin. 
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Solution: 


Observe that the system (7) is of the form (4), that (0,0) is a critical point, and that 

detA + 0. It is not hard to show that the other critical points of equations (7) are (0, 2), (1, 0), 
11 

and (s :) ; consequently, the origin is an isolated critical point. In checking the condition (6), it 

is convenient to introduce polar coordinates by letting x = r cos0, y =r sin 0. Then 


2 


gi y) —x? — xy _ = cos? 0 — r? sin 0 cos 0 


r r r 


= —r(cos? 0 + sinf cos0) > 0 


asr — 0. In a similar way, you can show that g;(x, y)/r — Oas r — 0. Hence the system (7) is 
locally linear near the origin. 


EXAMPLE 2 
The motion of a pendulum is described by the system (see equation (13) of Section 9.2) 

dx d 

PT 2 = —u? sinx — yy. (8) 
The critical points are (0, 0), (7, 0), (+27, 0), ... , so the origin is an isolated critical point of 


this system. Show that the system is locally linear near the origin. 


Solution: 


To convert equations (8) into the form of system (4), we rewrite the former so that the linear 
and nonlinear terms are clearly identified. If we write sinx = x + (sinx — x) and substitute this 
expression in the second of equations (8), we obtain the equivalent system 


Oe ae. » 


Thus we see that g;(x, y) = 0 and go(x, y) = —w?(sinx — x). From the Taylor series for sin x, 
we know that sinx — x behaves like —x?/3! — —(r? cos? 0) /3! when x is small. Consequently, 
(sinx — x)/r — Oasr — 0. Thus the conditions (6) are satisfied, and the system (9) is locally 
linear near the origin. 


Let us now return to the general nonlinear system (3), which we write in the scalar form 
x= F(x,y), y = G(x, y); (10) 
that is, x = (x, y)? and f(x) = (F(x, y), G(x, y))*. 


Theorem 9.3.2 


The system (10) is locally linear in the neighborhood of a critical point (xo, yo) whenever the 
functions F and G have continuous partial derivatives up to order two. 


To show this, we use Taylor expansions about the point (xo, yo) to write F(x, y) and 
G(x, y) in the form 


F(x, y) = F(xo, yo) + Fx(xo, yo) (x — xo) + Fy(xo. yo) Cy — yo) +7104, y). 
G(x, y) = G(xo, yo) + Gx(xo. yo) (x — xo) + Gy(xo. yo) Cy — Yo) + "x. y). 


1/2 "e 
where. Gc 3) / (Gc — x0)? + O — 992) ^ — 0 as (x,y) > (xo, yo), and similarly 
for 72. Note that F(xo, yo) = G(xo, yo) = 0, and that dx/dt = d(x — xg) /dt and 


dy /dt = d(y — yo) /dt. Then the system (10) reduces to 


d C = R (PO 9. Fy, Yo) C Bj $ ee 3) dis 
dt Xy — yo Gí(xo. yo) — Gy(xo, yo) / Y — Yo nox, y) J? 
or, in vector notation, 


du df >% 
dr a tT, (12) 
where u = (x — xo, y — yo)” and y = (m1, 2)". 

The significance of Theorem 9.3.2 is twofold. First, if the functions F and G are twice 
differentiable, then the system (10) is locally linear, and it is unnecessary to resort to the 
limiting process used in Examples 1 and 2. Second, the linear system that approximates the 
nonlinear system (10) near (xo, yo) is given by the linear part of equations (11) or (12): 


za) 7 es yo) Fy (x0, a) Hs 
dt Nu» G.(xo. yg  Gy(xo. yo) / \u2/’ 
where u; = x — xo and u = y — yo. Equation (13) provides a simple and general method for 


finding the linear system corresponding to a locally linear system near a given critical point. 
The matrix 


(14) 


Fy F. 
J = JIF, Gl, y) -( (x, y) n 


G(x, y) G(x, y) 
which appears as the coefficient matrix in equation (13), is called the Jacobian? matrix of the 


functions F and G with respect to x and y. We need to assume that det J is not zero at (xo, yo) 
so that this point is also an isolated critical point of the linear system (13). 


EXAMPLE 3 


Use equation (13) to find the linear system corresponding to the pendulum equations (8) near the 
origin; near the critical point (7, 0). 
Solution: 
In this case we have, from equation (8), 
F(x,y)=y, G(x, y) =—w? sinx — yy; (15) 


since these functions are differentiable as many times as necessary, the system (8) is locally linear 
near each critical point. The first partial derivatives of F and G are 


F,20, Fy=1, G,--w'cox, Gy,2-w. (16) 


Thus, at the origin the corresponding linear system is 


adela 220 an 


which agrees with equation (9) from Example 2. 
Similarly, evaluating the partial derivatives in equation (16) at (7, 0), we obtain 


zb)" 2 as 


where u = x — T, v = y. This is the linear system corresponding to equations (8) near the point 
(7, 0). 


3Carl Gustav Jacob Jacobi (1804-1851), a German analyst who was professor and lecturer at the Universities of 
Königsberg and Berlin, made important contributions to the theory of elliptic functions. The determinant of J and its 
extension to n functions of n variables is called the Jacobian because of his notable paper in 1841 on the properties of 
this determinant. The corresponding matrix is also named for Jacobi, even though matrices were not developed until 
after his death. 
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We now return to the locally linear system (4). Since the nonlinear term g(x) is small 
compared to the linear term Ax when x is small, it is reasonable to hope that the trajectories of 
the linear system (1) are good approximations to those of the nonlinear system (4), at least near 
the origin. This turns out to be true in many (but not all) cases, as the following theorem states. 


Theorem 9.3.3 


Let rı and r; be the eigenvalues of the linear system (1), x’ = Ax, corresponding to the locally linear 
system (4), x’ = Ax + g(x). Then the type and stability of the critical point (0, 0) of the linear system 
(1) and the locally linear system (4) are as shown in Table 9.3.1. 


Stability and Instability Properties of Linear and Locally Linear 


TABL .3.1 
ABLEO:3 Systems 


Linear System 


Locally Linear System 


Eigenvalues Type Stability Type Stability 
rj a AA N Unstable N Unstable 
re<m<0 N Asymptotically stable N Asymptotically stable 
iy «& OS um SP Unstable SP Unstable 
T, =%>0 PNorIN Unstable NorSpP Unstable 
if Sry < (0) PNorIN Asymptotically stable NorSpP Asymptotically stable 
ryt =à +ip 
A20 SpP Unstable SpP Unstable 
A«0 SpP Asymptotically stable SpP Asymptotically stable 
Ay emm C Stable CorSpP Indeterminate 


Key: N, node; IN, improper node; PN, proper node; SP, saddle point; SpP, spiral point; C, 
center. 


At this stage, the proof of Theorem 9.3.3 is too difficult to give, so we will accept 
the results without proof. The statements for asymptotic stability and instability follow as 
a consequence of a result discussed in Section 9.6, and a proof is sketched in Problems 9 to 
11 of that section. Essentially, Theorem 9.3.3 says that for small x (or x — x°) the nonlinear 
terms are also small and do not affect the stability and type of critical point as determined 
by the linear terms, except in two sensitive cases. If r; and r are pure imaginary, then 
the small nonlinear terms may change the stable center into a spiral point, which may be 
either asymptotically stable or unstable. If r; and rz are real, equal, and positive or real, 
equal, and negative, then the nonlinear terms may change the node into a spiral point, but 
its asymptotic stability or instability remains unchanged. Recall that earlier in this section 
we stated that small perturbations in the coefficients of the linear system (1), and hence in 
the eigenvalues rı and r2, can alter the type and stability of the critical point only in these 
two cases. It is reasonable to expect that the small nonlinear term in equation (4) might 
have a similar substantial effect, at least in these two cases. This is so, but the main significance 
of Theorem 9.3.3 is that in all other cases the small nonlinear term does not alter the type or 
stability of the critical point. The essential understanding to take from this discussion is this: 
except in the two sensitive cases, the type and stability of the critical point of the nonlinear 
system (4) can be determined from a study of the much simpler linear system (1). 

Even if the critical point is of the same type as that of the linear system, the trajectories 
of the locally linear system may be considerably different in appearance from those of the 
corresponding linear system, except very near the critical point. However, it can be shown 
that the slopes at which trajectories “enter” or “leave” the critical point are given correctly by 
the linear system. 


Damped Pendulum. We continue the discussion of the damped pendulum begun in 
Examples 2 and 3. Near the origin the nonlinear equations (8) are approximated by the linear 


system (17), whose eigenvalues are 


—y + / y? — 4w2 
2 x 


Fist = (19) 
The nature of the solutions of equations (8) and (17) depends on the sign of y? — 4w? as 
follows: 


1. If 4? — 4w? > 0, then the eigenvalues are real, unequal, and negative. 
The critical point (0, 0) is an asymptotically stable node of the linear system (17) and of 
the locally linear system (8). 

2. If 4? — 4w? = 0, then the eigenvalues are real, equal, and negative. 
The critical point (0, 0) is an asymptotically stable (proper or improper) node of the linear 
system (17). It may be either an asymptotically stable node or spiral point of the locally 
linear system (8). 

3. If 4? — 4w? < 0, then the eigenvalues are complex with negative real part. 
The critical point (0, 0) is an asymptotically stable spiral point of the linear system (17) 
and of the locally linear system (8). 


Thus the critical point (0, 0) is a spiral point of the system (8) if the damping ~y is small and 
a node if y is large enough. In either case, the origin is asymptotically stable. 

Let us now consider the case y? — 4w? < 0, corresponding to small damping, in 
more detail. The direction of motion on the spirals near (0, 0) can be obtained directly from 
equations (8). Consider the point at which a spiral intersects the positive y-axis (x = 0 and 
y > 0). It follows from equations (8) that at such a point, dx /dt > 0. Thus the point (x, y) 
on the trajectory is moving to the right, so the direction of motion on the spirals is clockwise. 

The behavior of the pendulum near the critical points (-En7, 0), with n even, is the same 
as its behavior near the origin. We expect this on physical grounds since all these critical 
points correspond to the downward equilibrium position of the pendulum. The conclusion can 
be confirmed by repeating the analysis carried out above for the origin. Figure 9.3.3 shows the 
clockwise spirals at a few of these critical points. 


@)— -—©@)-> 


-27 2T 


| FIGURE 9.3.3 | Asymptotically stable spiral points at (+2n7,0) for the 


damped pendulum with small damping, 5? — 4w? < 0. 


Now let us consider the critical point (7,0). Here the nonlinear equations (8) are 
approximated by the linear system (18), whose eigenvalues are 


di Pade 
í | 


Fist = (20) 
One eigenvalue (rı) is positive and the other (r2) is negative. Therefore, regardless of the 
amount of damping, the critical point (x, y) = (7, 0) is an unstable saddle point both of the 
linear system (18) and of the locally linear system (8). 

To examine the behavior of trajectories near the saddle point (7, 0) in more detail, we 
write down the general solution of equations (18), namely, 


u 1 ryt 1 rot 
=C, e! TC, er; (21) 
y ri F5 


where C, and C» are arbitrary constants. Since rj > 0 and rz < 0, it follows that the solution 
that approaches zero as £ — oo corresponds to C, = 0. For this solution v/u = r», so the 
slope of the entering trajectories is negative; one lies in the second quadrant (Cy < 0), and 
the other lies in the fourth quadrant (C5 > 0). For Cz = 0 we obtain the pair of trajectories 
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"exiting" from the saddle point. These trajectories have slope rı > 0; one lies in the first 
quadrant (C, > 0), and the other lies in the third quadrant (C, < 0). 

The situation is the same at other critical points (17,0) with n odd. These all 
correspond to the upward equilibrium position of the pendulum, so we expect them to be 
unstable. The analysis at (7, 0) can be repeated to show that they are saddle points oriented 
in the same way as the one at (7, 0). Diagrams of the trajectories in the neighborhood of two 
saddle points are shown in Figure 9.3.4. 


Unstable saddle points at (+(2n + 1)7,0) for the damped 
pendulum with small damping, y? — 4w? < 0. 


EXAMPLE 4 


The equations of motion of a certain pendulum are 


1 
=y, = —9 sinx 5? (22) 


where x = 0 and y = d@/dt. Draw a phase portrait for this system, and explain how it shows the 
possible motions of the pendulum. 


Solution: 


By plotting the trajectories starting at various initial points in the phase plane, we obtain the phase 
portrait shown in Figure 9.3.5. As we have seen, the critical points (equilibrium solutions) are the 
points (n7, 0), where n = 0, +1, +2, ... . Even values of n, including zero, correspond to the 
downward position of the pendulum, while odd values of n correspond to the upward position. Near 
each of the asymptotically stable critical points, the trajectories are clockwise spirals that represent a 
decaying oscillation about the downward equilibrium position. The wavy horizontal portions of the 
trajectories that occur for larger values of |y| represent whirling motions of the pendulum. Note that 


ya 


10 — 


RY 
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Phase portrait for the damped pendulum of Example 4. 
The shaded region is the basin of attraction for (0, 0) . 
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a whirling motion cannot continue indefinitely, no matter how large |y| is; eventually, the angular 
velocity is so much reduced by the damping term that the pendulum can no longer go over the top 


and, instead, begins to oscillate about its downward position. 


The basin of attraction for the origin is the shaded region in Figure 9.3.5. It is bounded by the 
(black) trajectories that enter the two adjacent saddle points at (7, 0) and (—7, 0). The bounding 
trajectories are separatrices. Each asymptotically stable critical point has its own basin of attraction, 
which is bounded by the separatrices entering the two neighboring saddle points. All of the basins of 
attraction are congruent to the shaded one; the only difference is that they are translated horizontally 
by integer multiples of 27. Note that it is mathematically possible (but physically unrealizable) to 
choose initial conditions exactly on a separatrix so that the resulting motion leads to a balanced 


pendulum in a vertically upward position of unstable equilibrium. 


An important difference between nonlinear autonomous systems and the linear systems 
discussed in Section 9.1 is illustrated by the pendulum equations. Recall that the linear system 
(1) has only the single critical point x = 0 if det A Æ 0. Thus, if the origin is asymptotically 
stable, then not only do trajectories that start close to the origin approach it, but, in fact, 
every trajectory approaches the origin. In this case the critical point x = 0 is said to be 
globally asymptotically stable. This property of linear systems is not, in general, true for 
nonlinear systems, even if the nonlinear system has only one asymptotically stable critical 
point. Therefore, for nonlinear systems, it is important to determine (or to estimate) the basin 


of attraction for each asymptotically stable critical point. 


Problems 


In each of Problems 1 through 3, verify that (0, 0) is a critical point, 
show that the system is locally linear, and discuss the type and stability 
of the critical point (0,0) by examining the corresponding linear 
system. 


1. dx/dt=x-—y?, dyjdt =x —2y +x? 
2. dx/dt =(1 +x) sin y, 
3. dx/dt 2x + y?, 


dy/dt = 1 — x — cos y 
dy/dt =x +y 


In each of Problems 4 through 15: 

a. Determine all critical points of the given system of equations. 
b. Find the corresponding linear system near each critical point. 
c. Find the eigenvalues of each linear system. What conclusions 
can you then draw about the nonlinear system? 

Q d. Draw a phase portrait of the nonlinear system to confirm 
your conclusions, or to extend them in those cases where the 
linear system does not provide definite information about the 
nonlinear system. 


4. dx/dt=(2+x)(y—-x), dy/dt=(4-x)(y +x) 
5. dx/dt 2x —x?— xy, dy/dt 2 3y — xy - 2y? 

6. dx/dt21—y, dyjdt =x — y? 

7. dx/dt=(2+y)(2y—x), dy/dt = (2 — x)(2y + x) 
8. dx/dt 2x x? y, dy/dt=y—xy 

9. dx/dt=(1+x)siny,  dy/dt = 1 — x — cos y 

10. dx/dt2x—y^ dyjdt =y- x? 

11. dx/dt 21—xy, dyjdt =x- y’? 


12. dx /dt = —2x — y - x (x? + y?). 
dy/dt =x y o y(à e y?) 
13. dx [dt = y - x(1—x? — y?), dy/dt = =x + y(1—x? — y?) 
14. dx/dt 24— y?, dy/dt 2 (1.5 - x)(y — x) 
15. dx/dt 2(1—y)(2x —y), dy/dt — (2+ x)(x —2y) 


16. Consider the autonomous system 
dx dy 


— =y, =x 42x. 
4 cu Ur 


a. Show that the critical point (0, 0) is a saddle point. 
b. Sketch the trajectories for the corresponding linear system by 
integrating the equation for dy/dx. Show from the parametric 
form of the solution that the only trajectory on which x — 0, 
y 0ast > oois y = —x. 
C. Determine the trajectories for the nonlinear system by 
integrating the equation for dy /dx. Sketch the trajectories for the 
nonlinear system that correspond to y — —x and y — x for the 
linear system. 
17. Consider the autonomous system 
dx dy 


— =x, =-2 a 
di. Uam TE 


a. Show that the critical point (0, 0) is a saddle point. 

b. Sketch the trajectories for the corresponding linear system, 
and show that the trajectory for which x > 0, y > 0 as t > co 
is given by x = 0. 

C. Determine the trajectories for the nonlinear system for x 4 0 
by integrating the equation for dy/dx. Show that the trajectory 
corresponding to x = O for the linear system is unaltered, but 
that the one corresponding to y = 0 is y = x?/5. Sketch several 
of the trajectories for the nonlinear system. 


18. The equation of motion of an undamped pendulum is 
d?0/dt? + w? sinb = 0, where w? = g/L. Let x = 0, y = dO/dt to 
obtain the system of equations 

dx dy 


—— = 2 
de di 


= —u*sinx. 


a. Show that the critical points are (+77, 0), n =0,1,2,..., 
and that the system is locally linear in the neighborhood of each 
critical point. 
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b. Show that the critical point (0, 0) is a (stable) center of the 
corresponding linear system. Using Theorem 9.3.3, what can you 
say about the nonlinear system? The situation is similar at the 
critical points (2-2n7, 0), n = 1,2,3, ... . What is the physical 
interpretation of these critical points? 

c. Show that the critical point (7,0) is an (unstable) 
saddle point of the corresponding linear system. What 
conclusion can you draw about the nonlinear system? The 
situation is similar at the critical points (4(2n — 1)7,0), n = 
1,2, 3, ... . What is the physical interpretation of these critical 
points? 

O d. Choose a value for w? and plot a few trajectories of the 
nonlinear system in the neighborhood of the origin. Can you now 
draw any further conclusion about the nature of the critical point 
at (0, 0) for the nonlinear system? 

Q e. Using the value of w? from part d, draw a phase portrait 
for the pendulum. Compare your plot with Figure 9.3.5 for the 
damped pendulum. 


19. a. By solving the equation for dy /dx, show that the equation of 
the trajectories of the undamped pendulum of Problem 18 can be 
written as 


la 2 
2" +w*(1 — cosx) =c, (23) 


where c is a constant of integration. 
b. Multiply equation (23) by mL?. Then express the result in 
terms of 0 to obtain 


E L? a ja L(1 0)—-E (24) 
7” a mg — COS —L, 


where E = mL?c. 

C. Show that the first term in equation (24) is the kinetic energy 
of the pendulum and that the second term is the potential energy 
due to gravity. Thus the total energy E of the pendulum is 
constant along any trajectory; its value is determined by the initial 
conditions. 


20. The motion of a certain undamped pendulum is described by the 
equations 
dx dy 
di 7 qr 
If the pendulum is set in motion with an angular displacement A and 
no initial velocity, then the initial conditions are x(0) = A, y(0) = 0. 
Q a. Let A — 0.25 and plot x versus t. From the graph, estimate 
the amplitude R and period T of the resulting motion of the 
pendulum. 
Q b. Repeat part a for A — 0.5, 1.0, 1.5, and 2.0. 
Q c. How do the amplitude and period of the pendulum's 
motion depend on the initial position A? Draw a graph to show 
each of these relationships. Can you say anything about the 
limiting value of the period as A — 0? 
Q d. Let A — 4 and plot x versus t. Explain why this graph 
differs from those in parts a and b. For what value of A does the 
transition take place? 


= —4sinx. 


21. Consider again the pendulum equations (see Problem 20) 
dx B dy 
dt 7 qr 

If the pendulum is set in motion from its downward equilibrium 

position with angular velocity v, then the initial conditions are 

x(0) = 0, y(0) =v. 

Q a. Plot x versus t for v = 2 and also for v = 5. Explain 


the differing motions of the pendulum that these two graphs 
represent. 


= —4sinx. 


b. There is a critical value of v, which we denote by v,, such that 
one type of motion occurs for v < ve and the other for v > ve. 
Estimate the value of ve. 


22. This problem extends Problem 21 to a damped pendulum. The 
equations of motion are 
dx dy 
dt’ qr 
where y is the damping coefficient, with the initial conditions 
x(0) 20, y(0) = v. 
O a. For y = 0.25, plot x versus t for v = 2 and for v = 5. 
Explain these plots in terms of the motions of the pendulum 
that they represent. Also explain how they are related to the 
corresponding graphs in Problem 21a. 
b. Estimate the critical value v, of the initial velocity where the 
transition from one type of motion to the other occurs. 
C. Repeat part b for other values of y and determine how ve 
depends on y. 


4sin x — yy, 


23. Theorem 9.3.3 provides no information about the stability of a 
critical point of a locally linear system if that point is a center of the 
corresponding linear system. That this must be the case is illustrated 
by the system 


dx 2 2 
ar cog ee Ty. 
T (25) 
LE 2 2 
= x c ay(x^ t y?) 
and 
dx 
dr =y—x(x? +y’), 
à Q6) 
dy, — —x — y(x? + y?) 
dt — i 


where o is a real-valued constant. 
a. Show that, for all values of a, (0,0) is a critical point of 
system (25) and, furthermore, is a center of the corresponding 
linear system. 
b. Show that, for all values of a, system (25) is locally linear. 
c. Letr? = x*+y?, and note that x dx /dt+y dy /dt = r dr jdt. 
Show that dr /dt = ar?. 
d. Show that, for any a < 0, r decreases to 0 as t — oo; hence 
the critical point is asymptotically stable. 
e. Show that, for any œa > 0, the solution of the initial value 
problem for r with r = rg att = 0 becomes unbounded 


as f increases towards 1/ (2a ro ), and hence the critical point is 
unstable. 


24. In this problem we show how small changes in the coefficients 
of a system of linear equations can affect a critical point that is a center. 
Consider the system 


Show that the eigenvalues are +i so that (0,0) is a center. Now 
consider the system 


where |e| is arbitrarily small. Show that the eigenvalues are e + i. Thus 
no matter how small |e| 4 0 is, the center becomes a spiral point. If 
€ < O, the spiral point is asymptotically stable; if € > 0, the spiral 
point is unstable. 


25. In this problem we show how small changes in the coefficients 
of a system of linear equations can affect the nature of a critical point 
when the eigenvalues are equal. Consider the system 


` [f~ 4 
x = X. 
0 -1 


Show that the eigenvalues are r} = —1, r2 = —1 so that the critical 
point (0, 0) is an asymptotically stable node. Now consider the system 


: (= ) 
x= X, 
—€ EI 


where |e| is arbitrarily small. Show that if € > 0, then the eigenvalues 
are —1 + ie, so that the asymptotically stable node becomes an 
asymptotically stable spiral point. If € < 0, then the roots are 
—1+./Je], and the critical point remains an asymptotically stable 
node. 


26. In this problem we derive a formula for the natural period of 
an undamped nonlinear pendulum (equation (10) with c = O in 
Section 9.2). Suppose that the bob is pulled through a positive angle 
a and then released with zero velocity. 
a. We usually think of 0 and d@/dt as functions of t. However, 
if we reverse the roles of t and 0, we can regard t as a function 
of 0 and, consequently, can also think of d0/dt as a function of 
0. Then derive the following sequence of equations: 


1 .,dífd0M 
mL‘ = 
2° d0 \\ at 


do \* 
-m ies = mgL(cos@ — cosa), 


mgL sin, 


iL dO 
dt = : 
28 ,/cos@ — cosa 


Why was the negative square root chosen in the last equation? 
b. If T is the natural period of oscillation, derive the formula 


p aa do 
4 28 Ja V/cosÓ — cosa 
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€. Use the identities  cos@ = 1 — 2 sin? (4) and 
cosa = 1 -— 2sin? (+). followed by the change of variable 
sin(4) = ksin ó with k = sin(4). show that 


7/2 

[L d 

T=4 n $ ; 
$J0  |/1—K^sin ó 


The integral is called the elliptic integral of the first kind. Note 
that the period depends on the ratio L/ and also on the initial 
displacement a through k = sin(a /2). 

O d. By evaluating the integral in the expression for T, obtain 
values for T that you can compare with the graphical estimates 
you obtained in Problem 20. 


27. A generalization of the damped pendulum equation discussed in 
the text, or a damped spring-mass system, is the Liénard^ equation 
d?x dx 
T tex) + a(x) =0. 
If c(x) is a constant and g(x) = kx, then this equation has the form of 
the linear pendulum equation (compare with equation (12) of Section 
9.2 with sin 0 replaced by its linear approximation, 0); otherwise, the 
damping force c(x) dx /dt and the restoring force g(x) are nonlinear. 
Assume that c is continuously differentiable, g is twice continuously 
differentiable, and g(0) = 0. 
a. Write the Liénard equation as a system of two first-order 
equations by introducing the variable y = dx /dt. 
b. Show that (0,0) is a critical point and that the system is 
locally linear in the neighborhood of (0, 0). 
€. Show that ifc(0) > Oand g’(0) > 0, then the critical point is 
asymptotically stable, and that if c(0) < Oor g/(0) < 0, then the 
critical point is unstable. Hint: Use Taylor series to approximate 
c and g in the neighborhood of x = 0. 


4 Alfred-Marie Liénard (1869-1958), a French physicist and engineer, was 
professor at l'École des Mines in Paris. He worked primarily in electricity, 
mechanics, and applied mathematics. The results of his investigation of this 
differential equation were published in 1928. 


In this section and the next, we explore the application of phase plane analysis to some 
problems in population dynamics. These problems involve two interacting populations and are 
extensions of those discussed in Section 2.5, which dealt with a single population. Although 
the equations discussed here are extremely simple compared to the very complex relationships 
that exist in nature, it is still possible to acquire some insight into ecological principles from 
a study of these model problems. The same, or similar, models have also been used to study 
other types of competitive situations — for instance, businesses competing in the same market. 

Suppose that in some closed environment there are two similar species competing for a 
limited food supply —for example, two species of fish in a pond that do not prey on each other 
but do compete for the available food. Let x and y be the populations of the two species at 
time f. As discussed in Section 2.5, we assume that the population of each of the species, in 


the absence of the other, is governed by a logistic equation. Thus 


dx 
dt 
dy 
dt 


= x(e1— 21x), 


= y(€2 — 027), 


(1) 
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respectively, where €; and € are the growth rates of the two populations, and e; / o ; and €2/ 02 
are their saturation levels. However, when both species are present, each will tend to diminish 
the available food supply for the other. In effect, they reduce each other's growth rates and 
saturation populations. The simplest expression for reducing the growth rate of species x due 
to the presence of species y is to replace the growth rate factor e, — o x in the first of equations 
(1) by €; — 81x — ay, where a, is a measure of the degree to which species y interferes with 
species x. Similarly, in the second of equations (1) we replace e? — a2y by €2 — 02y — ax. 
Thus we have the system of equations 


dx 

um —x(e1— 04x — ay), 

dy (2) 
arp y(€2 — 02y — ax). 


The values of the positive constants €41, 0 1, 0,1, €2, 02, and œ, depend on the particular species 
under consideration and, in general, must be determined from observations. We are interested 
in solutions of equations (2) for which x and y are nonnegative. In the following two examples 
we discuss two typical problems in some detail. At the end of the section we return to the 
general equations (2). 


EXAMPLE 1 


Discuss the qualitative behavior of solutions of the system 


dx _ a ) 
a Ms 
(3) 
y Pe Ay — 2x). 
dt 4 
Solution: 
We find the critical points by solving the system of algebraic equations 
x1-7x-y) 20 tG-4y-29 =0. (4) 


The first equation can be satisfied by choosing x = 0; then the second equation requires that y = 0 


3 
ory = T Similarly, the second equation can be satisfied by choosing y = 0, and then the first 
equation requires that x = 0 or x = 1. Thus we have found three critical points, namely, (0, 0), 
3 
0, a)? and (1,0). If neither x nor y is zero, then equations (4) are also satisfied by solutions of 


the system 


1—x—y20, 3—4y -2x 20, (5) 


: ; . These four critical points correspond to the 
equilibrium solutions of the system (3). The first three of these points involve the absence of one or 
both species; only the last corresponds to the presence of both species. Other solutions are represented 
as curves or trajectories in the x y-plane that describe the evolution of the populations in time. To 
begin to discover their qualitative behavior, we can proceed in the following way. 

First, observe that the coordinate axes are themselves trajectories. This follows directly from 
equations (3) since dx /dt = 0 on the y-axis (where x = 0) and, similarly, dy /dt = 0 on the x-axis 
(where y = 0). Thus no other trajectories can cross the coordinate axes. For a population problem 
only nonnegative values of x and y are significant, and we conclude that any trajectory that starts in 
the first quadrant remains there for all time. 

A direction field for the system (3) in the positive quadrant is shown in Figure 9.4.1; the black 
dots in this figure are the critical points or equilibrium solutions. Based on the direction field, it 


which leads to a fourth critical point 


1 1 
appears that the point 7 5) attracts other solutions and is therefore asymptotically stable, while 


the other three critical points are unstable. To confirm these conclusions, we can look at the linear 
approximations near each critical point. 
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| FIGURE 9.4.1 | Critical points and direction field for the system (3). 


The system (3) is locally linear in the neighborhood of each critical point. There are two 
ways to obtain the linear system near a critical point ( X, Y). First, we can use the substitution 
x = X +u, y = Y + v in equations (3), retaining only the terms that are linear in u and v. Or, 
as we saw in Section 9.3, we can evaluate the Jacobian matrix J at each critical point to obtain the 
coefficient matrix in the approximating linear system; see equation (13) in Section 9.3. When several 
critical points are to be investigated, it is usually better to use the Jacobian matrix. For the system 
(3), we have 


F(x,y) —x(1—-x—y) G(x, y) = y(0.75 — y — 0.5x), © 
SO 
ee B 1—2x-y =x 7 
J = J[F, G(x, y) = —0.5y 0.75 — 2y — 0.5x J " 


We will now examine each critical point in turn. 


(x, y) = (0, 0). This critical point corresponds to the state in which neither species is present. To 
determine what happens near the origin we can set x = y = 0 in equation (7), which leads to the 


corresponding linear system 
d (x 1 0 x 
ta = f (8) 
dt Ny 0 0.75) \y 


The eigenvalues and eigenvectors of the system (8) are 


1 0 
ral, €D= o ra =0.75, €V = b (9) 


so the general solution of the system is 


C) =c] (o) etc (3) ert, (10) 


Thus the origin is an unstable node both of the linear system (8) and of the nonlinear system (3). In 
the neighborhood of the origin, because r; = 1 is the dominant eigenvalue, all trajectories are tangent 
to the y-axis except for one trajectory that lies along the x-axis. If either or both of the species are 
present in small numbers, the population(s) will grow. 
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(x, y) = (1, 0). This corresponds to a state in which species x is present but species y is not. By 
evaluating J from equation (7) at (1, 0), we find that the corresponding linear system is 


ao)- Co wu an 


Its eigenvalues and eigenvectors are 


1 4 
r= -l, £&?- ; =1/4, P = . 12 
n £ 0 m=1/4, £ E (12) 
and its general solution is 
u 1 —t 4 t/4 
= e + e 13 


Since the eigenvalues have opposite signs, the point (1, 0) is a saddle point, and so it is an unstable 
equilibrium point of the linear system (11) and of the nonlinear system (3). The behavior of the 
trajectories near ( 1, 0) can be seen from equation (13). If c? = 0, then there is one pair of trajectories 
that approaches the critical point along the x-axis. In other words, if the y population is initially 
zero, then it remains zero forever. All other trajectories depart from the neighborhood of ( 1, 0); if y 
is initially small and positive, then the y population grows with time. As t — —oo, one trajectory 
approaches the saddle point tangent to the eigenvector £? whose slope is —5/4. 


3 
(x,y) = | 0, j This critical point is a state where species y is present but x is not. The analysis is 


similar to that for the point (1, 0). The corresponding linear system is 


d fu 0.25 0 u 
— = ; (14) 
dt Vv —0.375 | —0.75] \v 

The eigenvalues and eigenvectors are 


1 8 3 0 
n=p = (5): n=-7 = (0) (15) 


so the general solution of equation (14) is 


(£) =a ( se +e ee (16) 
v —3 1 


The point (0, 3/4) is also a saddle point. All trajectories leave the neighborhood of this point 
except one pair that approaches along the y-axis. The trajectory that approaches the saddle point as 
t — —oo is tangent to the line with slope —3/8 = —0.375 determined by the eigenvector €"!). If 
the x population is initially zero, it will remain zero, but a small positive x population will grow. 


(yy) = E 3] This critical point corresponds to a mixed equilibrium state, or coexistence, in the 
competition between the two species. The eigenvalues and eigenvectors of the corresponding linear 


system 
d fu —0.5 —0.5 u 
— = (17) 
dt \v —0.25  —0.5 v 


H- z(-2+ v2) = —0.146, €” -= Ga: 


» 
(18) 


N 


r = E - v2) = 0854 £? = o) 


Therefore, the general solution of equation (17) is 


() x po jm Pis p (19) 
v = 


Since both eigenvalues are negative, the critical point (1/2, 1/2) is an asymptotically stable 
node of the linear system (17) and of the nonlinear system (3). All nearby trajectories approach 
the critical point as £ — oo. One pair of trajectories approaches the critical point along 
the line with slope 2 /2 = 0.707 determined from the eigenvector £(?. All other trajectories 
approach the critical point tangent to the line with slope -/2 /2 = —0.707 determined from the 
eigenvector £P. 

A phase portrait for the system (3) is shown in Figure 9.4.2. By looking closely at the trajectories 
near each critical point, you can see that they behave in the manner predicted by the linear system 
near that point. In addition, note that the quadratic terms on the right-hand side of equations (3) 
are all negative. Since for x and y large and positive these terms are the dominant ones, it follows 
that far from the origin in the first quadrant both x’ and y’ are negative; that is, the trajectories are 
directed inward. Thus all trajectories that start at a point (xo, yo) with x9 > O and yo > 0 eventually 
approach the point (0.5, 0.5). In other words, the entire open first quadrant is the basin of attraction 
for (0.5, 0.5). 


0.75 


0.5 


0.25 0.5 0.75 1 1.25 x 


A phase portrait of the system (3). The critical points at (0, 0) is 
an unstable node, the ones at ( 1, 0) and (0, 3/4) are saddle points, and (1/2, 1/2) 
is an asymptotically stable node. 


EXAMPLE 2 


Discuss the qualitative behavior of the solutions of the system 


dx a ) 

— — x(1-x-—y), 

dt à 

2» (20) 
ae y(0.5 — 0.25 y — 0.75x), 


when x and y are nonnegative. Observe that this system is also a special case of the system (2) for 
two competing species. 


Solution: 


Once again, there are four critical points, namely, (0, 0), (1, 0), (0, 2), and (0.5, 0.5), corresponding 
to equilibrium solutions of the system (20). Figure 9.4.3 shows a direction field for the system (20), 
together with the four critical points. From the direction field it appears that the mixed equilibrium 
solution (0.5, 0.5) is a saddle point, and therefore unstable, while the points (1, 0) and (0, 2) are 
asymptotically stable. Thus, for competition described by equations (20), one species will eventually 
overwhelm the other and drive it to extinction. The surviving species is determined by the initial state 
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| FIGURE 9.4.3 | Critical points and direction field for the system (20). 


of the system. To confirm these conclusions, we can look at the linear approximations near each 
critical point. For later use, we record the Jacobian matrix J for the system (20): 


pa (FEY BOW) _ (1-2x-y -x T 
T Gay Gay) X -0.75y — 05-0.5y — 0.75x J” 


(x, y) = (0, 0). Using the Jacobian matrix J from equation (21) evaluated at (0, 0), we obtain the 


linear system 
1 0 
£(*) = u Q2) 
dt Vy 0 057 \y 


which is valid near the origin. The eigenvalues and eigenvectors of the system (22) are 


1 0 
=r 9-1 hme» 2ST (23) 
0 1 
so the general solution is 
1 0 
C) -a( Je rel Jes Q4) 
y 0 1 


Therefore, the origin is an unstable node of the linear system (22) and also of the nonlinear system 
(20). All trajectories leave the neighborhood of the origin tangent to the y-axis except for one 
trajectory that lies along the x-axis. 


(x, y) = (1, 0). The corresponding linear system is 


(s) 7 © E @ (25) 


Its eigenvalues and eigenvectors are 


1 4 
n=l, &»- ; m= 0.25, E%= : (26) 
0 =3 
and its general solution is 
u 1 4 
() E 6a + cy eae (27) 


The point (1,0) is an asymptotically stable node of the linear system (25) and of the nonlinear 
system (20). If the initial values of x and y are sufficiently close to (1,0), then the interaction 
process will lead ultimately to that state—that is, to the survival of species x and the extinction 
of species y. There is one pair of trajectories that approaches the critical point along the x-axis. 
All other trajectories approach (1, 0) tangent to the line with slope —3/4 that is determined by the 
eigenvector £O. 


(x, y) = (0, 2). The analysis in this case is similar to that for the point (1, 0). The appropriate linear 


system is 
d fu _ -1 0 u 28 
dtNv] \-15 -05/]ivJ aio 


The eigenvalues and eigenvectors of this system are 


1 5 0 
n=l, gD = ( J m=-0.5, ¿P = ( ) Q9) 
3 1 
and its general solution is 
1 0 
“Veal, jetel, eo. (30) 
v 3 1 


Thus the critical point (0, 2) is an asymptotically stable node of both the linear system (28) and the 
nonlinear system (20). All nearby trajectories approach the critical point tangent to the y-axis except 
for one trajectory that approaches along the line with slope 3. 


11 
(y) = ( ) The corresponding linear system is 


53 
d fu —0.5 —0.5 u 
Ed = (31) 
dt Vv —0.375  —0.125/ Vv 


The eigenvalues and eigenvectors are 


= a(t v57) = 0.1594, ¿® = (¢(-3- 57) ) E E. 


1 1 
ry = —(-5— y/57 | = —0.7844, eg? = (4(-3 37)) = . 
a A ) £ ppm 0.5687 


so the general solution is 


u 1 0.1594t 1 —0.7844t 
= ; ; ; . 33 
(£) " "uu * 0.5687 J^ pu 


Since the eigenvalues are of opposite signs, the critical point (0.5, 0.5) is a saddle point and therefore 
is unstable, as we had surmised earlier. All trajectories depart from the neighborhood of the critical 
point except for one pair that approaches the saddle point as t — oo. As they approach the critical 
point, the entering trajectories are tangent to the line with slope ( /57 — 3) /8 = 0.5687 determined 
from the eigenvector €‘?). There is also a pair of trajectories that approach the saddle point as t 
approaches —oo. These trajectories are tangent to the line with slope —1.3187 corresponding to €”. 

A phase portrait for the system (20) is shown in Figure 9.4.4. Near each of the critical points, the 
trajectories of the nonlinear system behave as predicted by the corresponding linear approximation. 
Of particular interest is the pair of red trajectories that enter the saddle point. These trajectories 
form a separatrix that divides the first quadrant into two basins of attraction. Trajectories starting 
above the separatrix ultimately approach the node at (0, 2), while trajectories starting below the 


(32) 
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v 
separatrix approach the node at (1, 0). If the initial state lies precisely on the separatrix, then the 
solution (x, y) will approach the saddle point as t — oo. However, the slightest perturbation of the 
point (x, y) as it follows this trajectory will dislodge the point from the separatrix and cause it to 
approach one of the nodes instead. Thus, in practice, one species will survive the competition and 
the other will not. 


x 


0.25 0.5 0.75 1 1.25 x 


A phase portrait of the system (20). The red curve is 
the separatrix. Solutions with initial conditions above the separatrix 
approach the critical point (0, 2). Solutions with initial conditions 
below the separatrix approach the critical point (1, 0). The critical 
point (0, 0) is an unstable node and the one at ( 1/2, 1/2) is a saddle 
point. 


Examples 1 and 2 show that in some cases the competition between two species leads to 
an equilibrium state of coexistence, while in other cases the competition results in the eventual 
extinction of one of the species. To understand more clearly how and why this happens, and to 
learn how to predict which situation will occur, it is useful to look again at the general system 
(2). There are four cases to be considered, depending on the relative orientation of the lines 


€; —04,X —a,y — 0 and e? — 02y — ox = 0, (34) 


as shown in Figure 9.4.5. These lines are called x- and y-nullclines? of system (2), 
respectively, because x’ is zero on the first and y’ is zero on the second. In each of the four 
cases in Figure 9.4.5, the x-nullcline is the solid blue line and the y-nullcline is the dashed 
green line. 

Let ( X, Y) denote any critical point in any one of the four cases. As in Examples 1 and 
2, the system (2) is locally linear in the neighborhood of this point because the right-hand 
side of each differential equation is a quadratic polynomial. To study the system (2) in the 
neighborhood of this critical point, we can look at the corresponding linear system obtained 


5The vertical line x = 0 is also an x-nullcline of system (2). The horizontal line y = 0 is also a y-nullcline of system 


(2). 


from equation (13) of Section 9.3: 


d (uj  f[e—2201X -aY —a,X\ /u (35) 
dt \v i —a35Y €? —205Y — aX vj. 


» » 
x €5l0to €i/0 x 
(b) 
~ 
+ > Ss = 
eola eo, X €i/01 Elo X 


(c) (d) 


The four cases for the competing-species system (2). 
The x-nullcline is the solid blue line, and the y-nullcline is the dashed 
green line. 


We now use equation (35) to determine the conditions under which the model described 
by equations (2) permits the coexistence of the two species x and y. Of the four possible cases 
shown in Figure 9.4.5, coexistence is possible only in cases (c) and (d). In these cases, the 
nonzero values of X and Y are obtained by solving the algebraic equations (34); the general 
result is 


€102 — €201 €20] — €102 
X -m——————, y-—————. (36) 
010» = Q103 0102 = Q103 


Further, since e; — c4 X — a ,Y = 0 and e? — o5Y — a2X = 0, equation (35) immediately 


reduces to 
d fu —0;X —aıX\/u 
— = : (37) 
dt \v —arY —g5Y v 


The eigenvalues of the system (37) are found from the equation 
r? + (01X +02Y)r +(0102 — aja) XY =0. (38) 
Thus 


—(oıX +02Y) «€ "TOES + o2Y)* — 4(o405 — A102) XY 
rit = : (39) 
2 
If 0405» — ~,;a. < O, then the radicand of equation (39) is positive and greater than 
(o,X + a;Y)?. Thus the eigenvalues are real and opposite in sign. Consequently, the critical 
point (X, Y) is an (unstable) saddle point, and coexistence is not possible. This is the case in 
Example 2, where c; = 1, a; = 1,05 = 0.25, a2 = 0.75, and o 102 — a 1a» = —0.5. 
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On the other hand, if c 10» — a@;@2 > O, then the radicand of equation (39) is less than 
(c1X + o5Y)?. In these cases the eigenvalues could be real, negative, and unequal, or they 
could be complex-valued with negative real part. A straightforward analysis of the radicand of 
equation (39) shows that the eigenvalues cannot be complex (see Problem 5). Thus the critical 
point is an asymptotically stable node, and sustained coexistence is possible. This is illustrated 


by Example 1, where o; = 1, œ; = 1,0. = 1, œ? = 5 and o102 — Q 1Q2 = 7 
Let us relate this result to Figures 9.4.5c and 9.4.5d. In Figure 9.4.5c we have 


€1 €2 €2 €1 

— > — or €;a2 > €590, and — > — or €Q] > €102. (40) 

01 [07] 02 Q1 
These inequalities, coupled with the condition that X and Y given by equations (36) be positive, 
yield the inequality c ,05» < oo». Hence in this case the critical point is a saddle point. On 
the other hand, in Figure 9.4.5d we have 

€1 €2 €2 €1 

— < — or €Q? < €90,; and — < — or €5a < €102. (41) 

91 a2 92 Q1 
Now the condition that X and Y be positive yields c;05 > oo». Hence the critical 
point is asymptotically stable. For this case we can also show that the other 
critical points (0, 0), (1/0, 0), and (0, e2/65) are unstable. Thus for any positive initial 
values of x and y, the two populations approach the equilibrium state of coexistence given by 
equations (36). 

Equations (2) provide the biological interpretation of the result that whether coexistence 
occurs depends on whether o 102 — o 10? is positive or negative. The o’s are a measure of the 
inhibitory effect that the growth of each population has on itself, whereas the a’s are a measure 
of the inhibiting effect that the growth of each population has on the other species. Thus, 
when 204105 > 002, interaction (competition) is “weak” and the species can coexist; when 
0102 < QQ, interaction (competition) is "strong" and the species cannot coexist— one 


must die out. 


Problems 


Each of Problems 1 through 4 can be interpreted as describing the 
interaction of two species with populations x and y. In each of these 
problems, carry out the following steps. 
Q a. Draw a direction field and describe how solutions behave. 
b. Find the critical points. 
C. For each critical point find the corresponding linear system. 
Find the eigenvalues and eigenvectors of the linear system; 
classify each critical point as to type, and determine whether it is 
asymptotically stable, stable, or unstable. 
d. Sketch trajectories in the neighborhood of each critical point. 
Q e. Compute and plot enough trajectories of the given system 
to show clearly the behavior of the solutions. 
f. Determine the limiting behavior of x and y as t — oo, and 
interpret the results in terms of the populations of the two species. 
l. dx/dt —x(1.5— x —0.5y) 
dy/dt = y(2 — y —0.75x) 


2. dx/dt — x(1.5 — x — 0.5y) 
dy/dt = y(2 — 0.5y — 1.5x) 

3. dx/dt — x(1—x — y) 
dy/dt = y(1.5— y — x) 


4. dx/dt — x(1— x 4-0.5y) 
dy /dt = y(2.5 — 1.5y + 0.25x) 


5. Consider the eigenvalues given by equation (39). Show that 


(c1X-o5Y)? —4(o,05—a,a3) XY = (0 |X —07Y)*+4a a XY. 


Hence conclude that the eigenvalues can never be complex-valued. 


6. Two species of fish that compete with each other for food, but 
do not prey on each other, are bluegill and redear. Suppose that a pond 
is stocked with bluegill and redear, and let x and y be the populations 
of bluegill and redear, respectively, at time t. Suppose further that the 
competition is modeled by the equations 


= XE O 1X Q A — € oO Ax). 
lt 1 1 1. lt MS) 2y 2 


a. If c/o» > €1/01 and €2/02 > €,/a1, show that the only 
equilibrium populations in the pond are no fish, no redear, or no 
bluegill. What will happen for large t? 
b. If c;/o, > €2/a2 and €,/a, > €2/02, show that the only 
equilibrium populations in the pond are no fish, no redear, or no 
bluegill. What will happen for large t? 


7. Consider the competition between bluegill and redear mentioned 
in Problem 6. Suppose that €2/a@2 > €,/a, and e€,/a, > €2/02, 
so, as shown in the text, there is a stable equilibrium point at which 
both species can coexist. It is convenient to rewrite the equations of 
Problem 6 in terms of the carrying capacity of the pond for bluegill 
(B = e,/0;) in the absence of redear and its carrying capacity for 
redear ( R = €2/ 02) in the absence of bluegill. 

a. Show that the equations of Problem 6 take the form 


dx _ i 1 yı dy _ 1 1 Y2 

d BU BY a v" rE GU 
where y; — 01/0, and y2 = 05/05. Determine the coexistence 
equilibrium point ( X, Y) in terms of B, R, y1, and y2. 


b. Now suppose that an angler fishes only for bluegill with 
the effect that B is reduced. What effect does this have on the 
equilibrium populations? Is it possible, by fishing, to reduce the 
population of bluegill to such a level that they will die out? 

8. Consider the system (2), and assume that o 103 — & 4105 = 0. 
a. Find all the critical points of the system. Observe that the 
result depends on whether o ;€3 — a €, is zero. 

b. Ifc ,€5—a2»e, > 0, classify each critical point and determine 

whether it is asymptotically stable, stable, or unstable. Note that 

Problem 3 is of this type. Then do the same if c4€5? — 5e; < 0. 

c. Analyze the nature of the trajectories when o ;€2—a2€, = 0. 

9. Consider the system (3) in Example 1 of the text. Recall that 

this system has an asymptotically stable critical point at (0.5, 0.5), 

corresponding to the stable coexistence of the two population species. 

Now suppose that immigration or emigration occurs at the constant 

rates of ôa and 6b for the species x and y, respectively. In this case 
equations (3) are replaced by 

dx dy 


) y 
-—-—— uv) —J(3-4y-2x)- 6b. (42 
a x( Xx — y) a qd Pi y — 2x) (42) 


The question is what effect this has on the location of the stable 
equilibrium point. 
a. To find the new critical point, we must solve the equations 
x(1— x -— y) + ĉa = 0, 


7(3 -4y — 2x) + 6b — 0. (43) 


One way to proceed is to assume that x and y are given by power 
series in the parameter 6; thus 


x-—xgd4xjió 4, y= yo +y +. (44) 


Substitute equations (44) into equations (43) and collect terms 
according to powers of 6. 

b. From the constant terms (the terms not involving 6), show 
that x9 = 0.5 and yo = 0.5, thus confirming that in the absence 
of immigration or emigration, the critical point is (0.5, 0.5). 

C. From the terms that are linear in ó, show that 


x, =4a — 4b, yı =—2a+ 4b. (45) 


d. Suppose that a > 0 and b > O so that immigration occurs 
for both species. Show that the resulting equilibrium solution 
may represent an increase in both populations, or an increase in 
one but a decrease in the other. Explain intuitively why this is a 
reasonable result. 


10. The system 
x -—y, y'= —yy — x(x —0.15)(x — 2) 


results from an approximation to the Hodgkin—Huxley® equations, 
which model the transmission of neural impulses along an axon. 
a. Find the critical points, and classify them by investigating the 
approximate linear system near each one. 
Q b. Draw phase portraits for y = 0.8 and for y = 1.5. 
O c. Consider the trajectory that leaves the critical point 
(2, 0). Find the value of y for which this trajectory ultimately 
approaches the origin as t — oo. Draw a phase portrait for this 
value of y. 


Sir Alan L. Hodgkin (1914-1998) and Sir Andrew F. Huxley (1917-  ), 
British physiologists and biophysicists, studied the excitation and transmission 
of neural impulses at Cambridge University and the Marine Biological 
Association Laboratory in Plymouth. This work was both theoretical (resulting 
in a system of nonlinear differential equations) and experimental (involving 
measurements on the giant axon of the Atlantic squid). They were awarded the 
Nobel Prize in Physiology or Medicine in 1963. 
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Bifurcation Points. Consider the system 


x'=F(x,y,a), y =G(x,y,a), (46) 


where o is a parameter. The equations 
F(x,y,a) 20, G(x,y,a) =0 (47) 


determine the x- and y-nullclines, respectively; any point where an 
x-nullcline anda y-nullcline intersect is a critical point. As o varies and 
the configuration of the nullclines changes, it may well happen that, at 
a certain value of a, two critical points coalesce into one. For further 
variation in o, the critical point may once again separate into two 
critical points, or it may disappear altogether. Or the process may occur 
in reverse: For a certain value of a, two formerly nonintersecting 
nullclines may come together, creating a critical point, which, for 
further changes in a, may split into two. A value of a at which such 
phenomena occur is a bifurcation point. It is also common for a critical 
point to experience a change in its type and stability properties at a 
bifurcation point. Thus both the number and the kind of critical points 
may change abruptly as o passes through a bifurcation point. Since a 
phase portrait of a system is very dependent on the location and nature 
of the critical points, an understanding of bifurcations is essential to 
an understanding of the global behavior of the system's solutions. 


In each of Problems 11 through 14: 
a. Sketch the nullclines and describe how the critical points 
move as a increases. 
b. Find the critical points. 
O c. Leto = 2. Classify each critical point by investigating the 
corresponding approximate linear system. Draw a phase portrait 
in a rectangle containing the critical points. 
Q d. Find the bifurcation point œg at which the critical points 
coincide. Locate this critical point, and find the eigenvalues of 
the approximate linear system. Draw a phase portrait. 
(G] e. Fora > ag, there are no critical points. Choose such a 
value of o and draw a phase portrait. 


11. x’ =—4x + y x? y -$ja-y 


3 
12. ga cy y = —4x + y +x? 


13. x’ =—-4x +y+x%, y'2—-a-x-y 
14. x/=-a-xt+y, y=—4x+y+x? 
Problems 15 through 17 deal with competitive systems much like 
those in Examples 1 and 2, except that some coefficients depend on a 
parameter a. In each of these problems, assume that x, y, and a are 
always nonnegative. In each of Problems 15 through 17: 
a. Sketch, by hand, the nullclines in the first quadrant, as in 
Figure 9.4.5. For different ranges of a, your sketch may resemble 
different parts of Figure 9.4.5. 
b. Find the critical points. 
c. Determine the bifurcation points. 
d. Find the Jacobian matrix J, and evaluate it for each of the 
critical points. 
e. Determine the type and stability property of each critical 
point. Pay particular attention to what happens as a passes 
through a bifurcation point. 
Q f. Draw phase portraits for the system for selected values of 
à to confirm your conclusions. 


15. dx/dt 2x(1—x— y), dy/dt = y(a — y —0.5x) 
16. dx/dt —x(1—x-—y) dy/dt = y(0.75 — ay —0.5x) 
17. dx/dt 2x(1—x— y), dy/dt = y(a — y — (2a — D)x) 


428 


CHAPTER 9 Nonlinear Differential Equations and Stability 


95 Predator- Prey Equations 


In the preceding section we discussed a model of two species that interact by competing for 
a common food supply or other natural resource. In this section we investigate the situation 
in which one species (the predator) preys on the other species (the prey), while the prey lives 
on a different source of food. For example, consider foxes and rabbits in a closed forest. The 
foxes prey on the rabbits, the rabbits live on the vegetation in the forest. Other examples are 
bass in a lake as predators and redear as prey, and ladybugs as predators and aphids as prey. 
We emphasize again that a model involving only two species cannot fully describe the complex 
relationships among species that actually occur in nature. Nevertheless, the study of simple 
models is the first step toward an understanding of more complicated phenomena. 

We will denote by x and y the populations of the prey and predator, respectively, at 
time f. In constructing a model of the interaction of the two species, we make the following 
assumptions: 


1. In the absence of the predator, the prey grows at a rate proportional to the current 
population; thus dx /dt = ax,a > 0, when y = 0. 

2. In the absence of the prey, the predator dies out; thus dy /dt = —cy, c > 0, when x = 0. 

3. The number of encounters between predator and prey is proportional to the product of their 
populations. Each such encounter tends to promote the growth of the predator and to 
inhibit the growth of the prey. Thus the growth rate of the predator is increased by a term 
of the form yxy, while the growth rate of the prey is decreased by a term —a xy, where 
y and a are positive constants. 


As a consequence of these assumptions, we are led to the equations 


dx ( ) 

— — ax -—axy= — 

di y 2x(a — ay), 

d = 
y 

1 —cy + yxy = y(—c + yx). 


The constants a, c, a, and y are all positive; a and c are the growth rate of the prey and the 
death rate of the predator, respectively, and o and y are measures of the effect of the interaction 
between the two species. Equations (1) are known as the Lotka — Volterra equations. They 
were developed in papers by Lotka” in 1925 and by Volterra? in 1926. Although these are 
rather simple equations, they do characterize a wide class of problems. Ways of making them 
more realistic are discussed at the end of this section and in the problems. Our goal here is to 
determine the qualitative behavior of the solutions (trajectories) of the system (1) for arbitrary 
positive initial values of x and y. We first do this for a specific example and then return to the 
general equations (1) at the end of the section. 


7 Alfred J. Lotka (1880-1949), an American biophysicist, was born in what is now Ukraine and was educated mainly 
in Europe. He is remembered chiefly for his formulation of the Lotka- Volterra equations. He was also the author, in 
1924, of the first book on mathematical biology; it is now available as Elements of Mathematical Biology (New York: 
Dover, 1956). 


8Vito Volterra (1860—1940), a distinguished Italian mathematician, held professorships at Pisa, Turin, and Rome. He 
is particularly famous for his work in integral equations and functional analysis. Indeed, one of the major classes of 
integral equations is named for him; see Problem 16 of Section 6.6. His theory of interacting species was motivated 
by data collected by his son-in-law, Humberto d'Ancona, concerning fish catches in the Adriatic Sea. A translation 
of his 1926 paper can be found in an appendix to R. N. Chapman, Animal Ecology with Special Reference to Insects 
(New York: McGraw-Hill, 1931). 


EXAMPLE 1 


Discuss the solutions of the system 


— = x(1 — 0.5y) = x — 0.5xy = F(x, y), 


(2) 
d 
oY = y(—0.75 + 0.25x) = —0.75y + 0.25xy = G(x, y) 


for x and y positive. 


Solution: 


The critical points of this system are the solutions of the algebraic equations 
x(1—0.5y) 20, y(—0.75 + 0.25x) = 0, (3) 


namely, the points (0, 0) and (3, 2). Figure 9.5.1 shows the critical points and a direction field for the 
system (2). From this figure, it appears that trajectories in the first quadrant encircle the critical point 
(3, 2). Whether the trajectories are actually closed curves, or whether they slowly spiral in or out, 
cannot be definitely determined from the direction field. The origin appears to be a saddle point. Just 
as for the competition equations in Section 9.4, the coordinate axes are trajectories of equations (1) 
or (2). Consequently, no other trajectory can cross a coordinate axis, which means that every solution 
starting in the first quadrant remains there for all time. 
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| FIGURE 9.5.1 | Critical points and direction field for the predator-prey system (2). 


Next we examine the local behavior of solutions near each critical point. 


(x, y) = (0, 0). Near the origin we can neglect the nonlinear terms in equations (2) to obtain the 


corresponding linear system 
d (x 1 0 x 
2 = (4) 
dt \y 0  —0.75/ Vy 


The eigenvalues and eigenvectors of equation (4) are 


1 0 
n=l, = ( ) rp = —0.75, £O = ( i (5) 
0 1 
so its general solution is 
* Il. 0 o5; 
= fon 6 
() i (o): tal) P 
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Thus the origin is a saddle point both of the linear system (4) and of the nonlinear system (2) and 
therefore is unstable. One trajectory enters the origin along the y-axis; all other trajectories depart 
from the neighborhood of the origin. 


(x, y) = (3, 2). To examine the critical point (3, 2), we can use the Jacobian matrix 


Fy (x, y) Fy(x, y) 1 —0.5y —0.5x 
x uL [S Gy(x, ») ( 0.25y oc 0) 


Evaluating J at the point (3, 2), we obtain the linear system 


d fuj (0 —].5 u 8 
dt\v) |. \0.5 0 vy’ (5) 


where u = x — 3 and v = y — 2. The eigenvalues and eigenvectors of this system are 


3i 3i 1 
ry = v3 E ¿eD = eI pee $ EP = (uus) (9) 


Since the eigenvalues are imaginary, the critical point (3, 2) is a center of the linear system (8) and 
is therefore a stable critical point for that system. Recall from Section 9.3 that this is one of the cases 
in which the behavior of the linear system may or may not carry over to the nonlinear system, so the 
nature of the point (3, 2) for the nonlinear system (2) cannot be determined from this information. 

The simplest way to find the trajectories of the linear system (8) is to divide the second of 
equations (8) by the first so as to obtain the differential equation 


dv dv/dt  0.5u u 


du | du/dt | —l.5v 3v' 


or 
udu + 3v dv =0. (10) 

Consequently, 
u? -- 3v? =k, (11) 


where k is an arbitrary nonnegative constant of integration. Thus the trajectories of the linear system 
(8) are ellipses centered at the critical point and elongated somewhat in the horizontal direction. 

Now let us return to the nonlinear system (2). Dividing the second of equations (2) by the first, 
we obtain 


dy — y(—0.75 + 0.25x) 


(12) 
dx x(1—0.5y) 
Equation (12) is a separable equation and can be put in the form 
1 —0.5 —0.75 + 0.25 
? dye ENS diis 
y x 
from which it follows that 
0.75In x + In y — 0.5y — 0.25x =c, (13) 


where c is a constant of integration. Although by using only elementary functions we cannot solve 
equation (13) explicitly for either variable in terms of the other, it is possible to show that the graph 
of the equation for a fixed value of c is a closed curve surrounding the critical point (3, 2). Thus 
the critical point is also a center of the nonlinear system (2), and the predator and prey populations 
exhibit a cyclic variation. 

Figure 9.5.2 shows a phase portrait of the system (2). For some initial conditions, the trajectory 
represents small variations in x and y about the critical point and is almost elliptical in shape, as the 
linear analysis suggests. For other initial conditions, the oscillations in x and y are more pronounced, 
and the shape of the trajectory is significantly different from an ellipse. Observe that the trajectories 
are traversed in the counterclockwise direction. The dependence of x and y on t for the red trajectory 
in Figure 9.5.2 is shown in Figure 9.5.3. Note that x and y are periodic functions of t, as they must 
be since the trajectories are closed curves. Further, the oscillation of the predator population lags 
behind that of the prey. Starting from a state in which both predator and prey populations are relatively 
small, the prey first increase because there is little predation. Then the predators, with abundant food, 
increase in population also. This causes heavier predation, and the prey tend to decrease. Finally, 
with a diminished food supply, the predator population also decreases, and the system returns to its 
original state. From here the trajectory begins to repeat itself. 
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2 3 4 5 6 7 8 x 
A phase portrait of the system (2). The critical point at (3,2) is a 


center and the one at (0, 0) is a saddle point. The component plots for the red curve, 
with initial condition x(0) = 2, y(0) = 1 are shown in Figure 9.5.3. 


> 
oh 
* 


Variations of the prey (dotted red curve) and predator 


(dashed red curve) populations with time for the system (2) with initial 


condition x(0) = 2, y(0) = 1. 


The general system (1) can be analyzed in exactly the same way as in the example. The 


critical points of the system (1) are the solutions of 
x(a—ay)=0, y(—-cc yx) =0, 


that is, the points (0, 0) and (c/y, a/a). We first examine the solutions of the corresponding 


linear system near each critical point. 
In the neighborhood of the origin, the corresponding linear system is 

i)=(0 90) 

har = . (14) 
dt Ny 0 =c) Ny 

The eigenvalues and eigenvectors are 
1 0 
mag £P - (o): n=—c, P = (0) (15) 
(16) 


so the general solution is 
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Thus the origin is a saddle point and hence unstable. Entrance to the saddle point is along the 
(positive) y-axis; all other trajectories depart from the neighborhood of the critical point. 
Next consider the critical point (c/^y, a/a). The Jacobian matrix is 


Evaluating J at (c/ y, a/a), we obtain the approximate linear system 


ay) = (vale i a! (17) 


where u=x—c/y and v—y-a/a. The eigenvalues of the system (17) are 
r = +i yac, so the critical point is a (stable) center of the linear system. To find the trajectories 
of the system (17), we can divide the second equation by the first to obtain 


dv dv/dt ^ (ya/o)u 


du  du/ídt | (ac/y)v’ n 
Or 
yau du + a?cv dv = (0! (19) 
Consequently, 
yau? + a?cy? =k, (20) 


where k is a nonnegative constant of integration. Thus the trajectories of the linear system (17) 
are ellipses, just as in the example. 
Returning briefly to the nonlinear system (1), observe that it can be reduced to the single 
equation 
dy dy/dt _ y(—c+ yx) 


= = . (21) 
dx dxjdt x(a — ay) 
Equation (21) is separable and has the solution 
alny—ay+clInx —yx =C, (22) 


where C is a constant of integration. Again, it is possible to show that for fixed C, the graph 
of equation (22) is a closed curve surrounding the critical point (c/^, a/a). Thus this critical 
point is also a center for the general nonlinear system (1). 

The cyclic variation of the predator and prey populations can be analyzed in more detail 
when the deviations from the point (c/^y, a/ o) are small and the linear system (17) can be 
used. The solution of the system (17) can be written in the form 


u = ÊK cos(Vact + 4), v = Ek sin(a 9). (23) 
y aVa 


where the constants K and $ are determined by the initial conditions. Thus 


x= E 4 É K cos(Jact + à), 
d d 


(24) 
a a [c 
y=— + 2 fex sin(/act + $). 
a aya 
These equations are good approximations for the nearly elliptical trajectories close to 
the critical point (c/y, a/a). We can use them to draw several conclusions about the cyclic 
variation of the predator and prey on such trajectories. 


1. The sizes of the predator and prey populations vary sinusoidally with period 27/,/ac. 
This period of oscillation is independent of the initial conditions. 

2. The predator and prey populations are out of phase by one-quarter of a cycle. The prey 
leads and the predator lags, as explained in the example. 

3. The amplitudes of the oscillations are Kc/ for the prey and ./acK /a for the predator, 
and hence depend on the initial conditions as well as on the parameters of the problem. 

4. The average populations of predator and prey over one complete cycle are c/^ anda/a, 
respectively. These are the same as the equilibrium populations; see Problem 10. 


Cyclic variations of predator and prey as predicted by equations (1) have been observed in 
nature. One striking example is described by Odum (pp. 191—192); based on the records of the 
Hudson’s Bay Company of Canada, the abundance of lynx and snowshoe hare, as indicated by 
the number of pelts turned in over the period 1845-1935, shows a distinct periodic variation 
with a period of 9 to 10 years. The peaks of abundance are followed by very rapid declines, and 
the peaks of abundance of the lynx and hare are out of phase, with that of the hare preceding 
that of the lynx by a year or more. 

Since the critical point (c/y,a/a) is a center, we expect that small perturbations of 
the Lotka- Volterra equations may well lead to solutions that are not periodic. To put it 
another way, unless the Lotka- Volterra equations exactly describe a given predator-prey 
relationship, the actual fluctuations of the populations may differ substantially from those 
predicted by the Lotka- Volterra equations, due to small inaccuracies in the model equations. 
This has led to many attempts? to replace the Lotka- Volterra equations by other systems 
that are less susceptible to the effects of small perturbations. Problem 13 introduces one such 
alternative model. 

Another criticism of the Lotka- Volterra equations is that in the absence of the predator, 
the prey will grow without bound. This can be corrected by allowing for the natural inhibiting 
effect that an increasing population has on the growth rate of that population. For example, 
the first of equations (1) can be modified so that when y = 0, it reduces to a logistic equation 
for x. The effects of this modification are explored in Problems 11 and 12. Problems 14 
through 16 deal with harvesting in a predator- prey relationship. The results may seem rather 
counterintuitive. 

Finally, we repeat a warning stated earlier: relationships among species in the natural 
world are often complex and subtle. You should not expect too much of a simple system of 
two differential equations in describing such relationships. Even if you are convinced that 
the general form of the equations is sound, the determination of numerical values for the 
coefficients may present serious difficulties. 


?See the book by Brauer and Castillo-Chávez listed in the References for an extensive discussion of alternative models 
for predator-prey relationships. 


Problems 


Each of Problems 1 through 5 can be interpreted as describing the 3. dx/dt — x(1— 0.5x — 0.5y) 
interaction of two species with population densities x and y. In each dy/dt = y(—0.25 + 0.5x) 


dx /dt = x(—1 + 2.5x — 0.3y — x?) 


of these problems, carry out the following steps. 4 


E a direction field and describe how solutions seem to dy jdt Jj das) 


b. Find the critical points. 5. dx/dt = x(—0.5 + y) 


9.5 Predator - Prey Equations 


C. For each critical point, find the corresponding linear system. dy [dt = y(—0.25 + y —0.5x — y?) 
Find the eigenvalues and eigenvectors of the linear system; 6. In this problem we examine the phase difference between the 
classify each critical point as to type, and determine whether itis ^ cyclic variations of the predator and prey populations as given by 
asymptotically stable, stable, or unstable. equations (24) of this section. Suppose we assume that K > O and that 
d. Sketch the trajectories in the neighborhood of each critical ^ ; is measured from the time that the prey population x is a maximum; 


point. then ó = 0. 
Q e. Draw a phase portrait for the system. 
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a. Show that the predator population y reaches a maximum at 


f. Determine the limiting behavior of x and y as t — oc. t = T /(24/ac) = T/A, where T is the period of the oscillation. 


g. Interpret the results in terms of the populations of the two 
species. 


b. When is the prey population increasing most rapidly? 
decreasing most rapidly? a minimum? 


l. dx/dt = x(1.5 — 0.5y) C. Answer the questions in part b for the predator population. 
dy/dt = y(—0.5 + x) d. Draw a typical elliptic trajectory enclosing the point 
2. dx/dt — x(1 — 0.5y) (c/y,a/a), and mark on it the points found in parts a, b, 


dy/dt = y(—0.25 + 0.5x) and c. 
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7. a. Find the ratio of the amplitudes of the oscillations of 
the prey and predator populations about the critical point 
(c/y,a/a), using the approximation (24), which is valid for 
small oscillations. Observe that the ratio is independent of the 
initial conditions. 

b. Evaluate the ratio found in part a for the system (2). 

c. Estimate the amplitude ratio for the solution of the nonlinear 
system (2) shown in Figure 9.5.3. Does the result agree with that 
obtained from the linear approximation? 

O d. Determine the prey- predator amplitude ratio for other 
solutions of the system (2)—that is, for solutions satisfying 
other initial conditions. Is the ratio independent of the initial 
conditions? 


8. a. Find the period of the oscillations of the prey and predator 
populations, using the approximation (24), which is valid for 
small oscillations. Note that the period is independent of the 
amplitude of the oscillations. 

b. For the solution of the nonlinear system (2) shown in Figure 
9.5.3, estimate the period as well as possible. Is the result the 
same as for the linear approximation? 

Q c. Calculate other solutions of the system (2)—that is, 
solutions satisfying other initial conditions — and determine their 
periods. Is the period the same for all initial conditions? 

9. Consider the system 


dx y dy x 
-Á — 1 E =b 1 E 
di «( z) dt »( +3) 


where a and b are positive constants. Observe that this system is the 
same as in the example in the text if a = 1 and b = 0.75. Suppose the 
initial conditions are x(0) = 5 and y(0) = 2. 

Q a. Let a = 1 and b = 1. Plot the trajectory in the phase plane, 

and determine (or estimate) the period of the oscillation. 

Q b. Repeat part a fora = 3 and a = 1/3, with b = 1. 

Q c. Repeat part a for b = 3 and b = 1/3, witha = 1. 

d. Describe how the period and the shape of the trajectory 

depend on a and b. 


10. The average sizes of the prey and predator populations are 


defined as 
| [^T 1 patt 
x-- x(t)dt, y= ;/ y(t) dt, 
ah T Ja 


respectively, where T is the period of a full cycle, and A is any 
nonnegative constant. 
a. Using the approximation (24), which is valid near the critical 
point, show that x = c/y and y =a/a. 
O b. For the solution of the nonlinear system (2) shown in 
Figure 9.5.3, estimate x and y as well as you can. Try to 
determine whether x and y are given by c/y and a/a, 
respectively, in this case. Hint: Consider how you might estimate 
the value of an integral even though you do not have a formula 
for the integrand. 
Q c. Calculate other solutions of the system (2)—that is, 
solutions satisfying other initial conditions — and determine x 
and y for these solutions. Are the values of x and y the same for 
all solutions? 


In Problems 11 and 12, we consider the effect of modifying the 
equation for the prey x by including a term —c x? so that this equation 
reduces to a logistic equation in the absence of the predator y. Problem 
11 deals with a specific system of this kind, and Problem 12 takes up 
this modification to the general Lotka-Volterra system. The system in 
Problem 3 is another example of this type. 


1l. Consider the system 
y! = y(—0.75 4- 0.25x), 


where a > 0. Observe that this system is a modification of the system 
(2) in Example 1. 
a. Find all of the critical points. How does their location change 
as o increases from zero? Observe that there is a critical point in 
the interior of the first quadrant only if o. < 1/3. 
b. Determine the type and stability property of each critical 
point. Find the value c, < 1/3 where the nature of the critical 
point in the interior of the first quadrant changes. Describe the 
change that takes place in this critical point as c passes through 
O1. 
O c. Draw a direction field and phase portrait for a value of o 
between zero and o ;; for a value of ø between c ; and 1/3. 
d. Describe the effect on the two populations as ø increases 
from zero to 1/3. 


x' = x(1—ox — 0.5y), 


12. Consider the system 


dx/dt = x(a — ox — ay), dy/dt = y(—c + yx), 


where a, 0, a, c, and ¥ are positive constants. 
a. Find all critical points of the given system. How does 
their location change as o increases from zero? Assume that 
a/o>c/y, that is, o<ay/c. Why is this assumption 
necessary? 
b. Determine the nature and stability characteristics of each 
critical point. 
€. Show that there is a value of ø between zero and ay / c where 
the critical point in the interior of the first quadrant changes from 
a spiral point to a node. 
d. Describe the effect on the two populations as ø increases 
from zero to ay/c. 


13. In the Lotka- Volterra equations, the interaction between the 
two species is modeled by terms proportional to the product xy of the 
respective populations. If the prey population is much larger than the 
predator population, this may overstate the interaction; for example, 
a predator may hunt only when it is hungry and ignore the prey at other 
times. In this problem we consider an alternative model proposed by 
Rosenzweig and MacArthur.!? 
a. Consider the system 


j AES 2y j là x 
x =x{1l-s=- R =y{-- . 
5 xiuj* " ^V 4 x6 


Find all of the critical points of this system. 

b. Determine the type and stability characteristics of each 
critical point. 

Q c. Draw a direction field and phase portrait for this system. 


Harvesting in a Predator - Prey Relationship. In a predator-prey 
situation, it may happen that one or perhaps both species are valuable 
sources of food. Or, the prey species may be regarded as a pest, 
leading to efforts to reduce its numbers. In a constant-effort model of 
harvesting, we introduce a term — E,x in the prey equation and a term 
— E5y in the predator equation, where E; and E, are measures of the 
effort invested in harvesting the respective species. A constant-yield 
model of harvesting is obtained by including the term — H; in the prey 
equation and the term — H; in the predator equation. The constants E, 
E», H4, and H; are always nonnegative. Problems 14 and 15 deal with 
constant-effort harvesting, and Problem 16 deals with constant-yield 
harvesting. 


10See the book by Brauer and Castillo-Chávez for further details. 


14. Applying a constant-effort model of harvesting to the Lotka- 
Volterra equations (1), we obtain the system 


x =x(a— ay- E), y —y(-c-cyx — Ej). 


When there is no harvesting, the equilibrium solution is (c/y, a/a). 
a. Before doing any mathematical analysis, think about the 
situation intuitively. How do you think the populations will 
change if the prey alone is harvested? if the predator alone is 
harvested? if both are harvested? 

b. How does the equilibrium solution change if the prey is 
harvested, but not the predator ( E, > 0, E; = 0)? 

c. How does the equilibrium solution change if the predator is 
harvested, but not the prey ( Ej = 0, E» > 0)? 

d. How does the equilibrium solution change if both are 
harvested ( E, > 0, E; > 0)? 


15. If we modify the Lotka- Volterra equations by including a self- 
limiting term —o x? in the prey equation, and then assume constant- 
effort harvesting, we obtain the equations 


x'—x(a—ox—ay-—Ej, y'= y(—c + yx — Ej). 


In the absence of harvesting, the equilibrium solution of interest is 
x=c/y,y = (a/a) —(ac)/(a4y). 
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a. How does the equilibrium solution change if the prey is 
harvested (E, > 0), but not the predator ( £5 = 0)? 

b. How does the equilibrium solution change if the predator is 
harvested ( E2 > 0), but not the prey ( E, = 0)? 

c. How does the equilibrium solution change if both predator 
and prey are harvested ( Ej > 0, E» > 0)? 


16. In this problem we apply a constant-yield model of harvesting 
to the situation in Example 1. Consider the system 


x’ =x(1—0.5y) — Hı, y = y(—0.75 + 0.25x) — Hh, 


where H; and H, are nonnegative constants. Recall that if 
H, = H, = 0, then (3, 2) is an equilibrium solution for this system. 
a. Before doing any mathematical analysis, think about the 
situation intuitively. How do you think the populations will 
change if the prey alone is harvested? if the predator alone is 
harvested? if both are harvested? 
b. How does the equilibrium solution change if the prey is 
harvested (H, > 0), but not the predator (H = 0)? 
c. How does the equilibrium solution change if the predator is 
harvested ( H5 > 0), but not the prey (H; = 0)? 
d. How does the equilibrium solution change if both predator 
and prey are harvested ( H1 > 0, Hy > 0)? 


9.6 Liapunov's Second Method 


In Section 9.3 we showed how the stability of a critical point of a locally linear system 
can usually be determined from a study of the corresponding linear system. However, no 
conclusion can be drawn when the critical point is a center of the corresponding linear system. 
Examples of this situation are the undamped pendulum, equations (1) and (2) below, and the 
predator-prey problem discussed in Section 9.5. Also, for an asymptotically stable critical 
point, it may be important to investigate the basin of attraction— that is, the domain such that 
all solutions starting within that domain approach the critical point. The theory of locally linear 
systems does not help us to answer this question. 

In this section we discuss another approach, known as Liapunov's!! second method 
or direct method. The method is referred to as a direct method because no knowledge 
of the solution of the system of differential equations is required. Rather, conclusions 
about the stability or instability of a critical point are obtained by constructing a suitable 
auxiliary function. The technique is a very powerful one that provides a more global type 
of information — for example, an estimate of the extent of the basin of attraction of a critical 
point. Liapunov's second method can also be used to study systems of differential equations 
that are not locally linear; however, we will not discuss such problems. 


ThePendulum Equations. Basically, Liapunov's second method is a generalization of two 
physical principles for conservative systems, namely, (1) a rest position is stable if the potential 
energy is a local minimum, otherwise it is unstable, and (ii) the total energy is a constant 
during any motion. To illustrate these concepts, again consider the undamped pendulum 
(a conservative mechanical system), which is governed by the equation 


1 Alexander M. Liapunov (1857-1918), a student of Chebyshev at St. Petersburg, taught at the University of Kharkov 
from 1885 to 1901, when he became an academician in applied mathematics at the St. Petersburg Academy of 
Sciences. In 1917 he moved to Odessa because of his wife's frail health. His research in stability encompassed 
both theoretical analysis and applications to various physical problems. His second method formed part of his most 
influential work, General Problem of Stability of Motion, published in 1892. 


436 


CHAPTER 9 Nonlinear Differential Equations and Stability 


The corresponding system of first-order equations is 


dx dy 8. 

— = — = —— 3 2 

dt’ di p e 
where x = 0 and y = dÓ/dt. If we omit an arbitrary constant, the potential energy U is the 
work done in lifting the pendulum above its lowest position, namely, 


U(x, y) = mgL(1 — cos x); (3) 
see Figure 9.2.2. The critical points of the system (2) are (x,y) = (-+n7,0), for 
n = 0,1,2, 3, ... , corresponding to 0 = +n, d0/dt = 0. Physically, we expect the 
points (x, y) = (0,0), (+27, 0) ... , corresponding to 0 = 0, +27, ... , to be stable, since 
for them the pendulum bob is vertical with the weight down. Further, we expect the points 
(x, y) = (+7,0), (+37,0), ..., corresponding to 0 = +7,+37, ... , to be unstable, 


since for them the pendulum bob is vertical with the weight up. This agrees with statement (1), 
for at the former points U is a minimum equal to zero, and at the latter points U is a maximum 
equal to 2mgL. 

Next consider the total energy V, which is the sum of the potential energy U and the 


1 
kinetic energy anh d0/dt)?. In terms of x and y, 


l 5 1 
V(x, y) = U(x, y) + gnLy = mgL(1-— cosx) + gnL y. (4) 


On a trajectory corresponding to a solution (x, y) = (x(t), y(1)) of equations (2), V can be 
considered a function of t. The derivative of V(x(t), y(t)) with respect to f is called the rate 
of change of V along the trajectory. By the chain rule, 


dV(x(t), y(t)) _ dx(t) dy(t) 
~z ED y T VEO y) di 


' dx 2 dy 
— (mgLsinx) di T mL^y FE (5) 
where it is understood that x and y are actually functions of t: x = x(t), y = y(t). Finally, 
because (x, y) is a solution of system (2), we know dx /dt = y and dy/dt = —(g/L) sin x; 
substituting these in equation (5) for dx /dt and dy /dt, we find that dV /dt — 0. Hence V isa 
constant along any trajectory of the system (2), which is principle (ii). 

Itis important to note that at any point ( x, y), the rate of change of V along the trajectory 
through that point was computed without actually solving the system (2). It is precisely this 
fact that enables us to use Liapunov's second method for systems whose solution we do not 
know, which is the main reason for its importance. 

At the stable critical points, (x, y) = (+2n7,0),n = 0, 1,2, ... , the energy V is zero. 
If the initial state (x1, y1) of the pendulum is sufficiently near a stable critical point, then the 
energy V(x, y1) is small, and the motion (trajectory) associated with this energy stays close 
to the critical point. It can be shown that if V (x1, y1) is sufficiently small, then the trajectory 
is closed and encloses the critical point. For example, suppose that (x1, y1) is near (0, 0) and 
that V(x,, y1) is very small. The equation of the trajectory with energy V (xi, y1) is 


1 
V(x, y) 2 mgL(1 — cosx) + zmL’y? = V(x, yp. 


For x small, we have 1 — cosx = 1 — (1 — x?/2! + ---) = x?/2. Thus the equation of the 
trajectory is approximately 
1 1 
;"gLx^ + smL*y = V(x, yı), 
2 2 
or 


x? y? 


+ — 
2V (xi, y1) / (mg L) 2V (x1, y1)/ (mL?) 
This is an ellipse enclosing the critical point (0, 0); the smaller V (x4, yi) is, the shorter are the 


major and minor axes of the ellipse. Physically, the closed trajectory corresponds to a solution 
that is periodic in time — the motion is a small oscillation about the equilibrium point. 


If damping is present, however, it is natural to expect that the amplitude of the motion 
decays in time and that the stable critical point (center) becomes an asymptotically stable 
critical point (spiral point). See the phase portrait for the damped pendulum in Figure 9.3.5. 

dV 
This can almost be argued from a consideration of E For the damped pendulum, the total 


energy is still given by equation (4), but now, when (x, y) = (x(t), y(t)) is a solution of 


equations (13) of Section 9.2, ^ — y and = " sin x Ly. Substituting these 
expressions for dx /dt and dy /dt in equation (5) gives — = —cLy? < 0. Thus the energy 


is nonincreasing along any trajectory and, except for the trajectory approaching the origin 
along the line y = 0, the motion is such that the energy decreases. Hence each trajectory 


dV 
must approach a point of minimum energy —a stable equilibrium point. If F < O instead of 


dV bad : : 3" = 
dr X 0, it is reasonable to expect all trajectories that start sufficiently close to the origin, a 
t 


stable equilibrium point, will approach the origin as f increases. 
GeneralSystems. To pursue these ideas further, consider the autonomous system 


dx dy 

gi OD di = G(x, y). (6) 
and suppose that the point (x, y) = (0, 0) is an asymptotically stable critical point. Then there 
exists some domain D containing (0,0) such that every trajectory that starts in D must 
approach the origin as £ —> oo. Suppose that there exists an “energy” function V such that 
V > Ofor (x, y) in D with V = 0 only at the origin. Since each trajectory in D approaches the 
origin as f — oo, then following any particular trajectory, V decreases to zero as t approaches 
infinity. The type of result we want to prove is essentially the converse: if, on every trajectory, 
V decreases to zero as f increases, then the trajectories must approach the origin as t — oo, 
and hence the origin is asymptotically stable. First, however, it is necessary to make several 
definitions. 

Let V be defined on some domain D containing the origin. Then V is said to be positive 
definite on D if V(0, 0) = 0 and V(x, y) > 0 for all other points in D. Similarly, V is said 
to be negative definite on D if V(0, 0) = 0 and V(x, y) < 0 for all other points in D. If the 
inequalities > and < are replaced by > and <, then V is said to be positive semidefinite and 
negative semidefinite, respectively. We emphasize that when we speak of a positive definite 
(negative definite, ...) function on a domain D containing the origin, the function must be 
zero at the origin in addition to satisfying the proper inequality at all other points in D. 


EXAMPLE 1 


Verify that 
V(x, y) = sin(x? + y?) 
is positive definite on x? + y? < 1/2. 
Also, verify that 
W(x, y) = Gt y)? 


is positive semidefinite for all (x, y). 


Solution: 


Since V(0,0) 20 and V(x,y)>0 for 0<x?+y?< -/2, V is positive definite on 
0< x? y? < 1/2. 

Likewise, W(0,0) = 0 and W(x, y) 7 0 for all (x, y). But, the fact that W(x, y) = 0 along 
the line y = —x means that W is only positive semidefinite. 


We also want to consider the function 


V(x, y) = Vi(x, y) F(x, y) + Vy(x, y) G(x, y), (7) 
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where F and G are the same functions as in equations (6). We choose this notation 
because V(x, y) can be identified as the rate of change of V along the trajectory of the system 
(6) that passes through the point (x, y). That is, if (x, y) = (x(t), y(1)) is a solution of the 
system (6), then 


dV(x(t), y(t)) _ dx(t) dy(t) 
~z ` V.(x(t), y(t)) "y T Vy(x(t), y(t)) ae 


= Vix, » F(x, y) F V(x, y) G(x, y) 


= V(x, y). (8) 


The function V is often referred to as the derivative of V with respect to the system (6). 
We now state two Liapunov theorems, one dealing with stability, the other with instability. 


Theorem 9.6.1 | Liapunov Stability Theorem 


Suppose that the autonomous system (6) has an isolated critical point at the origin. If there exists 
a function V that is continuous and has continuous first partial derivatives, that is positive definite, 
and for which the function V given by equation (7) is negative definite on some domain D in the 
xy-plane containing (0, 0), then the origin is an asymptotically stable critical point. If V is negative 
semidefinite, then the origin is a stable critical point. 


Theorem 9.6.2 | Liapunov Instability Theorem 


Let the origin be an isolated critical point of the autonomous system (6). Let V be a function that is 
continuous and has continuous first partial derivatives. Suppose that V(0, 0) = 0 and that in every 
neighborhood of the origin there is at least one point at which V is positive (negative). If there exists 
a domain D containing the origin such that the function V given by equation (7) is positive definite 
(negative definite) on D, then the origin is an unstable critical point. 


The function V is called a Liapunov function. Before sketching geometric arguments for 
Theorems 9.6.1 and 9.6.2, we note that the difficulty in using these theorems is that they tell us 
nothing about how to construct a Liapunov function, assuming that one exists. In cases where 
the autonomous system (6) represents a physical problem, it is natural to consider first the 
actual total energy function of the system as a possible Liapunov function. However, Theorems 
9.6.1 and 9.6.2 are applicable in cases where the concept of physical energy is not pertinent. 
In such cases a judicious trial-and-error approach may be necessary. 

Now consider the second part of Theorem 9.6.1 —that is, the case V <0. Letc > Obea 
constant, and consider the curve in the xy-plane given by V(x, y) = c. For c = 0, the curve 
reduces to the single point (x, y) = (0, 0). We assume that if 0 < c, < c», then the curve 
V(x, y) = cı contains the origin and lies within the curve V(x, y) = c», as illustrated in 
Figure 9.6.1a. We show that a trajectory starting inside a closed curve V(x, y) — c cannot 
cross to the outside. Thus, given a circle of radius e about the origin, by taking c sufficiently 
small, we can ensure that every trajectory starting inside the closed curve V(x, y) — c stays 
within the circle of radius c; indeed, it stays within the closed curve V(x, y) = c itself. Thus 
the origin is a stable critical point. 

To show this, recall from calculus that the vector 


VV(x, y) = V(x, y)i + V(x, y)j. (9) 


known as the gradient of V, is normal to the level curve V(x, y) = c and points in the 
direction of increasing V. In the present case, V increases outward from the origin, so VV 
points away from the origin, as indicated in Figure 9.6.1b. Next, consider a trajectory (x, y) = 
(x(t), y(t)) of the system (6), and recall that the vector T(t) = x/(t)i+ y'(1)j is tangent 
to the trajectory at each point; see Figure 9.6.1b. Let (x1, y1) = (x(t), y(11)) be a point of 


intersection of the trajectory and a closed curve V(x, y) = c. At this point x’(t;) = F(x, y1), 
y(t) = G(xi, yi), so from equation (7) we obtain 


V(x, y) = Veli, yx") + VG, yi y(t) 
= (V(x YDI + VG. y03) (x GÀ + y) 


= VVGi y + T(t). (10) 
yA yA 
V(x,y)2c 
» » 
x x 
€1 
C2 x = x(t) 
y -y(t) 
NN dx(t), dy (tq). 
O <c «e; g it TEL j= TH) 


N 
Vin, y) i + Vo, y) j = VV æ, y 


(a) (b) 


(a) The point V(x, y) = 0 and the level curves V(x, y) = cı and V(x, y) = c2 
with 0 < c, < c». (b) Geometric interpretation of Liapunov's second method. 


From this construction we recognize. V(x, yı) as the scalar product of the vector 
VV(xi, yi) and the vector T(ż4). Since V(xi, yi) < 0, it follows that the cosine of the 
angle between V V(x , y1) and T(t;) is also less than or equal to zero; hence the angle itself 


"" T 3T RET : me 
is in the closed interval gta | Thus the direction of motion on the trajectory is inward 


with respect to V(xj, y1) = c or, at worst, tangent to this curve. Trajectories starting inside 
a closed curve V(xj, yj) = c (no matter how small c is) cannot escape, so the origin is 
a stable point. If V(x;, y1) < 0, then the angle between VV(xj, yj) and T(1j) is in the 


3 
open interval E =) , and the trajectories passing through points on the curve are actually 


pointed inward. As a consequence, it can be shown that trajectories starting sufficiently close 
to the origin must approach the origin; hence the origin is asymptotically stable. 

A geometric argument for Theorem 9.6.2 follows in a somewhat similar manner. Briefly, 
suppose that V is positive definite, and suppose that given any circle about the origin, 
there is an interior point (xj, y1) at which V(x,, y1) > 0. Consider a trajectory that starts 
at (x1, y1). It follows from equation (8) that along this trajectory, V must increase since 
V(xi, y1) > 0; furthermore, since V(xi, yi) > 0, the trajectory cannot approach the origin 
because V(0,0) = 0. This shows that the origin cannot be asymptotically stable. By further 
exploiting the fact that V(x, y) > 0, itis possible to show that the origin is an unstable critical 
point; however, we will not pursue this argument. 


EXAMPLE 2 


Use Theorem 9.6.1 to show that (x, y) —(0,0) is a stable critical point for the undamped 
pendulum equations (2). 


Solution: 


Before we begin, observe that the stability of the origin cannot be determined by Theorem 9.3.2 
because (0,0) is a center of the corresponding linear system. To apply the Liapunov Stability 
Theorem (Theorem 9.6.1), let V be the total energy given by equation (4): 


1 
V(x, y) = mgL(1 — cos x) + 5L y. (4) 
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If we take D to be the domain —7/2 < x < m/2, —oo < y < oo, then V is positive 
there except at the origin, where it is zero. Thus V is positive definite on D. Further, as we have 
already seen, 


V= (mgL sinx)(y) + (mL? y) (-2 sinx) =0 


for all x and y. Thus V is negative semidefinite on D; and so, by the last statement in Theorem 9.6.1, 
the origin is a stable critical point for the undamped pendulum. 


EXAMPLE 3 


Use the Liapunov Instability Theorem (Theorem 9.6.2) to show that (x, y) = (7, 0) is an unstable 
critical point for the undamped pendulum equations (2). 


Solution: 


For the critical point (7, 0) the Liapunov function given by equation (4) is no longer suitable because 
Theorem 9.6.2 calls for a function V for which V is either positive or negative definite. To analyze 
the point (7, 0), it is convenient to move this point to the origin by the change of variables x = 7 +u, 
y =v. Then the differential equations (2) become 


d d 
- e esiti. aD 
d^ L 


dt 


V, 
and the critical point is (0, 0) in the uv-plane. Consider the function 
V(u,v) =v sinu (12) 
and let D be the domain —7/4 < u < 7/4, —oo < v < oo. Then 
V = (v cosu) (v) + (sinu) (£ sinu) = v?’ cosu + 7 sin^ u (13) 


is positive definite in D. The only remaining question is whether there are points in every 
neighborhood of the origin where V itself is positive. From equation (12) we see that V(u, v) > 0 
in the first quadrant (where both sin u and v are positive) and in the third quadrant (where both are 
negative). Thus the conditions of Theorem 9.6.2 are satisfied, and the point (0, 0) in the uv-plane, 
corresponding to the point (7, 0) in the xy-plane, is unstable. 

The damped pendulum equations are discussed in Problem 6. 


From a practical point of view, we are often interested in the basin of attraction. The 
following theorem provides some information on this subject. 


Theorem 9.6.3 


Let the origin be an isolated critical point of the autonomous system (6). Let the function V be 
continuous and have continuous first partial derivatives. If there is a bounded domain Dy containing 
the origin where V(x, y) < K for some positive K, V is positive definite, and V is negative definite, 
then every solution of equations (6) that starts at a point in Dx approaches the origin as t approaches 
infinity. 


In other words, Theorem 9.6.3 says that if (x, y) =(x(t), y(t)) is the solution of 
equations (6) for initial data lying in Dx, then (x, y) approaches the critical point (0, 0) as 
t — oo. Thus Dy gives a region of asymptotic stability; of course, it may not be the entire 
basin of attraction. This theorem is proved by showing that (1) there are no periodic solutions 
of the system (6) in Dx, and (ii) there are no other critical points in Dx. It then follows that 
trajectories starting in Dx cannot escape and therefore must tend to the origin as t tends to 
infinity. 

Theorems 9.6.1 and 9.6.2 give sufficient conditions for stability and instability, 
respectively, but these conditions are not necessary. Also, our failure to find a suitable 
Liapunov function does not mean that there is no such function. Unfortunately, there are 


no general methods for the construction of Liapunov functions; however, there has been 
extensive work on the construction of Liapunov functions for special classes of equations. 
An elementary algebraic result that is often useful in constructing positive definite or negative 
definite functions is stated without proof in the following theorem. 


Theorem 9.6.4 


The function 
V(x, y) = ax? + bxy + cy? (14) 
is positive definite if, and only if, 
a > 0 and 4ac—b? 0 (15) 
and is negative definite if, and only if, 


a< 0 and 4ac — P? > 0. (16) 


The use of Theorem 9.6.4 is illustrated in the following example. 


EXAMPLE 4 


Show that the critical point (x, y) — (0, 0) of the autonomous system 


LX xy’, =-y xy (17) 
is asymptotically stable. 


Solution: 


The only equilibrium solution of system (17) is (x, y) = (0, 0). We try to construct a Liapunov 
function of the form (14). Then V,(x, y) = 2ax + by, V\(x, y) = bx + 2cy, so 


V(x, y) = (2ax + by) (—x — xy?) + (bx + 2cy) (—y — x?y) 
=— (2a (x^ + xy) + b(2xy xy + xy) + 2c p? + ay). 
If we choose b = 0, and a and c to be any positive numbers, then V is negative definite and V is 


positive definite by Theorem 9.6.4. Thus, by Theorem 9.6.1, the origin is an asymptotically stable 
critical point. 


EXAMPLE 5 


Consider the system 


dx a ) 

— =x(l-x-y), 

^ (18) 
y y 

— = 4(3 — 4y — 2x). 

dt ae y €) 


In Example 1 of Section 9.4, we found that this system models a certain pair of competing species 


11 
and that the critical point (5 3) is asymptotically stable. Confirm this conclusion by finding a 
suitable Liapunov function. 
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Solution: 


1 1 
It is helpful to transform the point (s :) to the origin. To accomplish this, let 


1 1 
Ec tu, peg (19) 


Then, substituting for x and y in equations (18), we obtain the new system 


du 1 1 / 5 
dt = 5" 2" u uv, 
2 
dv 1 1 1l à M 
= u V uv Ms 
d 4 2 


To keep the calculations relatively simple, consider the function V (u, v) = u?--v? as a possible 
Liapunov function. This function is clearly positive definite, so we need only to determine whether 
there is a region containing the origin in the uv-plane where the derivative V with respect to the 
system (20) is negative definite. We compute V (u, v) and find that 


V(u,v) = Ve EN. 


ll 
N 
& 
ca 
NI 
= 
| 
NI 
< 
| 
= 
N 
| 
= 
< 
Se 
+ 
N 
x 
AT 
E 
xz 
an 
= 
E 
- 
N 
Ne 


or 
, 3 
V(u, v) = — ( | dou + | + (zu? + 2u*v + uy? + 2)), (21) 


where we have collected together the quadratic and cubic terms. We want to show that the expression 
in square brackets in equation (21) is positive definite, at least for u and v sufficiently small. By 
Theorem 9.6.4 with a = c = 1 and b = 1.5, the quantity 4ac — P? is positive so that u? + 1.5uv +v? 
is positive definite. On the other hand, the cubic terms in equation (21) may be of either sign. Thus we 
must show that, in some neighborhood of (u, v) = (0, 0), the cubic terms are smaller in magnitude 
than the quadratic terms. Toward this end, we will find it helpful to observe that the quadratic terms 
can be written as 


3 1 3 
u? + zuv +y? = ge er ye ue p. (22) 


Thus, our goal is to show that, for u and v sufficiently small, 
3 2 2 3 l;a 2 3 2 
2u^ + 2u“v + uv +2v3] < 5 (u +v?) qG +v)’. (23) 


To estimate the left-hand side of equation (23), we introduce polar coordinates u = r cos 0, 
v =r sinf. Then 


2u? + 2u7?v + uv? + 2v7| = P 


2cos 0 -- 2cos? 0 sind + cos Ü sin? 0 Zr 


r? (2 sin? 0 ) 


ye 


sin? 0 +2 


IA 


cos Ü 


cos? 6| 4-2cos? 0 


sino | + 


since both | sin 0| and | cos 0| are bounded above by 1. To satisfy equation (23), it is now certainly 
sufficient to satisfy the more stringent requirement 


1 1 
Tr < AGA +v’) = 25 
1 
which yields r < 28 Thus, at least in this disk, the hypotheses of Theorem 9.6.1 are satisfied, so 


the origin is an asymptotically stable critical point of the system (20). The same is then true of the 


1 1 
critical point 225 of the original system (18). 
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If we refer to Theorem 9.6.3, the preceding argument also shows that the disk with center 


1 1 
73 and radius 28 is a region of asymptotic stability for the system (18). This is a severe 


underestimate of the full basin of attraction, as the discussion in Section 9.4 shows. To obtain a 
better estimate of the actual basin of attraction from Theorem 9.6.3, we would have to estimate the 
terms in equation (23) more accurately, or use a better (and presumably more complicated) Liapunov 
function, or both. 


Problems 


In each of Problems | through 3, construct a suitable Liapunov 
function of the form ax? + cy?, where a and c are to be determined. 
Then show that the critical point at the origin is of the indicated type. 


d 
1. — = —x3+xy?, AE es —2x?y-y3; 


asymptotically stable 
dt dt i ea 
d d 
2. T = —x3 + 2y3, z = —2xy7; stable (at least) 
d d 
T =x- y’, > = 2xy? + 4x?y +2y?; unstable 
4. Consider the system of equations 
T. lyo-xfG 2 =-x yf y) 
dt m * x,y), dt m x y. X, y). 


where f is continuous and has continuous first partial derivatives. 
Show that if f(x,y) > 0 in some neighborhood of the origin, 
then the origin is an asymptotically stable critical point, and that if 
f(x,y) < 0 in some neighborhood of the origin, then the origin is 
an unstable critical point. Hint: Construct a Liapunov function of the 
form c(x? + y?). 
5. A generalization of the undamped pendulum equation is 
au 
Ja + g(u) =0, (24) 
where g(0) = 0, g(u) > OforO < u < k, and g(u) < 0 for 
—k < u < O;thatis, ug(u) > 0 for u 40, —k < u < k. Notice that 
g(u) = sinu has this property on (—7 /2, 7/2). 
a. Letting x = u, y = du/dt, write equation (24) as a system 
of two equations, and show that x = 0, y = 0 is a critical point. 
b. Show that 
x 


1 > 
V(x, y) = at +f g(s)ds, -—k«x«k (25) 
0 


is positive definite, and use this result to show that the critical 
point (0,0) is stable. Note that the Liapunov function V 
given by equation (25) corresponds to the energy function 


V(x, y) = 22 + (1 — cos x) for the case g(u) = sinu. 


6. By introducing suitable dimensionless variables, we can write 
the system of nonlinear equations for the damped pendulum 
(Equations (8) of Section 9.3) as 


dx dy ; 
1009 g ^ TY Sin 


a. Show that the origin is a critical point. 

b. Show that although V(x, y) = x? + y? is positive definite, 
V (x, y) takes on both positive and negative values in any domain 
containing the origin, so V is not a Liapunov function. 

Hint: x — sinx > 0 for x > 0, and x — sinx < O for x < 0. 
Consider these cases with y positive but y so small that y? can 
be ignored compared to y. 


1 
C. Using the energy function V(x, y) = A + (1 — cosx) 


mentioned in Problem 5b, show that the origin is a stable critical 
point. Since there is damping in the system, we can expect that 
the origin is asymptotically stable. However, it is not possible to 
draw this conclusion using this Liapunov function. 

d. To show asymptotic stability, it is necessary to construct 
a better Liapunov function than the one used in part c. Show 


1 1 
that V(x, y) = z“ +y ++ a is such a Liapunov 


function, and conclude that the origin is an asymptotically stable 
critical point. Hint: From Taylor’s formula with a remainder, it 
follows that sinx = x — ox?/3!, where a depends on x but 
0< a < lfor—r/2 < x < n/2. Then, letting x = r cos0, 
y —rsin6, show that V(r cos0,rsin0) = —r?(1-r h(r, 0)), 
where |h(7, 0)| < 1 ifr is sufficiently small. 


7. The Liénard equation (Problem 27 of Section 9.3) is 


d?u Feti du +e) =0 

— + c(u) — u) =0, 

dt? dt 

where g satisfies the conditions of Problem 5 and c(u) > 0. Show that 
the point u = 0, du/dt = 0 is a stable critical point. 


8. a. A special case of the Liénard equation of Problem 7 is 


@u | du 

E + dt + 8 ( u) => 0, 
where g satisfies the conditions of Problem 5. Letting x = u, 
y = du /dt, show that the origin is a critical point of the resulting 
system. This equation can be interpreted as describing the motion 
of a spring- mass system with damping proportional to the 
velocity and a nonlinear restoring force. Using the Liapunov 
function of Problem 5, show that the origin is a stable critical 
point, but note that even with damping, we cannot conclude 
asymptotic stability using this Liapunov function. 
b. Asymptotic stability of the critical point (0, 0) can be shown 
by constructing a better Liapunov function, as was done in part d 
of Problem 6. However, the analysis for a general function g is 
somewhat sophisticated, and we mention only that an appropriate 
form for V is 

1 x 
V(x, y) = 3 + Ayg(x) «f a(s)ds, 
0 


where A is a positive constant to be chosen so that V is positive 
definite and V is negative definite. For the pendulum problem 
g(x) = sinx, use V as given by the preceding equation 


with A = 3 to show that the origin is asymptotically stable. 
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Hint: Use sinx = x — ax?/3! and cosx = 1 — 3x?/2!, where 
a and £ depend on x, and 0 < a < 1 andO0 < 8 < 1 for 


—T/2 < x < m/2;letx =r cos, y = r sin 0, and show that 


: 1 1 
V(rcos@,rsin@) = E d ( + j sin 20 + h(r, 2 where 


1 
|A(r, 0)| < 2 if r is sufficiently small. To show that V is positive 


definite, use cos x = 1 — x?/2 + yx*/4!, where y depends on 
x,andÜüc y < lfor—=r/2< x< 7/2. 


In Problems 9 and 10, we will prove part of Theorem 9.3.2: If the 
critical point (0, 0) of the locally linear system 
dx dy 
d ay X +any+ Fi(x, y), dE d51X a5; y- Gi(x, y) (26) 
is an asymptotically stable critical point of the corresponding linear 
system 
x + 2 zs (27) 
= Spa a ? SS: xX 252 y, 
di 11 12y dt 21 22) 
then it is an asymptotically stable critical point of the locally linear 
system (26). Problem 11 deals with the corresponding result for 
instability. 
9. Consider the linear system (27). 
a. Since (0,0) is an asymptotically stable critical point, show 
that a11 + a2 T (0) and 411422 — 412421 > 0. (See Problem 18 of 
Section 9.1.) 
b. Construct a Liapunov function V(x, y) = Ax? + Bxy+Cy? 
such that V is positive definite and V is negative definite. One 
way to ensure that V is negative definite is to choose A, B, and 
C so that V(x, y) 2 —x? — y?. Show that this leads to the result 


2 2 
a4, + a5, + (41422 — 412421) E 12025 + 411421 


2A ; A ; 


2 2 
g dj, t dj, + (411422 — 412421) 
i 2A f 


where A = (aj; + a55) (411422 — 412421). 
c. Using the result of part a, show that A > 0, and then show 
(several steps of algebra are required) that 


4AC— B = 


2 2 2 2 2 
e T4 +45, + az, ) (anan 415051) + 2( a) 1492 — 412421) 


AÀ 20. 


Thus, by Theorem 9.6.4, V is positive definite. 


10. In this problem we show that the Liapunov function constructed 
in the preceding problem is also a Liapunov function for the locally 
linear system (26). We must show that there is some region containing 
the origin for which V is negative definite. 

a. Show that 


V(x, y) = - (à y?) + QAx + By) Fi(x, y) 
+( Bx + 2Cy)G,(x, y). 


b. Recall that Fi(x, y)/r — O0 and G,(x,y)/r — O0 as 
r = (x? + y?)!/2 — 0. This means that, given any € > 0, 
there exists a circle r = R about the origin such that for 0. < 
r < R,|Fi(x, y)]| < er and |G,(x, y)| < er. Letting M be 
the maximum of |2A], | B], and |2C |, show by introducing polar 
coordinates that R can be chosen so that V (x, y) « Oforr « R. 
Hint: Choose c sufficiently small in terms of M. 


1l. In this problem we prove a part of Theorem 9.3.2 related to 
instability. 
a. Show that if 11 + a2 > 0 and 11022 — 412421 = 0, then 
the critical point (0, 0) of the linear system (27) is unstable. 
b. The same result holds for the locally linear system (26). As in 
Problems 9 and 10, construct a positive definite function V such 
that V(x, y) = x? + y and hence is positive definite, and then 
invoke Theorem 9.6.2. 


%7 Periodic Solutions and Limit Cycles 


In this section we discuss further the possible existence of periodic solutions of two- 
dimensional autonomous systems 


x’ = f(x). (1) 


Such solutions satisfy the relation 


x(t +T) = x(t) (2) 


for all t and for some positive constant T called the period. The corresponding trajectories are 
closed curves in the phase plane. Periodic solutions often play an important role in physical 
problems because they represent phenomena that occur repeatedly. In many situations a 
periodic solution represents a “final state" that is approached by all “neighboring” solutions 
as the transients due to the initial conditions die out. 

A special case of a periodic solution is a constant solution x = x°, which corresponds to a 
critical point of the autonomous system. Such a solution is clearly periodic with any period. In 
this section, when we speak of a periodic solution, we mean a nonconstant periodic solution. In 
this case the period T is usually chosen as the smallest positive number for which equation (2) 
is valid. 

Recall that the solutions of the linear autonomous system 


x’ = Ax (3) 
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are periodic if and only if the eigenvalues of A are pure imaginary. In this case the critical point 
at the origin is a center, as discussed in Section 9.1. We emphasize that if the eigenvalues of 
A are pure imaginary, then every solution of the linear system (3) is periodic, while if the 
eigenvalues are not pure imaginary, then there are no (nonconstant) periodic solutions. The 
predator-prey equations discussed in Section 9.5, although nonlinear, behave similarly: all 
solutions in the first quadrant are periodic. The following example illustrates a different way 
in which periodic solutions of nonlinear autonomous systems can occur. 


EXAMPLE 1 


Discuss the qualitative properties of the solutions of the system 


(7) -( ME AM (4) 
3]  Xexsby e yQe y 


Solution: 


It is not difficult to show that (0, 0) is the only critical point of the system (4) and also that the system 
is locally linear in the neighborhood of the origin. The corresponding linear system 


9-G 290 o 


has eigenvalues 1 + i. Therefore, the origin is an unstable spiral point for both the linear system (5) 
and the nonlinear system (4). Thus any solution that starts near the origin in the phase plane will 
spiral away from the origin. 

Since there are no other critical points, we might think that all solutions of equations (4) 
correspond to trajectories that spiral out to infinity. However, we now show that this is incorrect, 
because far away from the origin the trajectories are directed inward. 

The presence of x? + y? in system (4) suggests the introduction of polar coordinates r and 0, 


where 
x-—rcosÜ, yc-rsin6, (6) 
andr > 0. Differentiation of both sides of r? = x? + y?, with respect to t, gives 
2 dr " dx m dy 
"aro dt D wp 
If we multiply the first of equations (4) by x, multiply the second by y, and add (and divide by 2), 
we obtain 
dx dy 2 2 2 a2 
xq 4x76 ty") -(@ tr) (7) 
In terms of polar coordinates, equation (7) can be written as 
dr 2 2 
ra =r (12r) (8) 


This equation is similar to the equations discussed in Section 2.5. The critical points (for r > 0) 
are the origin and the point r = 1, which corresponds to the unit circle in the phase plane. From 


d d 
equation (8) it follows that 7 > Oifr < 1 and iid < O ifr > 1. Thus, inside the unit circle the 


trajectories are directed outward, while outside the unit circle they are directed inward. Apparently, 
the circle r — 1 is a limiting trajectory for this system. 

To complete the conversion of system (4) to polar coordinates, we need an equation involving 
dy 


2 from equations (6), we can see 


d. 
3 To find this equation, notice that upon calculating m and 


that 


y x =-r : (9) 


d d 
Thus, when the two equations in system (4) are used to replace m and z in equation (9), we arrive 


at 
dé dx dy 
T = EN 22 
£r ae 
Cancelling the common factor of r?, this reduces to 
dO — 
dt 


—]. (10) 
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The system of equations (8), (10) for r and 0 is equivalent to the original system (4). One 
solution of the system (8), (10) is 


rel, 0 = —t + tọ, (11) 


where tọ is an arbitrary constant. As f increases, a point satisfying equations (11) moves clockwise 
around the unit circle. Thus the autonomous system (4) has a periodic solution. Other solutions can 
be obtained by solving equation (8) by separation of variables; if r 4 0 andr Æ 1, then 


dr 


pap e (12) 


Equation (12) can be solved by using partial fractions to rewrite the left-hand side and 
then integrating. By performing these calculations, we find that the solution of equations (10) and 
(12) is 


1 
r= 0 = —t + tọ, (13) 


IF coe 
where c, and tọ are arbitrary constants. The solution (13) also contains the solution (11), which is 
obtained by setting co = 0 in the first of equations (13). 
The solution satisfying the initial conditions r = p, 0 = a att = O is given by 


0 = -(t — a). (14) 


1 
eae 


If p < 1, thenr — 1 from the inside as t — oo; if p > 1, then r — 1 from the outside as t > oo. 
Thus in all cases, the trajectories spiral toward the red circle r = 1 as t — oo. Several trajectories 
are shown in Figure 9.7.1. 


-2 


Sle chet) All trajectories of the system (4) spiral 
toward the red circle r = 1 as t > oo. 


In this example, the circle r — 1 not only corresponds to periodic solutions of the system 
(4), but it also attracts other nonclosed trajectories that spiral toward it as £ — oo. In general, a 
closed trajectory in the phase plane such that other nonclosed trajectories spiral toward it, from 
either the inside or the outside, as  — oo, is called a limit cycle. Thus the circle r = 1 is a limit 
cycle for the system (4). If all trajectories that start near a closed trajectory (both inside and 
outside) spiral toward the closed trajectory as £ — oo, then the limit cycle is asymptotically 
stable. 

Since the limiting trajectory is itself a periodic orbit rather than an equilibrium point, 
this type of stability is often called orbital stability. If the trajectories on one side spiral 
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toward the closed trajectory, while those on the other side spiral away as t — co, then the 
limit cycle is said to be semistable. If the trajectories on both sides of the closed trajectory 
spiral away as t — ox, then the closed trajectory is unstable. It is also possible to have closed 
trajectories that other trajectories neither approach nor depart from — for example, the periodic 
solutions of the predator-prey equations in Section 9.5. In this case the closed trajectory is 
stable. 

In Example 1 the existence of an asymptotically stable limit cycle was established by 
solving the equations explicitly. Unfortunately, this is usually not possible, so it is worthwhile 
to know general theorems concerning the existence or nonexistence of limit cycles of nonlinear 
autonomous systems. In discussing these theorems, it is convenient to rewrite the system (1) 
in the scalar form 


d d 
MFG. F = Gy): (15) 


dt 


Theorem 9.7.1 


Let the functions F and G have continuous first partial derivatives in a domain D of the xy-plane. A 
closed trajectory of the system (15) must necessarily enclose at least one critical (equilibrium) point. 
If it encloses only one critical point, the critical point cannot be a saddle point. 


Although we omit the proof of this theorem, itis easy to show examples of it. One is given 
by Example 1 and Figure 9.7.1, in which the closed trajectory encloses the critical point (0, 0), 
a spiral point. Another example is the system of predator-prey equations in Section 9.5; see 
Figure 9.5.2. Each closed trajectory surrounds the critical point (3, 2); in this case the critical 
point is a center. 

Theorem 9.7.1 is also useful in a negative sense. If a given region contains no critical 
points, then there can be no closed trajectory lying entirely in the region. The same conclusion 
is true if the region contains only one critical point, and this point is a saddle point. For 
instance, in Example 2 of Section 9.4, which deals with two competing species, the only 
critical point in the interior of the first quadrant is the saddle point (0.5, 0.5). Therefore, 
using the contrapositive of Theorem 9.7.1, this system has no closed trajectory lying in the 
first quadrant. 

A second result about the nonexistence of closed trajectories is given in the following 
theorem. 


Theorem 9.7.2 


Let the functions F and G have continuous first partial derivatives in a simply connected domain D 
of the x y-plane. If F} + Gy has the same sign throughout D, then there is no closed trajectory of the 
system (15) lying entirely in D. 


A simply connected two-dimensional domain is one with no holes. Theorem 9.7.2 is a 
straightforward consequence of Green's theorem in the plane; see Problem 13. The quantity 
F, + G, that appears in Theorem 9.7.3 can be recognized as the divergence of the vector field 
Fi + Gj. Note that if F, + G, changes sign in the domain, then no conclusion can be drawn; 
there may or may not be closed trajectories in D. 

To illustrate Theorem 9.7.2, consider the system (4). A routine calculation shows that 


F(x, y) + Gy(x, y) 22— a(x? ai »?) = 2(1 z 2r?), (16) 


where, as usual, r? = x? + y?. Hence F, + Gy is positive for 0 < r < 1/,/2, so there 
is no closed trajectory in this circular disk. Of course, we showed in Example 1 that there is 
no closed trajectory in the larger region r « 1. This illustrates that the information given by 
Theorem 9.7.2 may not be the best possible result. Again referring to equation (16), note that 
F, +G, < Oforr > 1/4/2. However, the theorem is not applicable in this case because 
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this annular region is not simply connected. Indeed, as shown in Example 1, the unit circle is 
a limit cycle for system (4). 
The following theorem gives conditions that guarantee the existence of a closed trajectory. 


Theorem 9.7.3 | Poincaré -Bendixson?? Theorem 


Let the functions F and G have continuous first partial derivatives in a domain D of the xy-plane. 
Let D, be a bounded subdomain in D, and let R be the region that consists of D, plus its boundary 
(all points of R are in D). Suppose that R contains no critical point of the system (15). If there is a 
solution of the system (15) that exists and stays in R for all t > tọ (for some constant fg), then either 
the solution is a periodic solution (closed trajectory), or the solution spirals toward a closed trajectory 
as f — oo. In either case, the system (15) has a periodic solution in R. 


Note that if R does contain a closed trajectory, then necessarily, by Theorem 9.7.1, this 
trajectory must enclose a critical point. However, this critical point cannot be in R. Thus R 
cannot be simply connected; it must have a hole. 

As an application of the Poincaré - Bendixson theorem, consider again the system (4). 
Since the origin is a critical point, it must be excluded. For instance, we can consider the region 
R defined by 0.5 < r < 2. Next, we must show that there is a solution whose trajectory stays 
in R for all ¢ greater than or equal to some fo. This follows immediately from equation (8). 
For r = 0.5, dr/dt > 0, so r increases, while for r = 2, dr/dt < 0, so r decreases. Thus 
any trajectory that crosses the boundary of R is entering R. Consequently, any solution of 
equations (4) that starts in R at t = tọ cannot leave but must stay in R for t > tọ. Of course, 
other numbers could be used instead of 0.5 and 2; all that is important is that r = 1 is included. 

One should not infer from this discussion of the preceding theorems that it is easy to 
determine whether a given nonlinear autonomous system has periodic solutions; often it is not 
a simple matter at all. Theorems 9.7.1 and 9.7.2 are frequently inconclusive, and for Theorem 
9.7.3 it is often difficult to determine a region R and a solution that always remains within it. 

We close this section with another example of a nonlinear system that has a limit cycle. 


EXAMPLE 2 


The van der Pol!? equation 


u” — u(1— u2)u! +u — 0, (17) 
where p is a nonnegative constant, describes the current u in a triode oscillator. Discuss the solutions 
of this equation. 


Solution: 


If js = 0, equation (17) reduces to u” + u = 0, whose solutions are sine or cosine waves of period 
2m . For jj > 0, the second term on the left-hand side of equation (17) must also be considered. This 
is the resistance term, proportional to u’, with a coefficient —j.(1 — u?) that depends on u. For large 
u, this term is positive and acts as usual to reduce the amplitude of the response. However, for small 
u, the resistance term is negative and so causes the response to grow. This suggests that perhaps there 
is a solution of intermediate size that other solutions approach as f increases. 

To analyze equation (17) more carefully, we write it as a system of two equations by introducing 
the variables x = u and y = u’. Then it follows that 


x —y and y ——x-4g(1—x2)y. (18) 


PTvar Otto Bendixson (1861-1935), a Swedish mathematician, received his doctorate from Uppsala University 
and was professor and for many years rector of Stockholm University. This theorem improved on an earlier result 
of Poincaré and was published in a paper by Bendixson in Acta Mathematica in 1901. 


'3Balthasar van der Pol (1889-1959) was a Dutch physicist and electrical engineer who worked at the Philips 
Research Laboratory in Eindhoven. He was a pioneer in the experimental study of nonlinear phenomena and 
investigated the equation that bears his name in a paper published in 1926. 


9.7 Periodic Solutions and Limit Cycles 


The only critical point of the system (18) is the origin. Near the origin the corresponding linear system 


of differential equations is 
x\' 0 1 x 
y =1 pj \y 


1 
whose eigenvalues are 2 (n +y/u?— 4) . Thus the origin is an unstable spiral point for0 < u < 2 


and an unstable node for j > 2. In all cases, a solution that starts near the origin grows as f increases. 

With regard to periodic solutions, Theorems 9.7.1 and 9.7.2 provide only partial information. 
From Theorem 9.7.1 we conclude that if there are closed trajectories, they must enclose the origin. 
Next, with F(x, y) = y and G(x, y) = —x + u (1 = x)y, we calculate 


F(x, y) + Gy(x, y) = u(1 — x°). (20) 


Then, because Fy + Gy > 0 when |x| < 1, it follows from Theorem 9.7.2 that closed trajectories, 
if there are any, are not contained in the vertical strip |x| < 1. 

The application of the Poincaré - Bendixson theorem to this problem is not nearly as simple as 
for Example 1. If we introduce polar coordinates, we find that the equation for the radial variable r 
is (see Problem 12) 


r' = u(1— r?° cos? 0)r sin? 0. (21) 


As in Example 1, consider an annular region R given by r} < r < r2, where rı is small and 
rə is large. When r = r4, the linear term on the right-hand side of equation (21) dominates, and 
r' > 0 except on the x-axis, where sin = 0 and consequently r’ = 0 also. Thus trajectories 
are entering R at every point on the circle r = r,, except possibly for those on the x-axis, where 
the trajectories are tangent to the circle. When r = r», the cubic term on the right-hand side of 
equation (21) is the dominant one. Thus r' « 0, except for points on the x-axis where r' = 0 
and for points near the y-axis where r? cos!0 < 1 and the linear term makes r’ > 0. Thus, no 
matter how large a circle is chosen, there will be points on it (namely, the points on or near the 
y-axis) where trajectories are leaving R. Therefore, the Poincaré- Bendixson theorem is not 
applicable unless we consider more complicated regions. 

It is possible to show, by a more intricate analysis, that the van der Pol equation does 
have a unique limit cycle. However, we will not follow this line of argument further. We turn 
instead to a different approach in which we plot numerically-computed approximations to solutions. 
Experimental observations indicate that the van der Pol equation has an asymptotically stable periodic 
solution whose period and amplitude depend on the parameter ju. By looking at graphs of trajectories 
in the phase plane and of u versus t, we can gain some understanding of this periodic behavior. 

Figure 9.7.2 shows two trajectories of the van der Pol equation in the phase plane for 
u = 0.2. The trajectory passing through (0, 1/3) spirals outward in the clockwise direction; 
this is consistent with the behavior of the linear approximation near the origin. The other 
trajectory passes through (—3,2) and spirals inward, again in the clockwise direction. 
Both trajectories approach a closed curve that corresponds to a stable periodic solution. In 
Figure 9.7.3 we show the plots of u versus t for the solutions corresponding to the trajectories in 


JAUTA Two trajectories of the van der Pol equation 
(17) for u = 0.2. The red trajectory passes through ( —3, 2); 
the blue trajectory passes through (0, 1/3). 
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Plots of u versus t for the trajectories in Figure 9.7.2. The blue 
curve is the trajectory through (0, 1/3); the red curve passes through ( —3, 2). 


Figure 9.7.2. The solution that is initially smaller gradually increases in amplitude, while the larger 
solution gradually decays. Both solutions approach a stable periodic motion that corresponds to the 
limit cycle. Figure 9.7.3 also shows that there is a phase difference between the two solutions as they 
approach the limit cycle. The plots of u versus f are nearly sinusoidal in shape, consistent with the 
nearly circular limit cycle in this case. 

Figures 9.7.4 and 9.7.5 show similar plots for the case u = 1. Trajectories again move 
clockwise in the phase plane, but the limit cycle is considerably different from a circle. The plots 
of u versus t tend more rapidly to the limiting oscillation, and again show a phase difference. 
The oscillations are somewhat less symmetric in this case, rising somewhat more steeply than 
they fall. 

Figure 9.7.6 shows the phase plane for u = 5. The motion remains clockwise, and the limit cycle 
is even more elongated, especially in the y direction. Figure 9.7.7 is a plot of u versus t. Although 
the solution starts far from the limit cycle, convergence to the limiting oscillation is virtually complete 
in a fraction of a period. Starting from one of its extreme values on the x-axis in the phase plane, 
the solution moves toward the other extreme position slowly at first, but once a certain point on the 


RY 


Two trajectories of the van der Pol equation (17) for jj = 1. 
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AUIII Plots of u versus t for the trajectories in Figure 9.7.4. Note that the 
waveforms are periodic and completely out of phase. 


w 
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Four trajectories of the van der Pol 
equation (17) for u = 5. 


A:N Plot of u versus t for the outward spiralling trajectory through (0, 1/3) (blue) 
and the inward spiralling trajectory through (—3, 2) (red). While these waveforms for y = 5 
are much less smooth than the waveforms for u = 1, they are still periodic and out of phase. 
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trajectory is reached, the remainder of the transition is completed very swiftly. The process is then 
repeated in the opposite direction. The waveform of the limit cycle, as shown in Figure 9.7.7, is quite 


different from a sine wave. 


These graphs clearly show that in the absence of external excitation, the van der Pol oscillator 
has a certain characteristic mode of vibration for each value of u. The graphs of u versus t show that 
the amplitude of this oscillation changes very little with u, but the period increases as ju increases. 
We emphasize that for small values of u the limit cycle is nearly a circle of radius 2. As ju increases, 
the limit cycle is stretched in the y direction but remains in the interval [—2, 2] horizontally. At the 
same time, the waveform changes from a smooth oscillation to a much more jumpy type of motion. 

The presence of a single periodic motion that attracts all (nearby) solutions —that is, an 
asymptotically stable limit cycle—is one of the characteristic phenomena present in many nonlinear 


differential equations. 


Problems 


In each of Problems 1 through 6, an autonomous system is expressed 
in polar coordinates. Determine all periodic solutions, all limit cycles, 
and the stability characteristics of all periodic solutions. 


d d0 
1. : = ru my, =1 
dt dt 
d d0 
2. € 2ra0-rp, =j 
dt dt 
3 dr ( m 3) dé 1 
A =r(r r , = 
dt dt 
4 dr a T 2) d0 i 
$ =f r)(r ; = 
dt dt 
dr : d0 
5. — —sin(ur), —=1 
dt dt 
dr d0 
6. =r|r —2|(r — 3), = -l1 
dt dt 


7. a. Show that the system 
d d 
c --y*xfn Saxtyf/r 


has periodic solutions corresponding to the zeros of f(r). What 
is the direction of motion on the closed trajectories in the phase 
plane? 

b. Let f(r) 2 r(r - 2)? (r? — 4r + 3) Determine all periodic 
solutions, and determine their stability characteristics. 


8. Determine the periodic solutions, if any, of the system 


dx 


ego quud) 
dt /x2 + y 

dy _ y 2 gu 
dt Et most +Y 2). 


9. Using Theorem 9.7.2, show that the linear autonomous system 


dx " d E 

— =. 7 — = a a 

di 11% T d12y dt 21% 22Y 
does not have a periodic solution (other than x — 0, y — 0) if 
ay, + 42? #0. 


In each of Problems 10 and 11, show that the given system has no 
periodic solutions other than constant solutions. 


d d 
10. — =xty+x3—y%, Z =x +2y-+27y + y 


dx dy 
11. = —2x —3 A = 3 x? 
ED X y— xy di yx x“y 


12. Derive equation (21) in Example 2. 


13. Prove Theorem 9.7.2 by completing the following argument. 
According to Green's theorem in the plane, if C is a sufficiently 
smooth simple closed curve, and if F and G are continuous and have 
continuous first partial derivatives, then 


nuc y)dy — G(x, y)dx) = [fre + Gy(x, y)) dA, 
ë R 


where C is traversed counterclockwise and R is the region enclosed 
by C. Assume that x = (t), y = W(t) isa solution of the system 
(15) that is periodic with period T. Let C be the closed curve given by 
x = ġ (t), y = Y(t) for0 < t < T. Show that for this curve, the line 
integral is zero. Then show that the conclusion of Theorem 9.7.2 must 
follow. 


14. Q a. By examining the graphs of u versus t in Figures 9.7.3, 
9.7.5, and 9.7.7, estimate the period T of the van der Pol oscillator 
in these cases. 

Q b. Plot the graphs of solutions of the van der Pol equation for 
other values of the parameter u. Estimate the period T in these 
cases also. 

Q c. Plot the estimated values of T versus js. Describe how T 
depends on p. 


15. The equation 


1 
iare) nao 


is often called the Rayleigh!^ equation. 


John William Strutt (1842-1919), the third Lord Rayleigh, made notable 
contributions in several areas of mathematical physics. His theory of scattering 
(1871) provided the first correct explanation of why the sky is blue, and his two- 
volume treatise The Theory of Sound, published in 1877 and 1878, is one of the 
classics of applied mathematics. Apart from five years as Cavendish Professor 
of Physics at Cambridge, he worked primarily in his private laboratory at home. 
He was awarded the Nobel Prize for Physics in 1904 for the discovery of argon. 


a. Write the Rayleigh equation as a system of two first-order 
equations. 

b. Show that the origin is the only critical point of this system. 
Determine its type and whether it is asymptotically stable, stable, 
or unstable. 

Qc. Let u = 1. Choose initial conditions, and graph the 
corresponding solution of the system on an interval such as 
0 <t < 20 or longer. Plot u versus f, and also plot the trajectory 
in the phase plane. Observe that the trajectory approaches a 
closed curve (limit cycle). Estimate the amplitude A and the 
period T of the limit cycle. 

O d. Repeat part c for other values of ji, such as 
u = 0.2, 0.5, 2, and 5. In each case estimate the amplitude 
A and the period T. 

e. Describe how the limit cycle changes as u increases. For 
example, make a table of values and/or plot A and T as functions 
of u. 


16. Consider the system of equations 
x’ =pxty—x(x?+y’), y =—xtyy—y(x*+y’), (22) 


where ju is a parameter. Observe that this system is the same as the 
one in Example 1, except for the introduction of ju. 
a. Show that the origin is the only critical point. 
b. Find the linear system that approximates equations (22) 
near the origin, and find its eigenvalues. Determine the type 
and stability of the critical point at the origin. How does this 
classification depend on u? 
c. Referring to Example 1 if necessary, rewrite equations (22) 
in polar coordinates. 
Q d. Show that when u > 0, there is a periodic solution 
r = ,/. By solving the system found in part c, or by plotting 
numerically computed approximate solutions, conclude that this 
periodic solution attracts all other nonzero solutions. 
Note: As the parameter u increases through the value zero, 
the previously asymptotically stable critical point at the origin 
loses its stability, and simultaneously a new asymptotically stable 


solution (the limit cycle) emerges. Thus the point y = O is 
a bifurcation point; this type of bifurcation is called a Hopf ? 
bifurcation. 


17. Consider the van der Pol system 


/ 


x =y, y 2—x- n(1—- x?)y, 


where now we allow the parameter j to be any real number. 
a. Show that the origin is the only critical point. Determine its 
type, its stability property, and how these depend on p. 
O b. Let u = —1; draw a phase portrait, and conclude that 
there is a periodic solution that surrounds the origin. Observe that 
this periodic solution is unstable. Compare your plot with Figure 
9.7.4. 
O c. Draw a phase portrait for a few other negative values of ju. 
Describe how the shape of the periodic solution changes with p. 
Q d. Consider small positive or negative values of p. By 
drawing phase portraits, determine how the periodic solution 
changes as jj, — 0. Compare the behavior of the van der Pol 
system as ju increases through zero with the behavior of the 
system in Problem 16. 


I5Eberhard Hopf (1902-1983) was born in Austria and educated at the 
University of Berlin but spent much of his life in the United States, mainly 
at Indiana University. He was one of the founders of ergodic theory. Hopf 
bifurcations are named for him in honor of his rigorous treatment of them in a 
1942 paper. 
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Problems 18 and 19 extend the consideration of the 
Rosenzweig - MacArthur predator-prey model introduced in Problem 
13 of Section 9.5. 


18. Consider the system 


; 2y ; x 
x =x| 2.4—0.2x - —— |. y-—y(-0254 —— |J. 
x+6 f ° x+6 


Observe that this system differs from that in Problem 13 of Section 
9.5 only in the growth rate for the prey. 
a. Find all of the critical points. 
b. Determine the type and stability of each critical point. 
Q c. Draw a phase portrait in the first quadrant, and conclude 
that there is an asymptotically stable limit cycle. Thus this model 
predicts a stable long-term oscillation of the prey and predator 
populations. 


19. Consider the system 


/ 0.2% — 22 ! 0.25 + —* 
x —-xla—0.2x — z = y| —0. = 
x+6 dni x +6 


where a is a positive parameter. Observe that this system includes the 
one in Problem 18 above and the one in Problem 13 in Section 9.5. 


a. Find all of the critical points. 

b. Consider the critical point in the interior of the first 
quadrant. Find the eigenvalues of the approximate linear system. 
Determine the value ag where this critical point changes from 
asymptotically stable to unstable. 

Q c. Draw a phase portrait for a value of a slightly greater than 
dy. Observe that a limit cycle has appeared. How does the limit 
cycle change as a increases further? 


20. There are certain chemical reactions in which the constituent 
concentrations oscillate periodically over time. The system 
x '=1-—(b+1)x+ Low y= bx — Tas 
4 4 
is a special case of a model, known as the Brusselator, of this kind of 
reaction. Assume that b is a positive parameter, and consider solutions 
in the first quadrant of the xy-plane. 
a. Show that the only critical point is (1, 4b). 
b. Find the eigenvalues of the approximate linear system at the 
critical point. 
C. Classify the critical point as to type and stability. How does 
the classification depend on 5? 
d. As b increases through a certain value bo, the critical point 
changes from asymptotically stable to unstable. What is that 
value bo? 
Q e. Plot trajectories in the phase plane for values of b slightly 
less than and slightly greater than bọ. Observe the limit cycle 
when b > bo; the Brusselator has a Hopf bifurcation point at bo. 
Q f. Plot trajectories for several values of b > bo and observe 
how the limit cycle deforms as b increases. 


21. 


The system 


1 1 
zie jE); y — -3G + 0.8y — 0.7) 


is a special case of the Fitzhugh - Nagumo!Ó equations, which model 
the transmission of neural impulses along an axon. The parameter k is 
the external stimulus. 


‘Richard Fitzhugh (1922-2007) of the United States Public Health 
Service and Jin-Ichi Nagumo (1926-1999) of the University of Tokyo 
independently proposed a simplification of the Hodgkin- Huxley model of 
neural transmission around 1961. 
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a. Show that the system has one critical point regardless of the 
value of k. 

[G| b. Find the critical point for k = 0, and show that it is 
an asymptotically stable spiral point. Repeat the analysis for 
k — 0.5, and show that the critical point is now an unstable spiral 
point. Draw a phase portrait for the system in each case. 

Q c. Find the value kọ where the critical point changes from 
asymptotically stable to unstable. Find the critical point and draw 
a phase portrait for the system for k = ko. 


Q d. Fork > ko, the system exhibits an asymptotically stable 
limit cycle; the system has a Hopf bifurcation point at kọ. Draw 
a phase portrait for k = 0.4, 0.5, and 0.6; observe that the limit 
cycle is not small when K is near kp. Also plot x versus ¢ and 
estimate the period T in each case. 

€. Ask increases further, there is a value kı at which the critical 
point again becomes asymptotically stable and the limit cycle 
vanishes. Find ky. 


98 Chaos and Strange Attractors: 
The Lorenz Equations 


In principle, the methods described in this chapter for two-dimensional autonomous systems 
can be applied to higher dimensional systems as well. In practice, several difficulties arise 
when we try to do this. One problem is that there is simply a greater number of possible cases 
that can occur, and the number increases with the number of equations in the system (and 
the dimension of the phase space). Another problem is the difficulty of graphing trajectories 
accurately in a phase space of more than two dimensions; even in three dimensions it may not 
be easy to construct a clear and understandable plot of the trajectories, and it becomes more 
difficult as the number of variables increases. Finally—and this has been widely recognized 
only since the 1970s—there are different and very complex phenomena that can occur, and 
frequently do occur, in n-dimensional systems with n > 3 that are not present in two- 
dimensional systems. Our goal in this section is to provide a brief introduction to some of 
these phenomena by discussing one particular three-dimensional autonomous system that has 
been intensively studied. In some respects, the presentation here is similar to the treatment of 
the logistic difference equation in Section 2.9. 

An important problem in meteorology, and in other applications of fluid dynamics, 
concerns the motion of a layer of fluid, such as the earth’s atmosphere, that is warmer at the 
bottom than at the top; see Figure 9.8.1. If the vertical temperature difference AT is small, 
then there is a linear variation of temperature with altitude but no significant motion of the 
fluid layer. However, if A T is large enough, then the warmer air rises, displacing the cooler 
air above it, and a steady convective motion results. If the temperature difference increases 
further, then eventually the steady convective flow breaks up and a more complex and turbulent 
motion ensues. 


Temperature 
A a AT 
Cooler N Y) 
AUN AN A 
Warmer 


A layer of fluid heated from below. 


While investigating this phenomenon, Edward N. Lorenz!’ was led (by a process too 


involved to describe here) to the nonlinear autonomous three-dimensional system 


d d d 
e essc), € y — xz, a bz + xy. (1) 


Edward N. Lorenz (1917-2008), an American meteorologist, received his Ph.D. from the Massachusetts Institute 
of Technology in 1948 and was associated with that institution throughout his scientific career. His first studies of the 
system (1) appeared in a famous 1963 paper (cited in the References) dealing with the stability of fluid flows in the 
atmosphere. 


9.8 Chaos and Strange Attractors: The Lorenz Equations 


The system of differential equations (1) is now commonly referred to as the Lorenz 
equations.!® Observe that the first equation is linear, but that the second and third equations 
involve quadratic nonlinearities. However, except for being a system of three equations, 
superficially the Lorenz equations appear no more complicated than the competing species 
and predator-prey equations discussed in Sections 9.4 and 9.5. 

The variable x in equations (1) is related to the intensity of the fluid motion, while 
the variables y and z are related to the temperature variations in the horizontal and vertical 
directions. The Lorenz equations also involve three parameters a, r, and b, all of which are 
real and positive. The parameters o and b depend on the material and geometric properties of 
the fluid layer. For the earth's atmosphere, reasonable values of these parameters are c. = 10 
and b — - they will be assigned these values in much of what follows in this section. The 
parameter r, on the other hand, is proportional to the temperature difference A T, and our 
purpose is to investigate how the nature of the solutions of equations (1) changes with r. 

Before proceeding further, we note that for an autonomous system of three first-order 
equations 


dx dy dz 
a e F oN so G s Vs $5 Tu Um H os > 2 
di (x, y, z) di (x, y. z) d (x, y, z) (2) 
the Jacobian matrix J is defined by 
J = J[F, G, Hyx, y, z) = Gy Gy G: . (3) 
H, Hy, H. 
Thus, for the Lorenz equations (1), the Jacobian matrix is 
—g o 0 
J-|r-z -1 -x |. (4) 
y x —b 


The first step in analyzing the Lorenz equations is to locate the critical points by solving 


the (nonlinear) algebraic system 
ox-—oy=0, 


rx —y—xz=0, (5) 
—bz + xy =0. 
From the first equation we have y = x. Then, eliminating y from the second and 


third equations, we obtain 
x(r—1-—z)=0, 


6 
—bz + x? — 0. o 


One way to satisfy the first equation in (6) is to choose x = 0. Then it follows that y = 0 and, 
from the second equation in (6), z = 0. Alternatively, we can satisfy the first equation in (6) 
by choosing z = r — 1. Then the second equation in (6) requires that x = +,/b(r — 1) and 
then y = +,/b(r — 1) also. Observe that these expressions for x and y are real only when 
r > 1. Thus (0, 0, 0), which we will denote by P», is a critical point for all values of r, and 
it is the only critical point for r < 1. However, when r > 1, there are also two other critical 
points, namely, (/b(r — 1), vb(r — 1),r — 1) and ( v/b(r 1), v/b(r 1),r- 1). 
We will denote the latter two points by P, and P», respectively. Note that all three critical 
points coincide when r = 1. As r increases through the value 1, the critical point Po at the 
origin bifurcates,!° and, in addition to Po, the critical points P, and P) come into existence. 
Next we will determine the local behavior of solutions in the neighborhood of each critical 
point. Although much of the following analysis can be carried out for arbitrary values of o 


13 A very thorough treatment of the Lorenz equations appears in the book by Sparrow listed in the References at the 
end of this chapter. 

I? Because this transition is from one critical point for r < 1 to three critical points for r > 1, technically, we should 
say that at r = | the critical point Pp trifurcates into Py, P;, and P5. But, it is customary to use bifurcate whenever 
there is a change in the number or classification of the critical points of a problem. 
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8 
and b, we will simplify our work by using the values c = 10 and b = 7 Near the origin (the 
critical point Po), the approximating linear system is 


' /a0 10  0Y/x 


y| = r -l 0 y|. (7) 
z 0 0 —8/3/ \z 
The eigenvalues? are determined from the equation 
—10— A 10 0 8 
r —1—4A 0 = ($3 )tao xa +A) = 100 
0 0 —8/3—A 
(8) 
8 2 
--($*3)0 +114 - 10( — 1)) Es 
Therefore, 
8 —11 — 4/81 + 40 —11+ 4/81 + 40 
Apes A= 2 ka E and À3 = T 2 T r (9) 


Note that all three eigenvalues are negative for r « 1; for example, when r — 1/2, the 
eigenvalues are Aj = — 8/3, A5 — — 10.52494, and A4— — 0.47506. Hence the origin 
is asymptotically stable for 0 < r « 1 both for the linear approximation (8) and for the 
original system (1). However, A5 changes sign when r = 1 and is positive for r > 1. The 
value r — 1 corresponds to the initiation of convective flow in the physical problem described 
earlier. When r 1 the origin is an unstable equilibrium solution for both the linearized 
problem and for the nonlinear Lorenz equations. All solutions to the linearized system 
starting near the origin tend to grow, except for those lying precisely in the plane determined 
by the eigenvectors associated with A, and A». For the nonlinear system (1), the same is true 
except that the exceptional solutions are those lying in a certain surface tangent to the plane 
determined by the eigenvectors associated with A, and A» at the origin. 

The second critical point is Pj = (4/8(r — 1)/3, \/8(r — 1)/3, r — 1) forr > 1. To 
consider the neighborhood of this critical point, let u, v, and w be the perturbations from the 
critical point in the x-, y-, and z-directions, respectively. The approximating linear system is 


—10 10 0 


" 1 zu apani” 
v| = v3 v |. (10) 
w 8 8 8 w 

Jse -» še- E 


The eigenvalues of the coefficient matrix of equation (10) are determined from the equation 


33? + 413? + 8(r 4-10) A + 160(r — 1) = 0, (11) 


which is obtained by straightforward algebraic steps that are omitted here. (See Problem 2.) 
The solutions of equation (11) depend on r in the following way: 


For 1 < r < rı =1.3456 there are three distinct negative real eigenvalues. 


For ri < r < ry =24.737 there are one negative real eigenvalue and two complex 
eigenvalues with negative real part. 


For r > rz, there are one negative real eigenvalue and two complex eigenvalues with 
positive real part. 


The same results are obtained for the critical point P». Thus there are several different 
situations. 

For 0 < r < 1 the only critical point is Pp and it is asymptotically stable. All solutions 
approach this point (the origin) as £ > oo. 


20Since r appears as a parameter in the Lorenz equations, we will use À to denote the eigenvalues. 
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For 1 < r < rj, the critical points P; and P, are asymptotically stable and Po is unstable. 
All nearby solutions approach one or the other of the points P, and P» exponentially. 

Forr,; < r < m, the critical points P; and P, are asymptotically stable and Pp is unstable. 
All nearby solutions approach one or the other of the points P, and P»; most of them spiral 
inward to the critical point. 

Forr > ry, all three critical points are unstable. Most solutions near P, or P spiral away 
from the critical point. 

However, this is by no means the end of the story. Let us consider solutions for r somewhat 
greater than r2. In this case Po has one positive eigenvalue, and each of P, and P; has a pair of 
complex eigenvalues with positive real part. A trajectory can approach any one of the critical 
points, but only when it starts on one of several highly restricted paths. The slightest deviation 
from these paths causes the trajectory to depart from the critical point. Since none of the critical 
points are stable, one might expect that most trajectories would approach infinity for large f. 
However, it can be shown that all solutions remain bounded as t — oo; see Problem 5. In fact, 
it can be shown that all solutions ultimately approach a certain limiting set of points that has 
zero volume. Indeed, this is true not only for r > rz but for all positive values of r. 

A plot of computed values of x, y, and z versus f for a typical solution with r > r» 
is shown in Figure 9.8.2. Note that the x and y solutions oscillate back and forth between 
positive and negative values in a rather erratic manner. Indeed, the graphs of x versus f and y 
versus ¢ resemble a random vibration, although the Lorenz equations are entirely deterministic 
and the solution is completely determined by the initial conditions. Nevertheless, the solution 
also exhibits a certain regularity in that the frequency and amplitude of the oscillations are 
essentially constant in time. 


A plot of x (solid blue), y (dotted red), and z (dashed green) 
versus f for the Lorenz equations (1) with r = 28; the initial point is (5, 5, 5). 


The solutions of the Lorenz equations are also extremely sensitive to perturbations 
in the initial conditions. Figure 9.8.3 shows the graphs of computed values of x versus 
t for the two solutions whose initial points are (5, 5, 5) and (5.01, 5, 5). The solid blue 
graph is the same as the one in Figure 9.8.2, while the solid black graph starts at a 
nearby point. The two solutions remain close until f is near 10, after which they are quite 
different and, indeed, seem to have no relation to each other. (The y and z components 
of the solution exhibit similar differences starting a little after t= 10.) It was this property 
that particularly attracted the attention of Lorenz in his original study of these equations, and 
it caused him to conclude that accurate detailed long-range weather predictions are probably 
not possible. 

The attracting set in this case, although of zero volume, has a rather complicated structure 
and is called a strange attractor. The term chaotic has come into general use to describe 
systems of differential equations with the property that solutions with similar initial conditions 
can have very different solutions, such as the ones in Figure 9.8.3. 

To determine how and when the strange attractor is created, it is illuminating to investigate 
solutions for smaller values of r. For r — 21, solutions starting at three different initial points 
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Plots of x versus t for two initially nearby solutions of Lorenz equations 
with r — 28; the initial point is (5, 5, 5) for the dashed blue curve and is (5.01, 5, 5) for 
the solid black curve. 


are shown in Figure 9.8.4. For the initial point (3, 8, 0), the solution begins to converge to the 
point P5 = (7.303, 7.303, 20) almost at once; see Figure 9.8.4a. For the second initial point 
(5, 5, 5), there is a fairly short interval of transient behavior, after which the solution converges 
to Pi = (—7.303, —7.303, 20); see Figure 9.8.45. However, as shown in Figure 9.8.4c, for 
the third initial point (5, 5, 10), there is a much longer interval of transient chaotic behavior 
before the solution eventually converges to P5. As r increases, the duration of the chaotic 


transient behavior also increases. When r = r} = 24.06, the chaotic transients appear to 
continue indefinitely, and the strange attractor comes into being. 
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(c) 


Plots of x (solid blue), y (dotted red), and z (dashed green) versus t for solutions of Lorenz 
equations with r — 21 with three different initial conditions. (a) Initial point is (3, 8, 0); solution converges 
to P». (b) Initial point is (5, 5, 5); solution converges to P. (c) Initial point is (5, 5, 10); solution converges 
(eventually) to P5. 


9.8 Chaos and Strange Attractors: The Lorenz Equations 


-40 + 


AUIE: Projection in the xy-plane of a trajectory of the Lorenz 
equations (with r — 28) with initial condition (5, 5, 5). 


-40 — 


AUETA Projection in the xz-plane of a trajectory of the Lorenz 
equations (with r — 28) with initial condition (5, 5, 5). 


We can also show the trajectories of the Lorenz equations in the three-dimensional phase 
space, or at least projections of them in various planes. Figures 9.8.5 and 9.8.6 show projections 
in the xy- and xz-planes, respectively, of the trajectory starting at (5, 5, 5). Observe that while 
the graphs in these figures appear to cross over themselves repeatedly, the general uniqueness 
theorem ensures this cannot be true for the actual trajectories in three-dimensional space. The 
apparent crossings are due wholly to the two-dimensional character of the figures. 

The sensitivity of solutions to perturbations of the initial data also has implications for 
numerical computations, such as those reported here. Different step sizes, different numerical 
algorithms, and even the execution of the same algorithm on different machines can introduce 
small differences in the computed solution, which can eventually lead to large deviations. For 
example, the exact sequence of positive and negative loops in the calculated solution depends 
strongly on the precise numerical algorithm and its implementation, as well as on the initial 
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conditions. However, the general appearance of the solution and the structure of the attracting 
set are independent of all these factors. 

Solutions of the Lorenz equations for other parameter ranges exhibit other interesting 
types of behavior. For example, for certain values of r greater than r2, intermittent bursts of 
chaotic behavior separate long intervals of apparently steady periodic oscillation. For other 
ranges of r, solutions show the period-doubling property that we saw in Section 2.9 for the 
logistic difference equation. Some of these features are taken up in the problems. 

Since about 1975, the Lorenz equations and other higher-dimensional autonomous 
systems have been studied intensively, and this is one of the most active areas of current 
mathematical research. Chaotic behavior of solutions appears to be much more common 
than was suspected at first, and many questions remain unanswered. Some of these are 
mathematical in nature, while others relate to the physical applications or interpretations of 


solutions. 


Problems 


Problems 1 through 3 ask you to fill in some of the details of the 
analysis of the Lorenz equations in this section. 


1. a. Show that the eigenvalues of the linear system (8), valid near 
the origin, are given by equation (9). 
b. Determine the corresponding eigenvectors. 
c. Determine the eigenvalues and eigenvectors of the system (7) 
in the case where r — 28. 


2. a. Show that the linear approximation valid near the critical 
point P, is given by equation (10). 
b. Show that the eigenvalues of the system (10) satisfy 
equation (11). 
@ c. Forr = 28, solve equation (11) and thereby determine the 
eigenvalues of the system (10). 


3. Qa. By solving equation (11) numerically, show that the real 
part of the complex roots changes sign when r = 24.737. 
b. Show that a cubic polynomial x? + Ax? + Bx + C has one 
real zero and two pure imaginary zeros only if AB — C. 


C. Apply the result of part b to equation (11) to show that the 
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real part of the complex roots changes sign when r — Ty 


4. Use the Liapunov function V(x, y, z) = x? + oy? + oz? to 
show that the origin is a globally asymptotically stable critical point 
for the Lorenz equations (1) ifr < 1. 


5. Consider the ellipsoid 
V(x, yz) = rx? + o y? +o(z- 2r)? =c> 0. 


a. Calculate — along trajectories of the Lorenz equations (1). 
b. Determine a sufficient condition on c so that every trajectory 
crossing V(x, y, z) = c is directed inward. 
C. Evaluate the condition found in part b for the case c = 10, 
b —8/3,r —28. 
Problems 6 through 10 suggest some further investigations of the 
Lorenz equations. 


@ 6. Forr = 28, plot x versus t for the cases shown in Figures 
9.8.2 and 9.8.3. Do your graphs agree with those shown in the figures? 
Recall the discussion of numerical computation in the text. 


(9 7. Forr = 28, plot the projections in the xy- and xz-planes, 
respectively, of the trajectory starting at the point (5, 5, 5). Do the 
graphs agree with those in Figures 9.8.5 and 9.8.6? 


8. Q a. Forr — 21, plot x versus t for the solutions starting at 
initial points (3, 8, 0), (5, 5, 5), and (5, 5, 10). Use a t-interval 


of at least 0 < t < 30. Compare your graphs with those in Figure 
9.8.4. 


O b. Repeat part a for r = 22, r = 23, and r = 24. Increase 
the t interval as necessary so that you can determine when each 
solution begins to converge to one of the critical points. Record 
the approximate duration of the chaotic transient in each case. 
Describe how this quantity depends on the value of r. 

Q c. Repeat parts a and b for values of r slightly greater than 
24. Try to estimate the value of r for which the duration of the 
chaotic transient approaches infinity. 


9. For certain r intervals, or windows, the Lorenz equations exhibit 
a period-doubling property similar to that of the logistic difference 
equation discussed in Section 2.9. Careful calculations may reveal this 
phenomenon. 

Q a. One period-doubling window contains the value r — 100. 
Let r = 100 and plot the trajectory starting at (5, 5, 5) or some 
other initial point of your choice. Does the solution appear to be 
periodic? What is the period? 

Q b. Repeat part a for slightly smaller values of r. When r = 
99.98, you may be able to observe that the period of the solution 
doubles. Try to observe this result by performing calculations 
with nearby values of r. 

O c. As r decreases further, the period of the solution doubles 
repeatedly. The next period doubling occurs at about r = 99.629. 
Try to observe this by plotting trajectories for nearby values 
ofr. 


10. Now consider values of r slightly larger than those in Problem 9. 
@ a. Plot trajectories of the Lorenz equations for values of r 
between 100 and 100.78. You should observe a steady periodic 
solution for this range of r values. 

@ b. Plot trajectories for values of r between 100.78 and 100.8. 
Determine as best you can how and when the periodic trajectory 
breaks up. 


The Róssler?! System. The system 

x =-y—z, y-xtay Z=b+xUx-c), — (02 
where a, b, and c are positive parameters, is known as the Róssler 
system??. It is a relatively simple system, consisting of two linear 


210tto E. Róssler (1940—), a German medical doctor and biochemist, was a 
student and later became a faculty member at the University of Tübingen. The 
equations named for him first appeared in a paper he published in 1976. 


Z$ee the book by Strogatz for a more extensive discussion and further 
references. 


equations and a third equation with a single quadratic nonlinearity. 
In Problems 11 through 15, we ask you to carry out some numeric 
investigations of this system, with the goal of exploring its period- 
doubling property. To simplify matters, seta = 1/4, set b = 1, and 
let c > 0 remain arbitrary. 


11. a. Show that there are no critical points when c < 1/ 4/2, 
there is one critical point for c — 1/ V, and there are two 
critical points when c > 1/ y. 

b. Find the critical point(s) and determine the eigenvalues of the 
associated Jacobian matrix when c — 1/ JA and when c = 1. 
Q c. How do you think trajectories of the system will behave for 
c — 1? Plot the trajectory starting at the origin. Does it behave 
the way that you expected? 

O d. Choose one or two other initial points, and plot the 
corresponding trajectories. Do these plots agree with your 
expectations? 

12. a. Letc — 1.3. Find the critical points and the corresponding 
eigenvalues. What conclusions, if any, can you draw from this 
information? 

Q b. Plot the trajectory starting at the origin. What is the 
limiting behavior of this trajectory? To see the limiting behavior 
clearly, you may wish to choose a r-interval for your plot so that 
the initial transients are eliminated. 

Q c. Choose one or two other initial points, and plot the 
corresponding trajectories. Are the limiting behavior(s) the same 
as in part b? 

Q d. Observe that there is a limit cycle whose basin of attraction 
is fairly large (although not all of xyz-space). Draw a plot of x, 
y, Or z versus f, and estimate the period T; of motion around the 
limit cycle. 
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CHAPTER 10 


Partial Differential 
Equations and 
Fourier Series 


In many important physical problems there are two or more independent variables, so 
the corresponding mathematical models involve partial, rather than ordinary, differential 
equations. This chapter treats one important method for solving partial differential equations, a 
method known as separation of variables. Its essential feature is the replacement of the partial 
differential equation by a set of ordinary differential equations, which must be solved subject 
to given initial or boundary conditions. The first section of this chapter deals with some basic 
properties of boundary value problems for ordinary differential equations. The desired solution 
of the partial differential equation is expressed as a sum, usually an infinite series, formed from 
solutions of the ordinary differential equations. In many cases we ultimately need to deal with 
a series of sines and/or cosines, so part of the chapter is devoted to a discussion of such series, 
which are known as Fourier series. With the necessary mathematical background in place, we 
then illustrate the use of separation of variables in a variety of problems arising from heat 
conduction, wave propagation, and potential theory. 


10.1 Two-Point Boundary Value Problems 


Up to this point in the book we have dealt with initial value problems, consisting of a 
differential equation together with suitable initial conditions at a given point. A typical 
example, which was discussed at length in Chapter 3, is the differential equation 


y" + p(t)y’+q(t)y = a(t), (1) 
with the initial conditions 
y(to.) = yo. y' (to) = yo. Q) 


Physical applications often lead to another type of problem, one in which the value of the 
dependent variable y or its derivative is specified at two different points. Such conditions are 
called boundary conditions to distinguish them from initial conditions that specify the value 
of y and y' at the same point. A differential equation and suitable boundary conditions form a 
two-point boundary value problem. A typical example is the differential equation 


Y" t p) y' + q(x) y = g(x) (3) 
with the boundary conditions 
yla) =y» (b) =y. (4) 


The natural occurrence of boundary value problems usually involves a space coordinate as 
the independent variable, so we have used x rather than ¢ in equations (3) and (4). To solve 
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the boundary value problem (3), (4), we need to find a function y = y(x) that satisfies the 
differential equation (3) in the interval a < x < 0 and that takes on the specified values 
yo and y, at the endpoints of the interval. Usually, we first seek the general solution of the 
differential equation and then use the boundary conditions to determine the values of the 
arbitrary constants. 

Boundary value problems can also be posed for nonlinear differential equations, but we 
will restrict ourselves to a consideration of linear equations only. It is important to determine 
whether a linear boundary value problem is homogeneous or nonhomogeneous. If the function 
g has the value zero for each x, and if the boundary values yo and y; are also zero, then the 
problem (3), (4) is called homogeneous. Otherwise, the problem is nonhomogeneous. 

Although the initial value problem (1), (2) and the boundary value problem (3), (4) may 
superficially appear to be quite similar, they actually differ in some very important ways. 
Under mild conditions on the coefficients, initial value problems are certain to have a unique 
solution. On the other hand, boundary value problems under similar conditions may have 
a unique solution, but they may also have no solution or, in some cases, infinitely many 
solutions. In this respect, linear boundary value problems resemble systems of linear algebraic 
equations. 

Let us recall some facts (see Section 7.3) about the system 


Ax — b, (5) 


where A is a given n x n matrix, b is a given n x 1 vector, and x is an n x 1 vector to 
be determined. If A is nonsingular, then the system (5) has a unique solution for any b. 
However, if A is singular, then the system (5) has no solution unless b satisfies a certain 
additional condition, in which case the system has infinitely many solutions. Now consider 
the corresponding homogeneous system 


Ax = 0, (6) 


obtained from the system (5) when b — 0. The homogeneous system (6) always has 
the solution x — 0, which is often referred to as the trivial solution. If A is nonsingular, 
then this is the only solution, but if A is singular, then there are infinitely many nonzero, 
or nontrivial, solutions. Note that it is impossible for the homogeneous system to have no 
solution. These results can also be stated in the following way: the nonhomogeneous system 
(5) has a unique solution if and only if the homogeneous system (6) has only the solution 
x = 0, and the nonhomogeneous system (5) has either no solution or infinitely many solutions 
if and only if the homogeneous system (6) has nonzero solutions. 

We now turn to some examples of linear boundary value problems that illustrate very 
similar behavior. A more general discussion of linear boundary value problems appears in 
Chapter 11. 


EXAMPLE 1 


Solve the boundary value problem 
y"+2y=0, y(0)=1, y(t) =0. (7) 


Solution: 


The general solution of the differential equation (7) is 


y= ci cos( 2x) + czsin( v2 ) (8) 


The first boundary condition requires that c} = 1. The second boundary condition implies that 
CI cos( 2 2 +c sin(/27) = 0, soc) = — cot (2) = —0.2762. Thus the solution of the 


boundary value problem (7) is 
y = cos( 2x) - cot( V27) sin( /2 ) (9) 


This example illustrates the case of a nonhomogeneous boundary value problem with a unique 
solution. 


10.1 Two-Point Boundary Value Problems 


EXAMPLE 2 


Solve the boundary value problem 
y”’+y=0, y0 21, y(1) =a, (10) 


where a is a given number. 


Solution: 


The general solution of this differential equation is 
y = cı cosx +c) sin x, (11) 


and from the first boundary condition we find that c; = 1. The second boundary condition now 
requires that —c, = a. These two conditions on c, are incompatible if a 4 —1, so the problem has 
no solution in that case. However, if a — —1, then both boundary conditions are satisfied provided 
that c; = 1, regardless of the value of c». In this case there are infinitely many solutions of the form 


y — cosx + cosinx, (12) 


where c) remains arbitrary. This example illustrates that a nonhomogeneous boundary value problem 
may have no solution— and also that under special circumstances it may have infinitely many 
solutions. 


The nonhomogeneous boundary value problem (3), (4) has a corresponding homogeneous 
problem consisting of the differential equation 


y" + p(x)y' t q()y 20 (13) 
and the boundary conditions 
y(a) 20, y(8)-0. (14) 


Observe that this problem has the solution y = 0 for all x, regardless of the coefficients p(x) 
and q(x). This (trivial) solution is rarely of interest. What we usually want to know is whether 
the problem has other, nonzero solutions. Consider the following two examples. 


EXAMPLE 3 


Solve the boundary value problem 


y"+2y=0, y(0) 20, y(7) =0. (45) 


Solution: 


The general solution of the differential equation is again given by equation (8), 
yc eos ( 2x) +c) sin(/2x) ; 


The first boundary condition requires that c; = 0, and the second boundary condition leads to 
c5sin| 4/27 | = 0. Since sin| 4/2 r) z 0, it follows that c; = 0 also. Consequently, y = 0 for 


all x is the only solution of the problem (15). This example illustrates that a homogeneous boundary 
value problem may have only the trivial solution y = 0. 


EXAMPLE 4 


Solve the boundary value problem 


y"+y=0, y(0 =0, y(r) =0. (16) 
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Solution: 


The general solution is given by equation (11), 
y = cı cosx +c) sinx, 


and the first boundary condition requires that c; = 0. Since sin m = 0, the second boundary condition 
is also satisfied when c; = 0, regardless of the value of c2. Thus the solution of problem (16) is 
y = c sinx, where cy remains arbitrary. This example illustrates that a homogeneous boundary 
value problem may have infinitely many solutions. 


Examples 1 through 4 illustrate (but, of course, do not prove) that there is the same 
relationship between homogeneous and nonhomogeneous linear boundary value problems 
as there is between homogeneous and nonhomogeneous linear algebraic systems. The 
nonhomogeneous boundary value problem (Example 1) has a unique solution, and the 
corresponding homogeneous problem (Example 3) has only the trivial solution. Further, 
the nonhomogeneous problem (Example 2) has either no solution or infinitely many, and the 
corresponding homogeneous problem (Example 4) has nontrivial solutions. 


Eigenvalue Problems. Recall the matrix equation 
Ax = AX (17) 


that we discussed in Section 7.3. Equation (17) has the solution x = 0 for every value of 
A, but for certain values of A, called eigenvalues, there are also nonzero solutions, called 
eigenvectors. The situation is similar for boundary value problems. 

Consider the problem consisting of the differential equation 


y" + ry =0, (18) 
together with the boundary conditions 
y(0) =0, y(7)-0. (19) 


Observe that the boundary value problem (18), (19) is the same as the problems in Examples 3 
and 4 if \ = 2 and A = 1, respectively. Recalling the results of these examples, we note that 
for A = 2, equations (18), (19) have only the trivial solution y = 0, while for A = 1, the 
problem (18), (19) has other, nontrivial solutions. By extension of the terminology associated 
with equation (17), the values of A for which nontrivial solutions of (18), (19) occur are called 
eigenvalues, and the nontrivial solutions themselves are called eigenfunctions. Restating the 
results of Examples 3 and 4, we have found that A = 1 is an eigenvalue of the problem 
(18), (19) and that A = 2 is not. Further, any nonzero multiple of sin x is an eigenfunction 
corresponding to the eigenvalue \ = 1. 

Let us now turn to the problem of finding other eigenvalues and eigenfunctions of the 
problem (18), (19). We need to consider separately the cases A > 0, A = 0, and A < 0, 
because the form of the solution of equation (18) is different in each of these cases. 


Casel: A — 0. To avoid the frequent appearance of radical signs, it is convenient in this case 
to let A = u? and to rewrite equation (18) as 


y" -- u?^y =0. Q0) 

The characteristic polynomial equation for equation (20) is r? + u? = 0 with roots 
r = +i p, so the general solution is 

y = c cos( ux) + c sin( ux). (21) 


Note that j; is nonzero (since \ > 0) and there is no loss of generality if we also assume that 
u is positive. The first boundary condition requires that c; = 0, and then the second boundary 
condition reduces to 


c5sin(jum ) = 0. (22) 


We are seeking nontrivial solutions, so we must require that c; # 0. Consequently, sin ju 7 
must be zero, and our task is to choose u so that this will occur. We know that the sine function 


10.1 Two-Point Boundary Value Problems 


has the value zero at every integer multiple of 7, so we can choose u to be any (positive) 
integer. The corresponding values of A are the squares of the positive integers, so we have 
determined that 


Aji21, A524, 2529, ... Àn 2n), ... Q3) 


are eigenvalues of the problem (18), (19). The eigenfunctions are given by equation (21) 
with c} = O0, so they are just multiples of the functions sinnx for n = 1,2,3, .... 
Observe that the constant c) in equation (21) is never determined, so eigenfunctions are 
determined only up to an arbitrary multiplicative constant [just as are the eigenvectors of the 
matrix problem (17)]. We will usually choose the multiplicative constant to be 1 and write the 
eigenfunctions as 


yi(x) = sinx, y(x) = sin(2x),..., Ya(x) = sin(nx), ..., (24) 


remembering that multiples of these functions are also eigenfunctions. 


Case Il: A < 0. In this case we let \ = —44?, so that equation (18) becomes 
y" — u?y =0. Q5) 
The characteristic equation for equation (25) is r? — u? = 0 with roots r = +y, so its 


general solution can be written as 
y = cı cosh( ux) + co sinh( ux). (26) 


We have chosen the hyperbolic functions cosh ux and sinh ux, rather than the exponential 
functions exp(ux) and exp(—jx), as a fundamental set of solutions for convenience in 
applying the boundary conditions. The first boundary condition requires that c; = 0; then 
the second boundary condition gives c5 sinh( um) = 0. Since sinh( ur) = O if and only if 
uT = 0, the fact that u ZZ O means that sinh( ur) Z 0, and therefore we must have c; = 0. 
Consequently, y = 0 and there are no nontrivial solutions for A < 0. In other words, the 
problem (18), (19) has no negative eigenvalues. 


Case lll: A — 0. Now equation (18) becomes 
y” =0, (27) 
and its general solution is 
y = cx + c2. (28) 


The first of the boundary conditions (19) forces c; = 0; then the second boundary condition 
reduces to cım = 0, so c, = 0. There is only the trivial solution y = 0 in this case as well. 
That is, A = 0 is not an eigenvalue of problem (18), (19). 

To summarize our results: we have shown that the two-point boundary value problem 
(18), (19) has an infinite sequence of positive eigenvalues A, = n? forn — 1,2,3, ... and 
that the corresponding eigenfunctions are proportional to sin(nx). Further, there are no other 
real eigenvalues. There remains the possibility that there might be some complex eigenvalues; 
recall that a matrix with real elements may very well have complex eigenvalues. In Problem 23 
we outline an argument showing that the particular problem (18), (19) cannot have complex 
eigenvalues. Later, in Section 11.2, we discuss an important class of boundary value problems 
that includes (18), (19). One of the useful properties of this class of problems is that all their 
eigenvalues are real. 

In later sections of this chapter, we will often encounter the problem 


y”+rAy=0, y(0)=0, y(L)-0, (29) 


which differs from the problem (18), (19) only in that the second boundary condition is 
imposed at an arbitrary point x = L rather than at x = 7. The solution process for À > 0 is 
exactly the same as before, up to the step where the second boundary condition is applied. For 
problem (29) this condition requires that 


csin( uL) = 0 (30) 


rather than equation (22), as in the former case. Consequently, u L must be an integer multiple 
of 7, so u = nz / L, where n is a positive integer. Hence the eigenvalues and eigenfunctions 
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of problem (29) are given by 


nn? 


mets 


. (nn x 
. xs sin( 75), n=1, 2,3, ll. (31) 


As usual, the eigenfunctions y,(x) are determined only up to an arbitrary multiplicative 
constant. In the same way as for the problem (18), (19), you can show that the problem (29) 
has no eigenvalues or eigenfunctions other than those in equation (31). 

The problems following this section explore to some extent the effect of different 
boundary conditions on the eigenvalues and eigenfunctions. A more systematic discussion 
of two-point boundary and eigenvalue problems appears in Chapter 11. 


Problems 


In each of Problems 1 through 13, either solve the given boundary 
value problem or else show that it has no solution. (Problems 11 
through 13 involve Euler equations; see Section 5.4.) 


1l. y’+y=0, y(0 20, y (1) 251 


2. y"+2y=0, y(0 =1, y(r) =0 

3. y"+y=0, y(0) =0, y(L) 20 

4. y"+y=0, y(0)=1, yxL)-0 

5. y”+y=x, y(0 =0, y(1) =0 

6. y"+2y=x, y(0)=0, y(7) =0 

7. y"+4y=cosx, y(0) 20, y(7) 20 

8. y”+4y =sinx, y(0) 20, y(7) 20 

9. y”+4y=cosx, y(0) 20, y(mr) 20 
10. y”+3y =cosx, y(0) 20, y'(1) 20 
11. x2y" -2xy' -2y 20, y(1) 2-1, y(22 51 
12. x?y" E 3xy'& y 2 x?, y(1) 20, y(e) 20 


13. x?y"+5xy’+(44+77)y 2Inx, y(1) 20, y(e) 20 
In each of Problems 14 through 20, find the eigenvalues and 


eigenfunctions of the given boundary value problem. Assume that all 
eigenvalues are real. 


14. y" Ay-20, y(0 20, y(r) 20 

15. y’+Ay=0, y(0) =0, y(7)20 

16. y”+Ay=0, y'(0) 20, y'(7) 20 

17. y"+Ay=0, y(0) 20, y(L) 20 

18. y”+Ay=0, y(0) 20, y'(L) 20 

19. y"—Ay 20, y(0 20, y(L) 20 

20. x?y" —- xy - Ay 20, y(1) 20, y(L) 20, L>1 

21. Theaxially symmetric laminar flow of a viscous incompressible 
fluid through a long straight tube of circular cross section under a 
constant axial pressure gradient is known as Poiseuille! flow. The axial 


velocity w is a function of the radial variable r only and satisfies the 
boundary value problem 


1 G 
w"-pF—w'--—-—, w(R)=0, w(r) bounded for 0 < r < R, 
u 


‘Jean Louis Marie Poiseuille (1797-1869) was a French physician who was 
also trained in mathematics and physics. He was particularly interested in the 
flow of blood and published his first paper on the subject in 1840. 


where R is the radius of the tube, G is the pressure gradient, and p is 
the coefficient of viscosity of the fluid. 
a. Find the velocity profile w(r). 
b. By integrating w(r) over a cross section, show that the total 
flow rate Q is given by 


v R^G 


ere 


Since Q, R, and G can be measured, this result provides a 
practical way to determine the viscosity u. 

c. Suppose that R is reduced to 3/4 of its original value. What 
is the corresponding reduction in Q? This result has implications 
for blood flow through arteries constricted by plaque. 


22. Consider a horizontal metal beam of length L subject to a 
vertical load f(x) per unit length. The resulting vertical displacement 
in the beam y(x) satisfies the differential equation 
d*y 
EI dut = f(x), 

where E is Young's modulus and 7 is the moment of inertia of the 
cross section about an axis through the centroid perpendicular to 
the xy-plane. Suppose that f(x)/EI is a constant k. For each of 
the boundary conditions given below, solve for the displacement y(x), 
and plot y versus x in the case that L = 1 and k = — 1. 

a. Simply supported at both ends: 

y(0) = y"(0) = y(L) = y'(L) 20 

b. Clamped at both ends: y(0) = y'(0) = y(L) = y'(L) =0 

c. Clamped at x = 0, free at x = L: 

y(0) = y'(0) = y"(L) = y"(L) =0 
23. In this problem we outline a proof that the eigenvalues of the 
boundary value problem (18), (19) are real. 


a. Write the solution of equation (18) as y = kı exp(iux) + 
ky exp(—ijx), where 4 = *, and impose the boundary 
conditions (19). Show that nontrivial solutions exist if and 
only if 


ett — ete — 0, (32) 
b. Letu = v + io and use Euler’s relation exp(ivz) = 
cos(v7) +isin(v7) to determine the real and imaginary parts 


of equation (32). 
c. By considering the equations found in part (b), show that v 
is an integer and that c = 0. Consequently, jx is real and so is A. 


10.2 Fourier Series 


Later in this chapter you will find that you can solve many important problems involving 
partial differential equations, provided that you can express a given function as an infinite 
sum of sines and/or cosines. In this and the following two sections we explain in detail how 
this can be done. These trigonometric series are called Fourier series”; they are somewhat 
analogous to Taylor series in that both types of series provide a means of expressing quite 
complicated functions in terms of certain familiar elementary functions. 
We begin with a series of the form 
x 
Jj: () 


eo 
a + Yos cos (T7 ) + bm sin(— 
m=1 

On the set of points where the series (1) converges, it defines a function f, whose value at 
each point is the sum of the series for that value of x. In this case the series (1) is said to be the 
Fourier series for f. Our immediate goals are to determine what functions can be represented 
as a sum of a Fourier series and to find some means of computing the coefficients in the series 
corresponding to a given function. The first term in the series (1) is written as ag /2 rather than 
as ao to simplify a formula for the coefficients that we derive below. Besides their association 
with the method of separation of variables and partial differential equations, Fourier series are 
also useful in various other ways, such as in the analysis of mechanical or electrical systems 
acted on by periodic external forces. 


Periodicity of the Sine and Cosine Functions. To discuss Fourier series, it is necessary 
to develop certain properties of the trigonometric functions sin( mr x / L) and cos( mz x / L), 
where m is a positive integer. The first property is their periodic character. A function f is 
said to be periodic with period T > 0 if the domain of f contains x + T whenever it contains 
x, and if 


fG T) = f(x) (2) 


for every value of x. An example of a periodic function is shown in Figure 10.2.1. It follows 
immediately from the definition that if T is a period of f, then 27 is also a period, and so 
indeed is any integral multiple of T. The smallest value of T for which equation (2) holds 
is called the fundamental period of f. A constant function is a periodic function with an 
arbitrary period but no fundamental period. 


T 
< 2T 


| FIGURE 10.2.1 | A periodic function of period T. 


Jean Baptiste Joseph Fourier (1768-1830) was twice imprisoned during the French Revolution, later served as 
scientific advisor in Napoleon's army in Egypt, and was prefect of the department of Isére (Grenoble) from 1801 
to 1815. He made the first systematic use, although not a completely rigorous investigation, of trigonometric series 
in 1807 and 1811 in his papers on heat conduction. The papers were not published due to objections from the 
referees, principally Lagrange. Although it turned out that Fourier's claim of generality was somewhat too strong, 
his results inspired a flood of important research that has continued to the present day. See Grattan-Guinness or 
Carslaw (Historical Introduction) for a detailed history of Fourier series. 
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If f and g are any two periodic functions with common period T, then any linear 
combination c; f + cog is also periodic with period T. To prove this statement, begin by 
defining F(x) = ci f(x) + c2g(x); then, for any x, 


F(x +T) 2 cf(x +T) + og(x + T) = cı f(x) + eg(x) = F(x). (3) 


Moreover, it can be shown that the sum of any finite number, or even the sum of a convergent 
infinite series, of functions of period T is also periodic with period T. In a similar way, you 
can show that the product fg is periodic with period T. 

In particular, the functions sin(mz x/L) and cos(mmx/L), m = 1,2,3,..., are 
periodic with fundamental period T = 2L/m. To see this, recall that sinx and cosx 
have fundamental period 27 and that sinax and cos ox have fundamental period 27 /a. 
If we choose a = m7 / L, then the period T of sin(mz x/ L) and cos(mz x / L) is given by 
T —2nL/(mm) —2L/m. 

Note also that, since every positive integral multiple of a period is also a period, each of 
the functions sin(mz x / L) and cos(m7 x / L) has the common period 2L. 


Orthogonality ofthe Sine and Cosine Functions. To describe a second essential property 
of the functions sin(m7 x / L) and cos(mz x / L), we generalize the concept of orthogonality 
of vectors (see Section 7.2). The standard inner product (u, v) of two real-valued functions 
u and v on the interval a € x < ( is defined by 


B 
(u,v) =] u(x)v(x)dx. (4) 


The functions u and v are said to be orthogonal on a x x < f if their inner product is 
zero — that is, if 


B 
| u(x)v(x)dx = 0. (5) 


A set of functions is said to be mutually orthogonal if each distinct pair of functions in the 
set is orthogonal. 

The functions sin(m7 x / L) and cos( mz x / L), m = 1,2, ... formamutually orthogonal 
set of functions on the interval —L < x < L. In fact, they satisfy the following orthogonality 
relations: 


7 mm x nn x 0, mn, 
/ cos( ) eos( Jax = (6) 
-L L L L, m=n; 
i mT x nt 
I cos( ) ( Jax =0, allm,n; (1) 
m L 
L —(mmxs . /nmx 0, mn, 
J sin( ) sin( Jax = (8) 
-L L L L, m=n. 


These results can be obtained by direct integration. For example, to derive equation (8), 
note that 


LoCqmmxs . (nnx ] ft (m — n)max (m 4- n)mx 
nc L ) sin( L Jax =f (cos( L ) cos( L ))e 


L 


|. AL pmo d 
^ 2m 


m-—n m+n -L 


=0 


as long as m +n and m — n are not zero. Since m and n are positive, m +n + 0. On the other 
hand, if m — n = 0, then m = n, and the integral must be evaluated in a different way. In this 


case 


—L. 


This establishes equation (8); equations (6) and (7) can be verified by similar computations. 
(See Problem 12.) 


The Euler-Fourier Formulas. Now let us suppose that a series of the form (1) converges 
for all real numbers x on the interval —L < x < L, and let us call its sum f(x): 


f(x) = 2«Y( cos (77) + bin sin(“**)). (9) 


m=1 


The coefficients a,, and b,, can be related to f(x) as a consequence of the orthogonality 
conditions (6), (7), and (8). First multiply equation (9) by cos(nax/L), where n is a fixed 
positive integer (n > 0), and integrate with respect to x from —L to L. Assuming that the 
integration can be legitimately carried out term by term,? we obtain 


(= 
f fees (t mia cos( É Jax 


+ Diam f es cos ) cos (=) ax 
mm x nT xX 
bm ( ) ( ) . 10 
>? E sin L cos L dx (10) 


Keeping in mind that n is fixed whereas m ranges over the positive integers, it follows from 
the orthogonality relations (6) and (7) that the only nonzero term on the right-hand side of 
equation (10) is the one for which m — n in the first summation. Hence 


L nux L 2 /NTX 
T FG) cos (T7 )ax ea, | eos ( L \dx = aL, | ee ae (11) 
p =L 


Equation (11) is valid for all n = 1, 2, 3, ... . To determine ap, we can integrate equation (9) 
from —L to L, obtaining 


a joa f act Som f cos( Daton f sin(™™*) ax 


m=1 


= aL, (12) 


since each integral involving a trigonometric function over an interval of length 2L (one 
period) is zero. Thus 


-if G (Z =0,1,2 13 
ura E I X, = OL 1, 2, 2c. (13) 


By writing the constant term in equation (9) as ao/2, it is possible to compute all the a, from 
equation (13). Otherwise, a separate formula, with an extra factor of 1/2, would have to be 
used for do. 


?This is a nontrivial assumption, because not all convergent series with variable terms can be so integrated. For the 
special case of Fourier series, however, term-by-term integration can always be justified. 
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A similar expression for b, may be obtained by multiplying equation (9) by sin(nz x / L), 
integrating termwise from —L to L, and using the orthogonality relations (7) and (8); thus 


b Ie )sin( 7 )a Laa (14) 
= — x)sin(—-—]dx, n=1,2,3,.... 
RM i 


Equations (13) and (14) are known as the Euler-Fourier formulas for the coefficients in a 
Fourier series. Hence, if the series (9) converges to f(x), and if the series can be integrated 
term-by-term, then the coefficients must be given by equations (13) and (14). 

Note that equations (13) and (14) are explicit formulas for a, and b, in terms of f, and 
that the determination of any particular coefficient is independent of all the other coefficients. 
Of course, the difficulty in evaluating the integrals in equations (13) and (14) depends very 
much on the particular function f involved. 

Note also that the formulas (13) and (14) depend only on the values of f(x) in the interval 
—L < x < L. Since each of the terms in the Fourier series (9) is periodic with period 2L, 
the series converges for all x whenever it converges in —L < x < L, and its sum is also 
a periodic function with period 2L. Hence f(x) is determined for all x by its values in the 
interval - L < x < L. 

It is possible to show (see Problem 27) that if g is periodic with period T, then every 
integral of g over an interval of length T has the same value. If we apply this result to the 
Euler-Fourier formulas (13) and (14), it follows that the interval of integration, —L < x < L, 
can be replaced, if it is more convenient to do so, by any other interval of length 2L. 


EXAMPLE 1 


Assume that there is a Fourier series converging to the function f defined by 


jt es —x, —2<x<0, 
idm X; O<x < 2; 


(15) 
f(x+4) = fa). 


Determine the coefficients in this Fourier series. 


Solution: 


| FIGURE 10.2.2 | The triangular wave in Example 1. 


This function represents a triangular wave (see Figure 10.2.2) and is periodic with period 4. Thus the 


Fourier series has the form 


f(x) = $450 ses (275) + Din a (*55)). (16) 


m=1 


where the coefficients are computed from equations (13) and (14) with L = 2. Substituting for f(x) 
in equation (13) with m = 0, we have 


1 f° [e 
w=5/ Codi; f xdx —141-22. (17) 
-2 0 


For m > 0, equation (13) yields 


| j^ mox 1 ag mmux 
dm = 5 RE E defe A X COS Em dx. 


These integrals can be evaluated through integration by parts, with the result that 


0 
1 2 . [mTx 2 2 mq x 
lü = x sin cos 
2 mT 2 mT 2 


CE m GG] 

= -| =| — ] +| — | eos(mz) + | — | cos(mm) — | — 

2 mT mT mm HET 
4 

= qug mm TD. m=1, 2... 

_ Tuy m odd, (18) 
0, m even. 


Finally, from equation (14), it follows in a similar way that 
b, =0, m=1,2,.... (19) 


By substituting the coefficients from equations (17), (18), and (19) in the series (16), we obtain the 
Fourier series for f': 


8 TX 1 3ux 1 57x 
f(x) =1 = cos 7 + zz cos 3 + 3008 5 Jeca 


| 
| 
"E: 
Me 
S| - 
2 
E CN 
3 
3 
E 
bed 


2 8x 1 (2n — Dax 
zi Sra 5 ) (20) 


EXAMPLE 2 


Let 


0, —-3«x«-1, 
f(xy) =<1, -I«x«l (21) 
0, l<x<3 


and suppose that f(x + 6) = f(x). Graph three periods of y = f(x). Find the coefficients in the 
Fourier series for f. 


Solution: 


The graph of y = f(x) is shown in Figure 10.2.3. 


| FIGURE 10.2.3 | Graph of f(x) in Example 2. 


Note that f(x) is not assigned a value at the points of discontinuity, such as x = —1 and x = 1, 
or at the ends of the period, such as x = —9, x = —3, x = 3, and x = 9. This has no effect on 
the values of the Fourier coefficients, because they result from the evaluation of integrals, and the 
value of an integral is not affected by the value of the integrand at a single point or at a finite number 
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of points. Thus the coefficients are the same regardless of what value, if any, f(x) is assigned at a 
point of discontinuity. 
Since f has period 6, it follows that L = 3 in this problem. Consequently, the Fourier series 


for f has the form 
oo 
f(x) = 2 «Y em) L3 (22) 


n-l 
where the coefficients a, and b, are given by equations (13) and (14) with L = 3. We have 


: "HK )d. : ud : (23) 
a= = x)dx = = x=. 
d ey ala 3 
Similarly, 
1 
1 : nox d 1. nox 2 . {nt L2 (24) 
ne cos| —— |dx = sin = sin ; 2—1,2;::5 
a 3i 3 nt 3 nt 3 
-1 
and 


mV n=1,2,.... (25) 


1 ! : nox 1 nw x 
bm sin dx = cos 
3 Ja 3 nu 3 


Thus the Fourier series for f is 


f(x) ; +5 = sin( 2) e) Q6) 


EXAMPLE 3 


Consider again the function in Example 1 and its Fourier series (20). Investigate the speed with 
which the series converges. In particular, determine how many terms are needed so that the error is 
no greater than 0.01 for all x. 


Solution: 


The m" partial sum in this series 


8 xS 1 (2n — Dax 
Smn(x) —1 cos 27 
m(x) 7 D ( > (27) 
can be used to approximate the function f. The coefficients diminish as (2n — 1)~?, so the series 
converges fairly rapidly. This is borne out by Figure 10.2.4, where the partial sums for m = 1 


(dotted blue) and m = 2 (dashed red) are plotted. To investigate the convergence in more detail, we 
can consider the error e,(x) = f(x) — Sm(x). Figure 10.2.5 shows a plot of |eg( x) | versus x for 
0 <x x 2. Observe that [eg( x) | is greatest at the points x = 0 and x = 2, where the graph of f(x) 
has corners. It is more difficult for the series to approximate the function near these points, resulting 
in a larger error there for a given m. Similar graphs are obtained for other values of m. 


> 
-4 -2 2 4 x 


Partial sums in the Fourier series, equation (20) for 


m = | (dotted blue) and for m = 2 (dashed red), for the triangular wave 
(gold). 
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| FIGURE 10.2.5 | Plot of |eg( x)| versus x for the triangular wave in Example 3. 


Once you realize that the maximum error always occurs at x = 0 or x = 2, you can obtain a 
uniform error bound for each m simply by evaluating |e,,(x)| at one of these points. For example, 
form = 6 we have eg(2) = 0.03370, so |eg(x)| < 0.034 forO < x < 2 and, consequently, for all x. 
Table 10.2.1 shows corresponding data for other values of m; these data are plotted in Figure 10.2.6. 
From this information you can begin to estimate the number of terms that are needed in the series in 
order to achieve a given level of accuracy in the approximation. From Table 10.2.1, we see that the 
maximum error drops below 0.01 somewhere between m — 20 and m — 25. In fact, to guarantee 
that |ej,(x)| < 0.01, we need to choose m = 21, for which the error is e;,(2) = 0.00965. 


TABLE 10.2.1 Values of the Error en (2) 


for the Triangular Wave 

m em(2) 

2) 0.09937 

4 0.05040 

6 0.03370 
10 0.02025 
15 0.01350 
20 0.01013 
25 0.00810 


0.04 


0.02 s 


5 10 15 20 25 m 


| FIGURE 10.2.6 | Plot of e,,(2) versus m for the triangular wave. 
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In this book, Fourier series appear mainly as a means of solving certain problems in partial 
differential equations. However, Fourier series have much wider application in science and 
engineering and, in general, are valuable tools in the investigation of periodic phenomena. A 
basic problem is to resolve an incoming signal into its harmonic components, which amounts 
to constructing its Fourier series representation. In some frequency ranges, the separate terms 
correspond to different colors or to different audible tones. The magnitude of the coefficient 
determines the amplitude of each component. This process is referred to as spectral analysis. 


Problems 


In each of Problems 1 through 8, determine whether the given function 
is periodic. If so, find its fundamental period. 


sin(5x) 
cos( 27 x) 
sinh(2x) 
sin(7 x / L) 
tan(7 x) 


x? 


fü 0 2n—-1zxc«2n, 
"34. mue Und 


(-1)", 2n-1<x «2n, 
8. f(x) = 
1, 2n<x< 2n+1; 


9. If f(x) = —x for —L < x < L, and if f(x +2L) = f(x), 
find a formula for f(x) in the interval L < x < 2L; in the interval 
—3L < x < —2L. 


at deo We qe eI dm 


N= 0,281, SEX, ess 


N= Orsse lee 2) uis 


x+1, -1«x«0, : 
10. If f(x) = x "^ and if f(x +2) = f(x), 


find a formula for f(x) in the interval 1 < x « 2; in the interval 
8«x«9. 

11. If f(x) 2 L—xfor0 < x < 2L, andif f(x +2L) = f(x), 
find a formula for f(x) in the interval —L < x < 0. 

12. Verify equations (6) and (7) in this section by direct integration. 
In each of Problems 13 through 18: 


a. Sketch the graph of the given function for three periods. 
b. Find the Fourier series for the given function. 


13. f(x) 2—x, -LEx«L; f(x+2L) = f(x) 


T _ fl, -Lzx«0 T 

e f(y = 0, eee Je f(x + 2L) = f(x) 

15 _ Jx -T <x <0, EN 

» oie 0, O<x<7; Fx E20) = fo) 
£1, —l<x< 0, 

h f — X, sxs ii fœ +2 = f) 
+L, -L<x<0O, 

17. roo= fi Deed, PEF2E) = fa) 
0, —2<x<-l, 

18. f(x) =<x, -1«x«l, f(x +4) = f(x) 


0, 1<x<2; 
In each of Problems 19 through 24: 
a. Sketch the graph of the given function for three periods. 
b. Find the Fourier series for the given function. 
Q c. Plot the partial sum s,,(x) versus x form = 5, 10, and 20. 
d. Describe how the Fourier series seems to be converging. 


-l, -2zx«0, 
ES roo ={ i, 0<x<2; 
20. f(x)=x, -1<x< 1; f(x+2)= f(x) 
21. f(x) =x?/2, -2- x €2; f(xt+4 = f(x) 


f(x 4) = f(x) 


x +2, —2<x< 0, 

22. eet (eed fG +4) = f(x) 
-$a —2 <x <0, 

23. f(x)= 1 f(x +4) = f(x) 
2x — —x?, 0x 

2 
A. f(xy = DTI px $6) = F(x) 
Pea i: x*(3 — x), 0cxc«3; fe =f 


25. Consider the function f defined in Problem 21, and let 
em(x) = f(x) — Sm( x). 
a. Plot |em(x)| versus x for 0 < x < 2 for several values of m. 
b. Find the smallest value of m for which |e,,(x)| < 0.01 for 
all x. 
26. Consider the function f defined in Problem 24, and let 
em(x) = f(x) = Sm (X): 
Q a. Plot |@m(x)| versus x for 0 < x < 3 for several values 
of m. 
O b. Find the smallest value of m for which |e,,(x)| < 0.1 for 
all x. 
27. Suppose that g is an integrable periodic function with period T. 
a. If0 <a x T, show that 


F a+T 
n g(x)dx — / g(x)dx. 
0 a 


a a+T 

Hint: Show first that | g(x)dx = f g(x)dx. Then, in the 
0 T 

second integral, consider the change of variable s = x — T. 


b. Show that for any value of a, not necessarily in 0 < a < T, 


T a+T 
| g(x)dx = f g(x)dx. 
0 a 


c. Show that for any values of a and b, 


a+T b+T 
| g(x)dx = f g(x)dx. 
a b 


28. If f is differentiable and is periodic with period T , show that f’ 
is also periodic with period T. Determine whether 


F(x) = i ftdt 
0 


is always periodic. 


29. In this problem we indicate certain similarities between three- 
dimensional geometric vectors and Fourier series. 
a. Let vı, V2, and v3 be a set of mutually orthogonal vectors 
in three dimensions, and let u be any three-dimensional vector. 


Show that 
u = a,V, + d2V2 + 43V3, (28) 
where 
Hosen usd gd. (29) 
Vi * Vi 


Show that a; can be interpreted as the projection of u in the 
direction of v; divided by the length of v;. 
b. Define the inner product (u, v) by 


L 
(u, v) E u(x)v(x)dx. (30) 
= 
Also let 
on(x) = cos (777). nes 1,3, 02% 
vu) = sin( 72). n=1, 2... 61) 
10.3 


In the preceding section we showed that if the Fourier series 


oo 
T + » (an cos (77) + bg sin( 


m=1 
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Show that equation (10) can be written in the form 


oo oo 
a 
(f. 4) = 7 (bo On) +) amm. On) +Y bn Wms Øn). 
m=1 m=1 
(32) 
c. Use equation (32) and the corresponding equation for 
Cf. Wn), together with the orthogonality relations, to show that 


an = Chin) =0,1,2,...; 
(dns On) 
Cf. a) 

b= =, = 15 255-28 % 33 
Cn Vn) ” ou 


Note the resemblance between equations (33) and equation (29). The 
functions ¢, and wv, play a role for functions similar to that of 
the orthogonal vectors v;, V2, and v3 in three-dimensional space. The 
coefficients a, and b, can be interpreted as projections of the function 
f onto the base functions ¢, and Y. 

Observe also that any vector in three dimensions can be expressed 
as a linear combination of three mutually orthogonal vectors. In a 
somewhat similar way, any sufficiently smooth function defined on 
—L < x < L can be expressed as a linear combination of the 
mutually orthogonal functions cos(n7 x / L) and sin( n7 x / L), that is, 
as a Fourier series. 


The Fourier Convergence Theorem 


) o 


converges and thereby defines a function f, then f is periodic with period 2L, and the 
coefficients am and b,, are related to f(x) by the Euler-Fourier formulas: 


Ji f(x) cos ("73a 0,1,2 2) 
an = — x) cos( —— )dx, m=0,1,2,...; 
Ld L 


] fL . (MTX 
bim] FO) sin( Jax, m=1, 2 .... 
LJ- 


L 


(3) 


In this section we suppose that a function f is given. If this function is periodic with 
period 2L and integrable on the interval [—L, L], then a set of coefficients a,, and b, can be 
computed from equations (2) and (3), and a series of the form (1) can be formally constructed. 
The question is whether this series converges for each value of x and, if so, whether its sum 
is f(x). Examples have been discovered showing that the Fourier series corresponding to a 
function f may not converge to f(x) and may even diverge. Functions whose Fourier series 
do not converge to the value of the function at isolated points are easily constructed, and 
examples will be presented later in this section. Functions whose Fourier series diverge at one 
or more points are more pathological, and we will not consider them in this book. 

To guarantee convergence of a Fourier series to the function from which its coefficients 
were computed, it is essential to place additional conditions on the function. From a practical 
point of view, such conditions should be broad enough to cover all situations of interest, yet 
simple enough to be easily checked for particular functions. Through the years, several sets of 


conditions have been devised to serve this purpose. 


Before stating a convergence theorem for Fourier series, recall that during the introduction 
of the Laplace transform in Section 6.1, we defined a function f to be piecewise continuous 
on a bounded interval a < x < b if the interval can be partitioned by a finite number of points 


a = xo < X4 < +++ < x, = b so that 
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1. f is continuous on each open subinterval x;_; < x < xj. 


2. f approaches a finite limit as the endpoints of each subinterval are approached from within 
the subinterval. 


The graph of a piecewise continuous function is shown in Figure 10.3.1. 


| FIGURE 10.3.1 | A piecewise continuous function. 


The notation f(c*) is used to denote the limit of f(x) as x — c from the right; 
and, similarly, f(c~) denotes the limit of f(x) as x approaches c from the left. That is, 
f(ct) = lim, , f(x) and f(c7) — lim, ,.— f(x). 

Note that it is not essential that the function even be defined at the partition points x;. For 
example, in the following theorem we assume that f’ is piecewise continuous; but certainly 
f’ does not exist at those points where f itself is discontinuous. It is also not essential that the 
interval be closed; it may also be open, or open at one end and closed at the other. 


Theorem 10.3.1 | Fourier Convergence Theorem 


Suppose that f and f’ are piecewise continuous on the interval —L < x < L. Further, suppose that 
f is defined outside the interval —L < x < L so that it is periodic with period 2L. Then f has a 


Fourier series 
oo 
ao mmu x . (MTX 
F(x) 2 S +) (an cos( L ) + bn sin( L )) (4) 


m=1 
whose coefficients are given by equations (2) and (3). The Fourier series converges to f(x) at all 


1 
points where f is continuous, and it converges to 2 (f(xt) + f(x7)) at all points where f is 


discontinuous. 


1 
Note that 2 CfGx*) + f(x7)) is the mean value of the right- and left-hand limits at the 
point x. At any point where f is continuous, f(x*) = f(x^) = f(x). Thus it is correct to 
say that the Fourier series converges to 2 (f(x) + f(x ^)) at all points. Whenever we say 


that a Fourier series converges to a function f, we always mean that it converges in this sense. 

We emphasize that the conditions given in this theorem are only sufficient for the 
convergence of a Fourier series; they are by no means necessary. Nor are they the most general 
sufficient conditions that have been discovered. In spite of this, the proof of the theorem is 
fairly difficult, and we do not discuss it here.* Under more restrictive conditions, a much 
simpler convergence proof is possible; see Problem 18. 

To obtain a better understanding of the content of the theorem, it is helpful to consider 
some classes of functions that fail to satisfy the assumed conditions. Functions that are not 
included in the theorem are primarily those with infinite discontinuities in the interval [—L, L], 
such as 1/x? as x — 0, or In |x — L| as x — L. Functions having an infinite number of jump 
discontinuities in this bounded interval are also excluded; however, such functions are rarely 
encountered. 

It is noteworthy that a Fourier series may converge to a sum that is not differentiable, 
or even continuous, in spite of the fact that each term in the series (4) is continuous, and 
even differentiable infinitely many times. The example below is an illustration of this, as is 
Example 2 in Section 10.2. 


^Proofs of the convergence of a Fourier series can be found in most books on advanced calculus. See, for example, 
Kaplan (Chapter 7) or Buck (Chapter 6). 
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EXAMPLE 1 


Let 


fa) 0, -L«x«90, (5) 
A, -—- 
L, Ocxc-c«L, 


and let f be defined outside this interval so that f(x + 2L) = f(x) for all x. We will temporarily 
leave open the definition of f at the points x = 0, +L. Find the Fourier series for this function and 
determine where it converges. 


—o 
d——d 
-3L  -2L 


| FIGURE 10.3.2 | The square wave in Example 1. 


Solution: 


Three periods of the graph of y = f(x) are shown in Figure 10.3.2; it extends periodically to infinity 
in both directions. It can be thought of as representing a square wave. The interval [—L, L] can be 
partitioned into the two open subintervals (—L, 0) and (0, L). In (0, L), f(x) = Land f'(x) = 0. 
Clearly, both f and f’ are continuous and furthermore have limits as x — 0 from the right and as 
x — L from the left. The situation in ( — L, 0) is similar. Consequently, both f and f’ are piecewise 
continuous on [—L, L), so f satisfies the conditions of Theorem 10.3.1. If the coefficients am and 
bm are computed from equations (2) and (3), the convergence of the resulting Fourier series to f(x) 
is ensured at all points where f is continuous. Note that the values of the Fourier coefficients am and 
bm are the same regardless of the definition of f at its points of discontinuity. This is true because 
the value of an integral is unaffected by changing the value of the integrand at a finite number of 


points. From equation (2), 
1 f} L 
w=7/ foods e | Idx = L 
LJ 0 


1 b mmu x L MTX 
an = — fo) eos (ax = cos (77 )as 
LJ L " L 


2E Gator) ceo. sli 
m7 


and 


Similarly, from equation (3), 


"C feo sin( 22) ax = (= a 
a clo L West eS x 
0, m even; 
= ——(1-cos(mm)) =< 2 
T —, modd. 
m 


Hence 


f(x) L " 2b. (=) " l. 30x E l. 57x " 
x)= in 1n sin x 
Ru Ta T 5 L 


oo 
L 2L 1 . f (20n— Dax 
— . 6 
iem al L ) " 
n-l 
At the points x = 0, -EnL, where the function f in the example is not continuous, all terms in the 


series after the first vanish and the sum is L /2. This is the mean value of the limits from the right 
and left, as it should be. Thus we might as well define f at these points to have the value L/2. 
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If we choose to define it otherwise, the series still gives the value L/2 at these points, since all of 
the preceding calculations remain valid. The series simply does not converge to the function at those 
points unless f is defined to have the value L /2. This illustrates the possibility that the Fourier series 
corresponding to a function may not converge to it at points of discontinuity unless the function is 
suitably defined at such points. 

The manner in which the partial sums 


"m a in( 2) + " 1 .[(Qn-Umnx -1.2 
s) = 5 m sin L ae a L > 0AR-—1H2... 


of the Fourier series (6) converge to f(x) is indicated in Figure 10.3.3, where L has been chosen to be 
1 and the graph of sg(x) is plotted. The figure suggests that at points where f is continuous, the partial 
sums do approach f(x) as n increases. However, in the neighborhood of points of discontinuity, 
such as x = 0 and x = L, the partial sums do not converge smoothly to the mean value. Instead, 
they tend to overshoot the mark at each end of the jump, as though they cannot quite accommodate 
themselves to the sharp turn required at this point. This behavior is typical of Fourier series at points 
of discontinuity and is known as the Gibbs? phenomenon. 

Additional insight is attained by considering the error e,(x) = f(x) — s,(x). Figure 10.3.4 
shows a plot of |e;(x)| versus x for n = 8 and for L = 1. The least upper bound of |eg(x)| is 0.5 
and is approached as x — 0 and as x — 1. As n increases, the error decreases in the interior of the 
interval [where f(x) is continuous], but the least upper bound does not diminish with increasing n. 
Thus we cannot uniformly reduce the error throughout the interval by increasing the number of terms. 

Figures 10.3.3 and 10.3.4 also show that the series in this example converges more slowly than 
the one in Example 1 in Section 10.2. This is due to the fact that the coefficients in the series (6) are 
proportional only to 1/(2n — 1). 


The partial sum sg(x) in the Fourier series, 


equation (6), for the square wave. 


leg) 
0.5 


0.4 
0.3 


0.2 


| FIGURE 10.3.4 | A plot of the error |eg(x)| versus x for the square 


wave. 


>The Gibbs phenomenon is named after Josiah Willard Gibbs (1839-1903), who is better known for his work on 
vector analysis and statistical mechanics. Gibbs was professor of mathematical physics at Yale and one of the first 
American scientists to achieve an international reputation. The Gibbs phenomenon is discussed in more detail by 
Carslaw (Chapter 9). 


Problems 


In each of Problems 1 through 6, assume that the given function is 
periodically extended outside the original interval. 
a. Find the Fourier series for the extended function. 
b. Sketch the graph of the function to which the series converges 
for three periods. 


L ro =f") T<x <0, 
1, O<x<7 
2. ro ={" -r <x <0, 
X; OELSE 
L+x, -L<x <0, 
* fo - [rr O<x<L 
4. f(»)21-2x -l<x<l 
0, =T <x < —mn[2, 
5. f(x) =< 1, -2/2<x < 7/2, 


0, w/2<x< N 


0, 
6. f(x) = I 


In each of Problems 7 through 12, assume that the given function is 
periodically extended outside the original interval. 


—l<x< 0, 
O0<x<l 


a. Find the Fourier series for the given function. 
Q b. Let en(x) = f(x) — Sn(x). Find the least upper bound 
or the maximum value (if it exists) of |e,(x)| for n = 10, 20, 


and 40. 
Q c. If possible, find the smallest n for which |e,(x)| < 0.01 
for all x. 
-rm <x <0, 
T fa)= 0 he eeu f(x +27) = f(x) 
(see Section 10.2, Problem 15) 
x+1, -l<x<40Q, 
8. f(x) = l-z erei f(x +2) = f(x) 
(see Section 10.2, Problem 16) 
9. f(x =x, -l<x< 1; f(x+2) = f(x) 
(see Section 10.2, Problem 20) 
x+2, —2<x <0, 
10. f=), garaz TOD =F) 
(see Section 10.2, Problem 22) - 
0  -1xx«0, 
n. f(x) = deve, F429 = £0) 
(see Problem 6) i 
12. f(x) =x-—2x, -l<x <1; f(x4+2) = f(x) 


Periodic Forcing Terms. In this chapter we are concerned mainly 
with the use of Fourier series to solve boundary value problems for 
certain partial differential equations. However, Fourier series are also 
useful in many other situations where periodic phenomena occur. 

Problems 13 through 16 indicate how they can be employed to 
solve initial value problems with periodic forcing terms. 


13. Find the solution of the initial value problem 


y'-Fu?y =sin(nt), y(0) 20, y'(0) =0, 


where n is a positive integer and w? Æ n?. What happens if w? = n?? 


14. Find the formal solution of the initial value problem 


oo 
y' uy 2 V b,sinn), y(0) =0, y'(0) =0, 


n=1 
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where w > 0 is not equal to a positive integer. How is the solution 
altered if w = m, where m is a positive integer? 


15. Find the formal solution of the initial value problem 


y'w?y — f(t), y(0) =0, y(0) — 0, 


where f is periodic with period 27 and 


1, O«t«m; 
ft) 2 0 t20,7,2m; 
—l m-c«t«2m. 
See Problem 1. 


16. Find the formal solution of the initial value problem 


y’+wy=f(t), y(00-21, y(0) =0, 
where f is periodic with period 2 and 
ft) = 1-t, 0<ż< 1l; 
~ lalth 1<t<2. 


See Problem 8. 
17. Assuming that 


f(y == >t C cos (= 


n=1 


na) m 


show formally that 


j f} E 
Ji [f GOYdx — RE (a? + 855. 
SE = 


This relation between a function f and its Fourier coefficients is 
known as Parseval's$ equation. This relation is very important in the 
theory of Fourier series; see Problem 9 in Section 11.6. 

Hint: Multiply equation (7) by f(x), integrate from —L to L, and use 
the Euler-Fourier formulas. 


T 


18. This problem indicates a proof of convergence of a Fourier series 
under conditions more restrictive than those in Theorem 10.3.1. 
a. If f and f’ are piecewise continuous on — L < x < L, and if 
f is periodic with period 2L, show that nan and nb, are bounded 
asn — oo. Hint: Use integration by parts. 
b. If f is continuous on —L < x < L and periodic with period 
2L, and if f’ and f" are piecewise continuous on —L < x < L, 
show that n?a, and n?b, are bounded as n — on. If f is 
continuous on the closed interval, then it is continuous for all x. 
Why is this important? Hint: Again, use integration by parts. 
oo oo 


c. Using the result of part b, show that 5 |a,| and 5 lbn] 
n=1 n-l 

converge. 

d. From the result in part c, show that the Fourier series (4) 

converges absolutely’ for all x. 


®Marc-Antoine Parseval (1755-1836) was a relatively obscure French 
mathematician for whom an important result has been named. He presented 
a forerunner of this result in 1799, though not in the context of Fourier series. 


TIt also converges uniformly; for an explanation of what this means, see a book 
on advanced calculus or analysis. 
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Acceleration of Convergence. In the next problem, we show how it is 
sometimes possible to improve the speed of convergence of a Fourier 
series. 


19. Suppose that we wish to calculate values of the function g, 
where 


(n — 1) 


g(x) = »» VENDENT: 


n= 


sin( (2n 


Dax). (8) 


It is possible to show that this series converges, albeit rather slowly. 
However, observe that for large n, the terms in the series (8) are 
approximately equal to sin((2n — 1)7 x) / (2n — 1) and that the latter 
terms are similar to those in the example in the text, equation (6). 


a. Show that 
3 
2n — 

n=1 


where f is the square wave in the example with L = 1. 
b. Subtract equation (9) from equation (8) and show that 


1 = sin((2n — 1) rx) 
>) 3 (2n - D (1+ 2n — 1)?) 
(10) 


The series (10) converges much faster than the series (8) and thus 
provides a better way to calculate values of g(x). 


in( (2n — 1 EA. : 9 
j nin — i ae) fix)-5 » (9) 


= T 
g(x) = 2 f(x) 


104 Even and Odd Functions 


Before looking at further examples of Fourier series, it is useful to distinguish two classes 
of functions for which the Euler-Fourier formulas can be simplified. These are even and odd 
functions, which are characterized geometrically by the property of symmetry with respect to 
the y-axis and with respect to the origin, respectively (see Figure 10.4.1). 

Analytically, f is an even function if its domain contains the point —x whenever it 
contains the point x, and if 


f(—x) = f(x) 1) 


for each x in the domain of f. Similarly, f is an odd function if its domain contains —x 
whenever it contains x, and if 


fC-x) == f(x) Q) 


for each x in the domain of f. Examples of even functions are 1, x?, cos(nx), |x|, cosh(nx), 
and x2". The functions x, x?, sin(nx), sinh(nx), and x2"+1 are examples of odd functions. 
Note that according to equation (2), (0) must be zero if f is an odd function whose domain 
contains the origin. Most functions are neither even nor odd; an example is e*. Only one 
function, f identically zero, is both even and odd. 


ERE 


—— (a) An even function. (b) An odd function. 


Elementary properties of even and odd functions include the following:? 


1. The sum (difference) and product (quotient) of two even functions are even. 


2. The sum (difference) of two odd functions is odd; the product (quotient) of two odd 
functions is even. 


8 These statements may need to be modified if either function vanishes identically. 


3. The sum (difference) of an odd function and an even function is neither even nor odd; the 
product (quotient) of an odd function and an even function is odd. 


The proofs of all these assertions are simple and follow directly from the definitions. For 
example, if both fj and f» are odd, and if g(x) = fi(x) + fo(x), then 


fi(—x) + fax) = —filx) — f(x) 
=(A + R) = g(a), (3) 
so fı + f; is an odd function also. Similarly, if h(x) = fi(x) fo(x), then 


h(—x) = fi(—x) (9) = (fica) (a) = Ai) fax) = h(x), (4) 


g(—x) 


so that f, f2 is even. 
Also of importance are the following two integral properties of even and odd functions: 


4. If f is an even function, then 
L jn 
J f(x)dx = 2 f f(x)dx. (5) 
= 0 
5. If f is an odd function, then 


L 
/ f(x)dx — O. (6) 
HS 


Properties 4 and 5 are intuitively clear from the interpretation of an integral in terms of 
area under a curve, and they also follow immediately from the definitions. For example, if f 


is even, then 
L 0 L 
[ fea = | fds + | f(x)dx. 
—L c 0 


Letting x — —s in the first term on the right-hand side and using equation (1), we obtain 


L 0 E 
J fik -f fas + | foJis 
—L E 0 


0 L L 
-f foas+ | f(x)dx i f(x) dx. 
L 0 0 


The proof of the corresponding property for odd functions is similar. (See Problem 31.) 
Even and odd functions are particularly important in applications of Fourier series since 
their Fourier series have special forms, which occur frequently in physical problems. 


Cosine Series. Suppose that f and f’ are piecewise continuous on —L < x < L and that 
f is an even periodic function with period 2L. Then it follows from properties | and 3 that 
f(x) cos(n7x/L) is even and f(x) sin(nz x/ L) is odd. As a consequence of equations (5) 
and (6), the Fourier coefficients of f are then given by 


Zi F(x) (a JE (y 0.1.2 
n= cos (| —— = — cos| —— 2 —0.1:2. 1.3 
a Tla x T X E. X) co 7 X, n 


b, — 0, ned ox 


(7) 


Thus f has the Fourier series 


In other words, the Fourier series of any even function consists only of the even trigonometric 
functions cos(77 x/ L) and the constant term; it is natural to call such a series a Fourier 
cosine series. From a computational point of view, observe that only the coefficients a,, for 
n = 0,1,2, ... , need to be calculated from the integral formula (7). Each of the b,, for 
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483 
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n = 1,2, ..., is automatically zero for any even function and so does not need to be 
calculated by integration. 


Sine Series. Suppose that f and f’ are piecewise continuous on —L < x < L and that 
f is an odd periodic function of period 2L. Then it follows from Properties 2 and 3 that 
f (x) cos(nz x / L) is odd and f (x) sin(n7 x/ L) is even. Thus, from equations (5) and (6), 
the Fourier coefficients of f are 


dn = 0, n=0,1,2,..., 
b 3 fo) si ma Ie Jsi (=a 155 (8) 
n = — x) sin{ —— Jdx = — sin{ ——]dx, n=1,2,..., 
mm L ra ial L 
and the Fourier series for f is 
= nux 
= Sos") 
f(x) 2 sin I 


Thus the Fourier series for any odd function consists only of the odd trigonometric functions 
sin(n7 x / L); such a series is called a Fourier sine series. Again observe that only half of the 
coefficients need to be calculated by integration, since each a,, for n = 0, 1,2, ... , is zero 
for any odd function. 


EXAMPLE 1 


Let f(x) =x, -L < x < L,andlet f(—L) = f(L) = 0. Let f be defined elsewhere so that it 
is periodic of period 2L. The function defined in this manner is known as a sawtooth wave. Graph 
three periods of y = f(x). Find the Fourier series for this function. 


Solution: 


The graph of y = f(x) on [—L, L] and one period to the left and one period to the right is shown 
in Figure 10.4.2. 


| FIGURE 10.4.2 | The sawtooth wave in Example 1. 


Since f is an odd function, its Fourier coefficients are, according to equation (8), 


da —0, w= 01,22 224 
2 L .fínmnx 
b, = = x sin( Jax 
L Jo L 
2 
L (s (=>) nox (=) 
— sin cos 
(=) L L L 


2L 
= —-1)"*!, n=1,2,.... 
NT 
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Hence the Fourier series for f, the sawtooth wave, is 


f(xy) = = »» E sin (72) 


n= 
2L/ . (mx 1. 30x 1. 51x 
= sin( ) sin + sin ke be 
T L 3 L 5 L 
Observe that the periodic function f is discontinuous at the points +L, 3L, ... , as shown in 
Figure 10.4.2. At these points the series (9) converges to the mean value of the left and right limits, 


namely, to zero. The partial sum of the series (9) for n — 9 is shown in Figure 10.4.3. The Gibbs 
phenomenon (mentioned in Section 10.3) again occurs near the points of discontinuity. 


(9) 


A partial sum (blue) in the Fourier series, 
equation (9), for the sawtooth wave (red). 


Note that in this example f( —L) = f(L) = 0, as well as f(0) = 0. This is required if 
the function f is to be both odd and periodic with period 2L. When we speak of constructing 
a sine series for a function defined on 0 < x < L, it is understood that, if necessary, we must 
first redefine the function to be zero at x = 0 and x = L. 

It is important to point out that we have now found two different Fourier series 
representations of the function f(x) = x on the interval 0 < x < 2. In Example 1 of 
Section 10.2, we found a Fourier cosine series for f(x) on 0 < x < 2. On the other hand, 
in Example 1 in this section, just above, with L — 2, we expressed the same function with a 
Fourier sine series. Thus, if it is required to represent the function f(x) = x on0 < x < 2 
by a Fourier series, it is possible to do this by either a cosine series or a sine series. In the 
former case, f is extended as an even function into the interval —2 < x < 0 and elsewhere 
periodically (the triangular wave). In the latter case, f is extended into —2 < x < 0 as an odd 
function and elsewhere periodically (the sawtooth wave). If f is extended in any other way, 
the resulting Fourier series will still converge to x in 0 < x < 2 but will involve both sine 
and cosine terms. 

In solving problems in differential equations, it is often useful to expand in a Fourier series 
of period 2L a function f originally defined only on the interval [0, L]. As indicated previously 
for the function f(x) — x, with L — 2, several alternatives are available. Explicitly, we can 


1. Define a function g of period 2L so that 


(10) 


The function g is thus the even periodic extension of f . Its Fourier series, which is a cosine 
series, represents f on [0, L]. 
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2. Define a function h of period 2L so that 
f(x), Om xe: 
h(x) = 0, x —0;L, (11) 
—f(-x), -L<x<0. 


The function A is thus the odd periodic extension of f . Its Fourier series, which is a sine 
series, also represents f on (0, L). 


3. Define a function k of period 2L so that 
K(x) 2 f(x), O<x<L, (12) 


and let k(x) be defined for ( — L, 0) in any way consistent with the conditions of Theorem 
10.3.1. Sometimes it is convenient to define k(x) to be zero for —L < x < 0. The 
Fourier series for k, which involves both sine and cosine terms, also represents f on [0, L], 
regardless of the manner in which k(x) is defined in ( — L, 0). Thus there are infinitely 
many such series, all of which converge to f(x) in the original interval. 


Usually, the form of the expansion to be used will be dictated (or at least suggested) by the 
purpose for which it is needed. However, if there is a choice as to the kind of Fourier series to 
be used, the selection can sometimes be based on the rapidity of convergence. For example, the 
cosine series for the triangular wave (equation (20) of Section 10.2) converges more rapidly 
than the sine series for the sawtooth wave (equation (9) in this section), although both converge 
to the same function forO < x < L. This is because the triangular wave is a smoother function 
than the sawtooth wave and is therefore easier to approximate. In general, the more continuous 
derivatives possessed by a function over the entire interval —oo < x < oo, the faster its 
Fourier series will converge. See Problem 18 of Section 10.3. 


EXAMPLE 2 


Suppose that 


f(x) = l-x, O<x<l, (13) 
"I= ð; l<x<2. 


As indicated previously, we can represent f either by a cosine series or by a sine series. Sketch the 
graph of three periods of the sum of each of these series for —6 < x < 6. 


Solution: 


In this example, L = 2, so the cosine series for f converges to the even periodic extension of f of 
period 4, whose graph is sketched in Figure 10.4.4. 


Even periodic extension of f(x) given by equation (13). 


Similarly, the sine series for f converges to the odd periodic extension of f of period 4. The 
graph of this function is shown in Figure 10.4.5. 


| FIGURE 10.4.5 | Odd periodic extension of f(x) given by equation (13). 


Problems 


In each of Problems | through 6, determine whether the given function 
is even, odd, or neither. 


1. x3-2x 

2. x—-2x +1 
3. tan(2x) 

4. secx 

5. |xp 

6. e* 


In each of Problems 7 through 12, a function f is given on an interval 
of length L. In each case sketch the graphs of the even and odd 
extensions of f of period 2L. 


7, roo =f? xXx 
1, 2 = X-< 3 

8. ro ={" O0<x<1, 
x—l, l<x<2 

9. f(x) =2-x, O0<x<2 

10. f(x) =x -3, O0<x<4 
0, 0x 

= roy ={t I<x<2 


12. f(x) =4- x?, 
13. Prove that any function can be expressed as the sum of two 
other functions, one of which is even and the other odd. That is, for 
any function f, whose domain contains —x whenever it contains x, 
show that there are an even function g and an odd function h such that 
f(x) = g(x) +h(x). Hint: What can you say about f(x) + f(—x)? 
14. Find the coefficients in the cosine and sine series described in 
Example 2. 
In each of Problems 15 through 22: 

a. Find the required Fourier series for the given function. 

b. Sketch the graph of the function to which the series converges 

over three periods. 


1 O0<x<l 

15. Za ' 
in n 1«x«2; 
(Compare with Example 1 and Problem 5 of Section 10.3.) 


0x 


cosine series, period 4 


16. f(x)= B adiu sine series, period 4 
1, Lrg 2: 
17. f(x) =1, O<x <7; cosine series, period 27 
18. f(x) =1, 0 <x <7; sine series, period 27 
0, O<x< 7, 
19. f(x) =< 1, T <x < 27, sine series, period 67 
2. 2m <x < 3m; 
20. f(x) =x, 0 <x <1; series of period 1 


21. f(x) 2L-x, O<x <UL; cosine series, period 2L 
(Compare with Example 1 of Section 10.2.) 
22. f(x) -L-x, O< x < ZL; 
In each of Problems 23 through 26: 
a. Find the required Fourier series for the given function. 
b. Sketch the graph of the function to which the series converges 
for three periods. 
O c. Plot one or more partial sums of the series. 


sine series, period 2L 
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X5 0cxc«m, : : , 
23. f(x)- cosine series, period 47 
0, T<x< 20; 
24. f(x) =—-x, —r <x< 0; sine series, period 27 
25. f(x) 22-x?, 0 <x <2; sine series, period 4 
26. f(x) = x? — 2x, O«x-«4; cosine series, period 8 


In each of Problems 27 through 30, a function is given on an interval 
0 «c x. 
a. Sketch the graphs of the even extension g(x) and the odd 
extension A(x) of the given function of period 2L over three 
periods. 
b. Findthe Fourier cosine and sine series for the given function. 
Q c. Plot a few partial sums of each series. 
O d. For each series, investigate the dependence on n of the 
maximum error on [0, L]. 


27. f(x)23-x, 0<x <3 
X; Ocxc«l, 
em fo - fs I<x<2 
29. f(x) = (4x? — 4x -3)/4 O0<x<2 
30. f(x) =x —5x7 45x41, 0<x<3 


31. Prove that if f is an odd function, then 


L 
J f(x)dx = 0. 
zh 


32. Prove Properties 2 and 3 of even and odd functions, as stated in 
the text. 


33. Prove that the derivative of an even function is odd and that the 


derivative of an odd function is even. 


X 
34. Let F(x) = | f (t)dt. Show that if f is even, then F is odd, 
0 


and that if f is odd, then F is even. 
35. From the Fourier series for the square wave in Example 1 of 
Section 10.3, show that 


=0 


= 


This relation between 7 and the odd positive integers was discovered 
by Leibniz in 1674. 


36. From the Fourier series for the triangular wave (Example 1 of 
Section 10.2), show that 


T? 1 41 = 1 
Sipe a eem ——— 
52 2 um 


n=l 


37. Assume that f has a Fourier sine series 
oo 
.f(nnX 
f(x) = y» sin( 72). O<x<L. 
n=1 
a. Show formally that 


2 [5 o 
JI (fCOY deo o. 


n=1 


Compare this result (Parseval’s equation) with that of Problem 
17 in Section 10.3. What is the corresponding result if f has a 
cosine series? 
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b. Apply the result of part a to the series for the sawtooth wave 
given in equation (9), and thereby show that 


This relation was discovered by Euler in about 1735. 


More Specialized Fourier Series. Let f be a function originally 
defined on 0 < x < L and satisfying there the continuity conditions 
of Theorem 10.3.1. In this section we have shown that it is possible to 
represent f by either a sine series or a cosine series by constructing odd 
or even periodic extensions of f, respectively. Problems 38 through 
40 concern some other, more specialized Fourier series that converge 
to the given function f on (0, L). 


38. Let f be extended into (L, 2L] in an arbitrary manner subject to 
the continuity conditions of Theorem 10.3.1. Then extend the resulting 
function into ( 2L, 0) as an odd function and elsewhere as a periodic 
function of period 4L (see Figure 10.4.6). Show that this function 


: : A : ; ; n7 x 
has a Fourier sine series in terms of the functions sin 2L for 


n —1,2,3, ... ; thatis, 


Fx) = Y du sin (= ). 


n=1 


2L 
hcl syn = yax 
T E f 2L l 


This series converges to the original function on (0, L). 


where 


Graph of one period of 
the function in Problem 38. 
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39. Let f first be extended into ( L, 2L) so thatit is symmetric about 
x = L. Then f satisfies f(2L — x) = f(x) for 0 < x < L. Let 
the resulting function be extended into (—2L, 0) as an odd function 
and elsewhere as a periodic function of period 4L (see Figure 10.4.7). 
Show that this function has a Fourier series in terms of the functions 


o£ Hx . { 3nx _ [Sax ‘ 
sin| — ], sin{ —— J,sin| —— ], ... ; that is, 
2L 2L 2L 


f(x) = od, sin 


n=1 


p=? 2 M (20= Dax) , 
n= 7 A f(x) sin C 2L — x 


This series converges to the original function on (0, L]. 


where 


Graph of one period of 
the function in Problem 39. 


40. a. How should f, originally defined on [0, L], be extended 
so as to obtain a Fourier series involving only the functions 
"X 30x 5nx 
cos{ —— }, cos{ —— },cos| —— ]). -- 
2L 2L 2L 
38 and 39. 


b. If f(x) = x for 0 < x < L, sketch the function to which the 
Fourier series converges for —4L < x < 4L. 


. ? Refer to Problems 


Separation of Variables; Heat 
Conduction in a Rod 


The basic partial differential equations of heat conduction, wave propagation, and potential 
theory that we discuss in this chapter are associated with three distinct types of physical 
phenomena: diffusive processes, oscillatory processes, and time-independent or steady-state 
processes. These three partial differential equations are of fundamental importance in many 
branches of physics. They are also of considerable significance from a mathematical point of 
view. The partial differential equations whose theory is best developed and whose applications 
are most significant and varied are the linear equations of second order. All such equations can 
be classified into one of three categories: the heat conduction equation, the wave equation, 
and the potential equation, respectively, are prototypes of these categories. Thus a study of 
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these three equations yields much information about more general second-order linear partial 
differential equations. 

During the last two centuries, several methods have been developed for solving partial 
differential equations. The method of separation of variables is the oldest systematic method, 
having been used by d’Alembert, Daniel Bernoulli, and Euler about 1750 in their investigations 
of waves and vibrations. It has been considerably refined and generalized in the meantime, and 
it remains a method of great importance and frequent use today. To show how the method of 
separation of variables works, we consider first a basic problem of heat conduction in a solid 
body. The mathematical study of heat conduction originated? about 1800, and it continues 
to command the attention of modern scientists. For example, analysis of the dissipation and 
transfer of heat away from its sources in high-speed machinery is frequently an important 
technological problem. 

Let us now consider a heat conduction problem for a straight bar of uniform cross section 
and homogeneous material. Let the x-axis be chosen to lie along the axis of the bar, and let 
x = O and x = L denote the ends of the bar (see Figure 10.5.1). Suppose further that the sides 
of the bar are perfectly insulated so that no heat passes through them. We also assume that the 
cross-sectional dimensions are so small that the temperature u can be considered constant on 
any given cross section. Then u is a function only of the axial coordinate x and the time f. 


u(x, t) 


y= [9] x-L 
| FIGURE 10.5.1 | A heat-conducting solid bar. 


The variation of temperature in the bar is governed by a partial differential equation 
derived in Appendix A at the end of this chapter. The equation is called the heat conduction 
equation and has the form 


Q Uxx = Uy, O<x< E, f (1) 


where a” is a constant known as the thermal diffusivity. The parameter œ? depends only on 
the material from which the bar is made and is defined by 


es. (2) 


where & is the thermal conductivity, p is the density, and s is the specific heat of the material 
in the bar. The units of a? are length squared per time. Typical values of a? are given in 
Table 10.5.1. 

In addition, we assume that the initial temperature distribution in the bar is given; thus 


u(x,0) = f(x), Ox<xK<L, (3) 


where f is a given function. Finally, we assume that the ends of the bar are held at fixed 
temperatures: the temperature T; at x = 0 and the temperature T, at x = L. However, it turns 
out that we need only consider the case where T; = T) = 0. We show in Section 10.6 how to 
reduce the more general problem to this special case. Thus, in this section we will assume that 
u is always zero when x = 0 or x = L: 


u(0,t) 20, u(L,t) 20, t0. (4) 


The fundamental problem of heat conduction is to find u(x, t) that satisfies the differential 
equation (1) for 0 < x < L and for: > OQ, the initial condition (3) when t = 0, and the 
boundary conditions (4) at x — 0 and x — L. 


?The first important investigation of heat conduction was carried out by Joseph Fourier. He presented basic papers on 
the subject to the Academy of Sciences of Paris in 1807 and 1811. Although these papers were controversial and were 
not published at the time, Fourier continued to develop his ideas and eventually wrote one of the classics of applied 
mathematics, Théorie analytique de la chaleur, published in 1822. 
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Values of the Thermal Diffusivity for 


TABLE aera Some Common Materials 


Material a? (cm?/s) 
Silver (99.9% pure) 1.6563 
Gold 1.27 
Copper (at 25°C) 1.11 
Silicon 0.88 
Aluminum 0.8418 
Iron 0.23 
Air (at 300K) 0.19 
Cast Iron 0.12 
Steel (1% carbon) 0.1172 
Steel (stainless 310 at 25°C) 0.03352 
Quartz 0.014 
Granite 0.011 
Brick 0.0038 
Water 0.00144 
Wood (yellow pine) 0.00082 


The problem described by equations (1), (3), and (4) is an initial value problem in the time 
variable t; an initial condition is given, and the differential equation governs what happens 
later. However, with respect to the space variable x, the problem is a boundary value problem; 
boundary conditions are imposed at each end of the bar, and the differential equation describes 
the evolution of the temperature in the interval between them. Alternatively, we can consider 
the problem as a boundary value problem in the xt-plane (see Figure 10.5.2). The solution 
u(x,t) of equation (1) is sought in the semi-infinite strip O < x < L,t > 0, subject to the 
requirement that u(x, t) must assume a prescribed value at each point on the boundary of this 
semi-infinite strip. 


u(O,t) 20 4 u(L,t) 20 


x 


u(x,O) = f(x) 


Boundary value problem for the 
heat conduction equation. 


The heat conduction problem (1), (3), (4) is linear since the unknown temperature function 
u appears only to the first power throughout. The differential equation and boundary conditions 
are also homogeneous. This suggests that we might approach the problem by seeking solutions 
of the differential equation and boundary conditions, and then superposing them to satisfy the 
initial condition. The remainder of this section describes how this plan can be implemented. 


Method of Separation of Variables. One solution of the differential equation (1) that 
satisfies the boundary conditions (4) is the function u(x,t) = 0, but this solution does not 
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satisfy the initial condition (3) except in the trivial case in which f(x) is also zero. Thus our 
goal is to find other, nonzero solutions of the differential equation and boundary conditions. 
To find the needed solutions, we start by making a basic assumption about the form of the 
solutions that has far-reaching, and perhaps unforeseen, consequences. The assumption is that 
u(x, t) is a product of two functions, one depending only on x and the other depending only 
on £; thus 


u(x,t) — X(x)T(t). (5) 
Substituting from equation (5) for u in the differential equation (1) yields 
a?X"(x)T(t) = X(x)T'(t). 


We will simplify this derivation by often omitting the independent variables x and t; our 
notation is intended to indicate that the functions X and T' depend on the variables x and 
t, respectively. Thus, we will write the above equation as 


o?X"T = XT’, (6) 


where primes refer to ordinary differentiation with respect to the independent variable, whether 
x or t. Dividing both sides of equation (6) by a7 XT yields the equivalent equation 
x" 1 T” 
ee 7 
X æT i 
in which the variables are separated; that is, the left-hand side depends only on x and the 
right-hand side only on t. 

It is now crucial to realize that for equation (7) to be valid at each point (x, f) in the 
semi-infinite strip 0 < x < L,t > 0, it is necessary that both sides of equation (7) must be 
equal to the same constant. Otherwise, if one independent variable (say, x) were kept fixed 
and the other were allowed to vary, one side (the left in this case) of equation (7) would remain 
unchanged while the other varied, thus violating the equality. If we call this separation constant 
—A, then equation (7) becomes 

X" 1 T’ 


= = . 8 
X a? T i (8) 


Hence we obtain the following two ordinary differential equations for X(x) and T (t): 


X" -- AX — 0, (9) 
T' +a AT =0. (10) 


We denote the separation constant by —A (rather than A) because in this problem it turns out 
that it must be negative, and it is convenient to exhibit the minus sign explicitly. 

The assumption (5) has led to the replacement of the partial differential equation (1) by 
the two ordinary differential equations (9) and (10). Each of these equations is linear and 
homogeneous, with constant coefficients, and so can be readily solved for any value of A. The 
product of two solutions of equation (9) and (10), respectively, provides a solution of the partial 
differential equation (1). However, we are interested only in those solutions of equation (1) that 
also satisfy the boundary conditions (4). As we now show, this severely restricts the possible 
values of A. 

Substituting for u(x, t) from equation (5) in the boundary condition at x — 0, we obtain 


u(0,t) = X(0)T(t) =0. (11) 


If equation (11) is satisfied by choosing T(t) to be zero for all t, then u(x, t) is zero for all x 
and t, and we have already rejected this possibility. Therefore, equation (11) must be satisfied 
by requiring that 


X(0) — 0. (12) 
Similarly, the boundary condition at x = L requires that 


X(L) =0. (13) 
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We now want to consider equation (9) subject to the boundary conditions (12) and (13). 
This is an eigenvalue problem and, in fact, is the same problem that we discussed in detail at the 
end of Section 10.1; see especially the paragraph containing equation (29) in that section. The 
only difference is that the dependent variable there was called y rather than X. If we refer 
to the results obtained earlier (equation (31) of Section 10.1), the only nontrivial solutions of 
equations (9), (12), and (13) are the eigenfunctions 


. (nnx 
xx em sin( 777) n=1,2,3,... (14) 

associated with the eigenvalues 
Ape ae 23g 2. (15) 


Turning now to equation (10) for T(t) and substituting nen / L? for A, we have 


T+ T3 T —0. (16) 


Thus T (t) is proportional to exp(—n?7*a*t/L*). Hence, multiplying solutions of equations 
(9) and (10) together, and neglecting arbitrary constants of proportionality, we conclude that 
the functions 


un(X,t) = Ria sin( 75), Wig. (17) 


satisfy the partial differential equation (1) and the boundary conditions (4) for each positive 
integer value of n. The functions u, are sometimes called fundamental solutions of the heat 
conduction problem (1), (3), and (4). 

It remains only to satisfy the initial condition (3): 


u(x,0) = f(x), O<x <L. (18) 


Recall that we have often solved a linear initial value problem by using the principle of 
superposition; that is, we form a linear combination of a set of fundamental solutions of the 
differential equation and then choose the coefficients to satisfy the initial conditions. The 
analogous step in the present problem is to form a linear combination of the functions (17) 
and then to choose the coefficients to satisfy equation (18). The main difference from earlier 
problems is that there are infinitely many functions (17), so a general linear combination of 
them is an infinite series. Thus we assume that 


oo oo 
2:22 2* x nmnx 
u(x, t) =) ential t) = Y ee IP sin (=), (19) 


nal n=1 


where the coefficients c, are as yet undetermined. The individual terms in the series (19) satisfy 
the differential equation (1) and boundary conditions (4). We will assume that the infinite 
series of equation (19) converges and also satisfies equations (1) and (4). To satisfy the initial 
condition (3), we must have 


u(x, 0) = 7 sin( 77) = f(x). (20) 


n=1 


In other words, we need to choose the coefficients c, so that the series of sine functions in 
equation (20) converges to the initial temperature distribution f(x) for O < x < L. The series 
in equation (20) is just the Fourier sine series for f; according to equation (8) of Section 10.4, 
its coefficients are given by 


u 7 . (nn x d Bi 
o mp fo si( 77) x. (21) 


Hence the solution of the heat conduction problem of equations (1), (3), and (4) is given by 
the series in equation (19) with the coefficients computed from equation (21). 
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EXAMPLE 1 


Find the temperature u(x, t) at any time in a metal rod 50 cm long, insulated on the sides, which 
initially has a uniform temperature of 20°C throughout and whose ends are maintained at 0°C for all 
t 0. 


Solution: 


The temperature in the rod satisfies the heat conduction problem (1), (3), (4) with L = 50 and 
f(x) = 20 for 0 < x < 50. Thus, from equation (19), the solution is 


oo 
24.242 2 ANTR 
u(x,t) = oe isa (A). Q2) 
n=1 
where, from equation (21), 
80 
4 °° A ATX 40 —, nodd; 
== sin| —— |dx = — (1 — cosnT) = 4 nv (23) 
5 Jo 50 nT 
0, n even. 
Finally, by substituting for c, in equation (22), we obtain 
oo 
80 1.22.3 nux 
t) = — > a n n a t2500 .. eee dt 24 
u(x, t) z » xt sin 50 (24) 
n=l g Spa 


The expression (24) for the temperature is moderately complicated, but the negative exponential 
factor in each term of the series causes the series to converge quite rapidly, except for small values 
of t or a”. Therefore, accurate results can usually be obtained by using only the first few terms of 
the series. 

In order to display quantitative results, let us measure t in seconds; then o? has the units 
of cm?/s. If we choose a? = 1 for convenience, this corresponds to a rod of a material whose 
thermal properties are somewhere between copper and aluminum. The behavior of the solution can 
be seen from the graphs in Figures 10.5.3 through 10.5.5. In Figure 10.5.3 we show the temperature 
distribution in the bar at several different times. Observe that the temperature diminishes steadily as 
heat in the bar is lost through the end points. The way in which the temperature decays at a given point 
in the bar is indicated in Figure 10.5.4, where temperature is plotted against time for a few selected 
points in the bar. Finally, Figure 10.5.5 is a three-dimensional plot of u versus both x and t. Observe 
that we obtain the graphs in Figures 10.5.3 and 10.5.4 by intersecting the surface in Figure 10.5.5 by 
planes on which either f or x is constant. Note that the temperature is constant along each of the curves 
drawn on the surface in Figure 10.5.5; in this case, these level curves are known as isotherms. The 
slight waviness in Figure 10.5.5 at t = 0 results from using only five terms in the series for u(x, t) 
and from the slow convergence of the series for t = 0 (remember the Gibbs phenomenon introduced 
in Section 10.3). 


20 30 40 50 x 


Temperature distributions at several times using 
the first five terms of the series solution (24) of the heat conduction 
problem in Example 1. 
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Temperature profiles as a function of time at 
several locations using the first five terms of the series solution (24) 
of the heat conduction problem in Example 1. 


A problem with possible practical implications is to determine the time 7 at which the entire 
bar has cooled to a specified temperature. For example, when is the temperature in the entire bar no 
greater than 1°C? Because of the symmetry of the initial temperature distribution and the boundary 
conditions, the warmest point in the bar is always the center. Thus 7 is found by solving u(25, t) = 1 
for t. Using one term in the series expansion (24), we obtain 


2500 80^. 
T=— inl —]= 820 s. 
T T 


Plot of temperature u versus x and t using the first 10 terms in 
the series solution (24) of the heat conduction problem in Example 1. 


Problems 


In each of Problems 1 through 6, determine whether the method 
of separation of variables can be used to replace the given partial 
differential equation by a pair of ordinary differential equations. If so, 
find the equations. 


l. xusx +u;,=0 


2. tux, xu; =0 

3. usx tuyt +u, =0 

4. (p(x)ux)x —r(x)un =0 

5. ux +(x + y)uyy =0 

6. usx + Uyy T xu — 0 

7. Find the solution of the heat conduction problem 
100u,, = ui, 0cxcl,t30; 
u(0, t) = 0, u(1,1) 20, t > 0; 
u(x,0) = sin(2nx) —sin(5yx), O<x <1. 


8. Find the solution of the heat conduction problem 
Uy, — 4u, O«x«2, t»0 
u(0,t) 20, u(2,t)=0, t> 0; 


u(x, 0) = 2sin(1x/2) —sin(zx) -4sin2zx), O<x <2. 


Consider the conduction of heat in a rod 40 cm in length whose ends 
are maintained at 0°C for all t > 0. In each of Problems 9 through 12, 
find an expression for the temperature u(x, t) if the initial temperature 
distribution in the rod is the given function. Suppose that a? = 1. 


9. u(x,0) =50, O0< x < 40 
Xy 0 x x « 20, 
BE. Ha vee 20 <x < 40 
0, 0 € x « 10, 
1l. u(x,0) = < 50, 10 <x < 30, 
0, 30 < x < 40 
12. u(x,0) =x, 0<x< 40 


[N | 13. Consider again the rod in Problem 9. For t = 5 and x = 20, 
determine how many terms are needed to find the solution correct to 
three decimal places. A reasonable way to do this is to find n so that 
including one more term does not change the first three decimal places 
of u(20, 5). Repeat for t = 20 and t = 80. Form a conclusion about 
the speed of convergence of the series for u(x, t). 
@ 14. Repeat Problem 13 for the rod in Problem 10. 
@ 15. Repeat Problem 13 for the rod in Problem 11. 
@ 16. Repeat Problem 13 for the rod in Problem 12. 
17. Forthe rod in Problem 9: 
Q a. Plot u versus x for t = 5, 10, 20, 40, 100, and 200. Put all 
of the graphs on the same set of axes and thereby obtain a picture 
of the way in which the temperature distribution changes with 
time. 
@ b. Plot u versus t for x = 5, 10, 15, and 20. 
Q c. Draw a three-dimensional plot of u versus x and t. 
@ d. How long does it take for the entire rod to cool off to a 
temperature of no more than 1°C? 
@ 18. Repeat Problem 17 for the rod in Problem 10. 
Q) 19. Repeat Problem 17 for the rod in Problem 11. 
G 


20. Forthe rod in Problem 12: 

O a. Plot u versus x for t = 5, 10, 20, 40, 100, and 200. 

(3 b. For each value of t used in part a, estimate the value of x 
for which the temperature is greatest. Plot these values versus t 
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to see how the location of the warmest point in the rod changes 
with time. 

Q3 c. Plot u versus t for x = 10, 20, and 30. 

O d. Draw a three-dimensional plot of u versus x and t. 

o e. How long does it take the entire rod to cool to a 
temperature of no more than 1°C? 


@ 21. Leta metallic rod 20 cm long be heated to a uniform temp- 
erature of 100°C. Suppose that at t = 0, the ends of the bar are plunged 
into an ice bath at 0°C and thereafter maintained at this temperature, 
but that no heat is allowed to escape through the lateral surface. Find an 
expression for the temperature at any point in the bar at any later time. 
Determine the temperature at the center of the bar at time t = 30 s if 
the bar is made of a. silver, b. aluminum, and c. cast iron. 


@ 22. Approximate the times when the entire bar has cooled 
to 1°C 
a. using two terms; 
b. using three terms. 
C. Compare the times found in a and b with the time found using 
one term in Example 1. 


@ 23. For the rod of Problem 21, find the time that will elapse 
before the center of the bar cools to a temperature of 5°C if the bar is 
made of a. silver, b. aluminum, and c. cast iron. 

24. In solving differential equations, the computations can almost 
always be simplified by the use of dimensionless variables. 
a. Show that if the dimensionless variable € = 
introduced, the heat conduction equation becomes 
Ou  L?ÓOu 
0t? a? Or’ 
b. Since L?/ a has the units of time, it is convenient to use this 
quantity to define a dimensionless time variable 7 = (a? / L?)t. 
Then show that the heat conduction equation reduces to 


x/L is 


0<€<1, t>0. 


au = a 0<€<1, 7>0. 
Ət Or’ ; 
25. Consider the equation 
du x, — bu; + cu = 0, (25) 


where a, b, and c are constants. 
a. Let u(x,t) = e? w(x, t), where 6 is constant, and find the 
corresponding partial differential equation for w. 
b. If b. zz 0, show that 6 can be chosen so that the partial 
differential equation found in part a has no term in w. Thus, 
by a change of dependent variable, it is possible to reduce 
equation (25) to the heat conduction equation. 


26. The heat conduction equation in two space dimensions is 

a? (uy, + uyy) = Uy. 
Assuming that u(x, y, f) =X (x) Y(y)T (2), find ordinary differential 
equations that are satisfied by X(x), Y(y), and T(t). 


27. The heat conduction equation in two space dimensions may be 
expressed in terms of polar coordinates as 


2 1 1 
^| Urr Up +t -3400 | = ur. 
r r 


Assuming that u(r, 0, t) = R(r)O(0)T(t), find ordinary differential 
equations that are satisfied by R(r), O(0), and T(t). 
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10.6 Other Heat Conduction Problems 


In Section 10.5 we considered the problem consisting of the heat conduction equation 


2 


“Uy, = ur, O<*e< L, t> 0; (1) 
the boundary conditions 
u(0O,t) 20, u(L,t)=0, t> 0, (2) 
and the initial condition 
u(x, 0) = f(x), O<x <L. (3) 
We found the solution to be 
= -n?r?a?t/ L? os (PTX 
u(x,t) = Sce S si (777). (4) 


n=l 


where the coefficients c, are the same as in the series 


oo 
nux 
= n Sin : 5 
fe) (5>) 6) 
The series in equation (5) is just the Fourier sine series for f, according to Section 10.4, its 
coefficients are given by 

f n )sin( 75 )a (6) 

"LES sin| —— : 

[7 ET x T x 


Hence the solution of the heat conduction problem, equations (1) to (3), is given by the series 
in equation (4) with the coefficients computed from equation (6). 

We emphasize that, at this stage, the solution (4) must be regarded as a formal solution; 
that is, we obtained it without rigorous justification of the limiting processes involved. Such a 
justification is beyond the scope of this book. However, once the series (4) has been obtained, 
it is possible to show that in the semi-infinite strip 0 < x < L,t > 0, it converges to 
a continuous function; that the derivatives ux» and u, can be computed by differentiating 
the series (4) term by term; and that the heat conduction equation (1) is indeed satisfied. 
The argument relies heavily on the fact that each term of the series (4) contains a negative 
exponential factor, and this results in relatively rapid convergence of the series. A further 
argument establishes that the function u given by equation (4) also satisfies the boundary and 
initial conditions; this completes the justification of the formal solution. 

It is interesting to note that although f satisfies the conditions of the Fourier Convergence 
Theorem (Theorem 10.3.1), it may have points of discontinuity. In this case the initial 
temperature distribution u(x, 0) = f(x) is discontinuous at one or more points. Nevertheless, 
the solution u(x,t) is continuous for arbitrarily small values of t > 0. This illustrates the 
fact that heat conduction is a diffusive process that instantly smooths out any discontinuities 
that may be present in the initial temperature distribution. Finally, since f is bounded, it 
follows from equation (6) that the coefficients c, are also bounded. Consequently, the presence 
of the negative exponential factor in each term of the series (4) guarantees that 


lim u(x,t) =0 (7) 
too 
for all x regardless of the initial condition. This is in accord with the result expected from 
physical intuition. 
We now consider two other problems of one-dimensional heat conduction that can be 
handled by the method developed in Section 10.5. 


Nonhomogeneous Boundary Conditions. Suppose now that one end of the bar is held 
at a constant temperature 7, and the other is maintained at a constant temperature 75. 


10.6 Other Heat Conduction Problems 


The differential equation (1) and the initial condition (3) remain unchanged. The boundary 
conditions are 


ült) = Tiyo u(L,t) =m, t>O0. (8) 


This problem is only slightly more difficult, because of the nonhomogeneous boundary 
conditions, than the one in Section 10.5. We can solve it by reducing it to a problem having 
homogeneous boundary conditions, which can then be solved as in Section 10.5. The technique 
for doing this is suggested by the following physical argument. 

After a long time—that is, as £ — oo—we anticipate that the temperature approaches 
a steady-state temperature distribution, which is independent of the time £ and the initial 
conditions; we denote this distribution as v(x). Since v(x) does not depend on f, its time 


derivative v, is zero and spatial derivatives are ordinary derivatives: v, = v’. The heat 
conduction equation (1) for v then becomes 
v(x) 20 O<x<L. (9) 


Hence the steady-state temperature distribution is a linear function of x. Further, v(x) must 
satisfy the boundary conditions 


v(0) = Ti, v(L) = Tə, (10) 
which are valid even as f — oo. The solution of equation (9) satisfying equations (10) is 


X 
va) 2 (n- T) r T. (11) 


Returning to the original problem, equations (1), (3), and (8), we will try to express u(x, t) 
as the sum of the steady-state temperature distribution v(x) and another (transient) temperature 
distribution w(x, t); thus we write 


u(x,t) = v(x) +w(x, t). (12) 


Since v(x) is given by equation (11), the problem will be solved provided that we can 
determine w(x,f). The boundary value problem for w(x, t) is found by substituting the 
expression in equation (12) for u(x, t) in equations (1), (3), and (8). 

From equation (1), we have 


a^(v + W)ax = (v +w); 
since v, = 0 and v, = 0, it follows that 
o^ Wx = Wee (13) 
Similarly, from equations (12), (8), and (10), 
w(0,t) = u(0, t) — v(0) = T; — T; = 0, 
w(L,t) =u(L,t) —v(L) = T3 — T; = 0. 


(14) 


Finally, from equations (12) and (3), 

w(x, 0) = u(x, 0) — v(x) = f(x) — v(x), (15) 
where v(x) is given by equation (11). Thus the transient part of the solution to the original 
problem is found by solving the problem consisting of equations (13), (14), and (15). This 


latter problem is precisely the one solved in Section 10.5, provided that f(x) — v(x) is now 
regarded as the initial temperature distribution. Hence 


oo 
X 272 nox 
u(x,t) = (b T) ^ i ene" of sin( 77), (16) 
n=1 


where 
= : f ( [yes e m T) i (Ja 17 
a s f(x 2 VI zo ur x. (17) 


This is another case in which a more difficult problem is solved by reducing it to a 
simpler problem that has already been solved. The technique of reducing a problem with 
nonhomogeneous boundary conditions to one with homogeneous boundary conditions by 
subtracting the steady-state solution has wide application. 
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EXAMPLE 1 


Consider the heat conduction problem 


Uxx = Ut, 0<x< 30, t>0, (18) 
u(0,t) = 20, u(30,t) 250, t>0, (19) 
u(x, 0) = 60 — 2x, 0 « x « 30. (20) 


Find the steady-state temperature distribution and the boundary value problem that determines the 
transient distribution. 


Solution: 


We seek the solution in the form u(x,t) = v(x) + w(x, t). The steady-state temperature v(x) 
satisfies v"(x) = 0 and the boundary conditions v(0) = 20 and v(30) = 50. Thus v(x) = 20 + x. 
The transient distribution w(x, t) satisfies the heat conduction equation 


Wxx = Wr, Ocx«L, t»0, (21) 
the homogeneous boundary conditions 
w(0,t) 20, w(30,1) = 0, t> 0, (22) 
and the modified initial condition 
w(x, 0) = 60 — 2x — (20 + x) = 40 — 3x, O<x< L. (23) 


Note that this problem is of the form (1), (2), (3) with f(x) = 40 — 3x, a? = 1, and L = 30. Thus 
the solution is given by equations (4) and (6). 

Figure 10.6.1 shows a plot of the initial temperature distribution 60 — 2x, the final temperature 
distribution 20 + x, and the temperature at three intermediate times found by solving equations (21) 
through (23). Note that while the initial condition does not satisfy the boundary conditions, the 
intermediate temperature satisfies the boundary conditions (19) for any t > 0. As t increases, the 
effect of the boundary conditions gradually moves from the ends of the bar toward its center. 


5 10 15 20 25 30 x 


Temperature distributions at several times 
using the first 10 terms of the series solution of the heat 
conduction problem in Example 1. 


Bar with Insulated Ends. A slightly different problem occurs if the ends of the bar are 
insulated so that there is no passage of heat through them. According to equation (2) in 
Appendix A, the rate of flow of heat across a cross section is proportional to the rate of change 
of temperature in the x direction. Thus, in the case of no heat flow, the boundary conditions 
are 


ux(0,t) =0, u,(L,t) 20, t>0. Q4) 


10.6 Other Heat Conduction Problems 


The problem posed by equations (1), (3), and (24) can also be solved by the method of 
separation of variables. If we let 


u(x,t) = X(x)T(t), (25) 
and substitute for u in equation (1), then it follows, as in Section 10.5, that 
where À is a constant. Thus we obtain again the two ordinary differential equations 
X"--AX 20, O<x<L. (27) 
fT +a7AT =0, t> 0. (28) 


For any value of A, a product of solutions of equations (27) and (28) is a solution of the partial 
differential equation (1). However, we are interested only in those solutions that also satisfy 
the boundary conditions (24). 

If we substitute for u(x, t) from equation (25) in the boundary condition at x = 0, we 
obtain X’(0)7(t) = 0. We cannot permit T(t) to be zero for all t, since then u(x, f) would 
also be zero for all t. Hence we must have 


X'(0) — 0. (29) 
Proceeding in the same way with the boundary condition at x — L, we find that 
X'(L) — 0. (30) 


Thus we wish to solve equation (27) subject to the boundary conditions (29) and (30). It is 
possible to show that nontrivial solutions of this problem can exist only if A is real. One way 
to show this is indicated in Problem 18; alternatively, we can appeal to a more general theory 
to be discussed in Section 11.2. We will assume that A is real and will consider, in turn, the 
three cases A < 0, A = 0, and A > 0. 


Case l: If A < 0, we let \ = —u?, where u is real and positive, so that equation (27) 
becomes X" — 4? X = 0. It is convenient to write its general solution as 


X(x) = kı sinh(jx) + kz cosh( ux) (31) 


so that the boundary conditions are easier to apply. In this case the boundary conditions can be 
satisfied only by choosing kı = kz = 0. Since this forces the solution to be the trivial solution 
u(x,t) = 0, it follows that À cannot be negative; in other words, the problem (27), (29), and 
(30) has no negative eigenvalues. 


Case ll: If \ = 0, then equation (27) is X" = 0, and therefore, 
X(x) = kqx + k2. (32) 


The boundary conditions (29) and (30) require that kı = O but do not determine kz. Thus 
A = 0 is an eigenvalue, corresponding to the eigenfunction X(x) = 1. For À = 0, it follows 
from equation (28) that T(t) is also a constant, which can be combined with k2. Hence, for 
A = 0, we obtain the constant solution uo(x, t) = 1. 


Case Ill: Finally, if A > 0, let \ = ju”, where p is real and positive. Then equation (27) 
becomes X" + jj? X = 0, and consequently, 


X(x) = kı sin( ux) + k cos( ux). (33) 


The boundary condition (29) requires that k; = 0, and the boundary condition (30) requires 
that u = na/L for n = 1,2,3, ... but leaves k arbitrary. Thus the problem (27), 
(29), and (30) has an infinite sequence of positive eigenvalues A, = non / L? with the 
corresponding eigenfunctions X,(x) = cos(nax/L). For the eigenvalue A,, the solution 
T, (t) of equation (28) is proportional to exp( —na?o i i), 

Combining these results, we have the following fundamental solutions for the problem 
(1), (3), and (24): 


u(x,t) = 1, 


Tm (34) 
ünl xt) = g MR ATEL cos (77), n-1.2,.:- 
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where arbitrary constants of proportionality have been dropped. Each of these functions 
satisfies the differential equation (1) and the boundary conditions (24). Because both the 
differential equation and the boundary conditions are linear and homogeneous, any finite linear 
combination of the fundamental solutions satisfies them. We will assume that this is true for 
convergent infinite linear combinations of fundamental solutions as well. Thus, to satisfy the 
initial condition (3), we assume that u(x, t) has the form 


oo 
u(x, t) = Sol, t) s X cst t) 


n=1 
c ia nux 
_ €0 —n?r?a?t/L? ( ) 
= — + Cne cos| —— }. 35 
5 ttn j (35) 
n=1 
The coefficients c,, are determined by the requirement that 


u(x,0) = 7 Y re cos( 75) = f(x). (36) 


Thus the unknown coefficients in equation (35) must be the coefficients in the Fourier cosine 
series of period 2L for f. Hence 


n — L 5 41,45 +e 7 
c m 4 X) COS X n 


With this choice of the coefficients co, c1, C2, .. . , the series (35) provides the solution to the 
heat conduction problem for a rod with insulated ends, equations (1), (3), and (24). 

Itis worth observing that the solution (35) can also be thought of as the sum of a steady- 
state temperature distribution (given by the constant co/2), which is independent of time 
t, and a transient distribution (given by the rest of the infinite series) that approaches zero 
in the limit as ¢ approaches infinity. That the steady-state is a constant is consistent with 
the expectation that the process of heat conduction will gradually smooth out the initial 
temperature distribution in the bar as long as no heat is allowed to enter from, or to escape 
to, the outside. The physical interpretation of the term 


1 L 
+= zf f(x)dx (38) 
0 


is that the steady-state temperature distribution is the mean value of the original temperature 
distribution. 


EXAMPLE 2 


Find the temperature u(x, t) in a metal rod of length 25 cm that is insulated on the ends as well as 
on the sides and whose initial temperature distribution is u(x, 0) = x for 0 < x < 25. 


Solution: 


The temperature in the rod satisfies the heat conduction problem (1), (3), and (24) with L — 25. 
Thus, from equation (35), the solution is 


oo 
es M p gg reu uis (2). (39) 
n=1 


where the coefficients are determined from equation (37). We have 
25 


2 
co = 25 : xdx — 25 (40) 


10.6 Other Heat Conduction Problems 


and, forn > 1, 


100 


50(cos(n7) — 1 ————, n odd; 
= a = (nr)? (41) 
0, n even. 
Thus 
25 100 «\ 1 222 nux 
ux) = mo 5 T LM ii cos( 2E ) (42) 
5121,3,5, ... 


is the solution of the given problem. 

For a? = 1, Figure 10.6.2 shows plots of the temperature distribution in the bar at several 
times. Again, the convergence of the series is rapid so that only a relatively few terms are needed to 
generate the graphs. 


5 10 15 20 25 x 


Temperature distributions at several times using the 
first five terms of the series solution (42) of the heat conduction 
problem in Example 2. 


More General Problems. The method of separation of variables can also be used to solve 
heat conduction problems with boundary conditions other than those given by equations (8) 
and equations (24). For example, the left end of the bar might be held at a fixed temperature 
T while the other end is insulated. In this case the boundary conditions are 


u(0,t) =T, u,(L,t) 20, t> 0. (43) 


The first step in solving this problem is to reduce the given boundary conditions to 
homogeneous ones by subtracting the steady-state solution. The resulting problem is solved 
by essentially the same procedure as in the problems previously considered. However, the 
extension of the initial function f outside of the interval [0, L] is somewhat different from 
that in any case considered so far (see Problem 15). 

A more general type of boundary condition occurs when the rate of heat flow through the 
end of the bar is proportional to the temperature. It is shown in Appendix A that the boundary 
conditions in this case are of the form 


ux(0, 1) — hiu(0, t) =0, ux(L,t) +hou(L,t) 20, t2 0, (44) 
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where h; and h, are nonnegative constants. If we apply the method of separation of variables 
to the problem consisting of equations (1), (3), and (44), we find that X(x) must be a solution 
of 

X"+rAX =0, X'(0) —hj,X(0) 20, X'(L)-h5X(L) —0, (45) 
where À is the separation constant. Once again, it is possible to show that nontrivial solutions 
can exist only for certain nonnegative real values of A, the eigenvalues, but these values are not 
given by a simple formula (see Problem 20). It is also possible to show that the corresponding 
solutions of equations (45), the eigenfunctions, satisfy an orthogonality relation and that we 
can satisfy the initial condition (3) by superposing solutions of equations (45). However, the 
resulting series is not included in the discussion of this chapter. There is a more general theory 
that covers such problems, and it is outlined in Chapter 11. 


Problems 


In each of Problems 1 through 8, find the steady-state solution of the 


heat conduction equation a*u,, = u; that satisfies the given set of 
boundary conditions. 

1. u(0,t) = 10, u(50, f) = 40 

2. u(0,t) = 30, u(40, rt) = —20 

3. ux(0,t) 20, u(L,t) 20 

4. u,(0,t) 20, u(L,t) =T 

5. u(0,t) 20, ux(L,t) =0 

6. u(0,t) =T, u,(L.t) 20 

7. ux(0,t) —u(0,t) 20, u(L,t) =T 

8. u(0,0 =T, u,(L, t) -u(L,t) 20 

9. Letan aluminum rod of length 20 cm be initially at the uniform 


temperature of 25°C. Suppose that at time t = 0, the end x = O is 
cooled to 0°C while the end x = 20 is heated to 60°C, and both are 
thereafter maintained at those temperatures. 
a. Find the temperature distribution in the rod at any time t. 
Q b. Plot the initial temperature distribution, the final (steady- 
state) temperature distribution, and the temperature distributions 
at two representative intermediate times on the same set of axes. 
Q3 c. Plot u versus t for x = 5, 10, and 15. 
Q d. Estimate how much time must elapse before the 
temperature at x — 5 cm comes (and remains) within 196 of 
its steady-state value. 


10. a. Let the ends of a copper rod 100 cm long be maintained 
at 0°C. Suppose that the center of the bar is heated to 100°C 
by an external heat source and that this situation is maintained 
until a steady-state results. Find this steady-state temperature 
distribution. 

b. Ata time t = O (after the steady-state of part a has been 
reached), let the heat source be removed. At the same instant, let 
the end x = 0 be placed in thermal contact with a reservoir at 
20°C, while the other end remains at 0°C. Find the temperature 
as a function of position and time. 

Q c. Plot u versus x for several values of t. Also plot u versus 
t for several values of x. 

@ d. What limiting value does the temperature at the center of 
the rod approach after a long time? How much time must elapse 
before the center of the rod cools to within 1°C of its limiting 
value? 


11. Consider a rod of length 30 for which o? — 1. Suppose the 
initial temperature distribution is given by u(x, 0) — x(60 — x)/30 
and that the boundary conditions are u(0, t) = 30 and 4(30, t) = 0. 
a. Find the temperature in the rod as a function of position and 
time. 
Q b. Plot u versus x for several values of t. Also plot u versus 
t for several values of x. 
Q c. Plot u versus t for x = 12. Observe that u initially 
decreases, then increases for a while, and finally decreases to 
approach its steady-state value. Explain physically why this 
behavior occurs at this point. 


12. Consider a uniform rod of length L with an initial temperature 
given by u(x, 0) = sin(zx/ L),0 x x < L. Assume that both ends 
of the bar are insulated. 

a. Find the temperature u(x, t). 

b. What is the steady-state temperature as f — oo? 

O C. Leta? = 1 and L = 40. Plot u versus x for several values 

of t. Also plot u versus t for several values of x. 

d. Describe briefly how the temperature in the rod changes as 

time progresses. 


13. Considera bar of length 40 cm whose initial temperature is given 
by u(x, 0) = x(60 — x) /30. Suppose that a? = 1/4 cm?/s and that 
both ends of the bar are insulated. 

a. Find the temperature u(x, t). 

Q b. Plot u versus x for several values of t. Also plot u versus 

t for several values of x. 

c. Determine the steady-state temperature in the bar. 

Q d. Determine how much time must elapse before the 

temperature at x = 40 comes within 1°C of its steady-state value. 


14. Consider a bar 30 cm long that is made of a material for 
which a? = 1 and whose ends are insulated. Suppose that the initial 
temperature is zero except for the interval 5 « x « 10, where the 
initial temperature is 25°C. 
a. Find the temperature u(x, t). 
Q b. Plot u versus x for several values of t. Also plot u versus 
t for several values of x. 
O c. Plot u(4, t) and u(11, t) versus t. Observe that the points 
x —4 and x = 11 are symmetrically located with respect to 
the initial temperature pulse, yet their temperature plots are 
significantly different. Explain physically why this is so. 


15. Consider a uniform bar of length L having an initial temperature 
distribution given by f(x), 0 < x < L. Assume that the temperature 
at the end x = O is held at 0°C, while the end x = L is insulated so 
that no heat passes through it. 
a. Show that the fundamental solutions of the partial differential 
equation and boundary conditions are 


"FORI arem (e m ) 


n-—1,2,3,...5 


b. Find a formal series expansion for the temperature u(x, t) 
oo 
u(x,t) = J Calal Xt) 


n=1 


that also satisfies the initial condition u(x,0) = f(x). 

Hint: Even though the fundamental solutions involve only the 
odd sines, it is still possible to represent f by a Fourier series 
involving only these functions. See Problem 39 of Section 10.4. 


16. In the bar of Problem 15, suppose that L = 30, a? = 1, and the 
initial temperature distribution is f(x) = 30 — x for0 < x < 30. 
a. Find the temperature u(x, t). 
Q b. Plot u versus x for several values of r. Also plot u versus 
t for several values of x. 
c. How does the location x,, of the warmest point in the bar 
change as t increases? Draw a graph of xm versus t. 
Q d. Plot the maximum temperature in the bar versus t. 


17. Suppose that the conditions are as in Problems 15 and 16 except 
that the boundary condition at x = 0 is u(0, t) = 40. 
a. Find the temperature u(x, t). 
Q b. Plot u versus x for several values of r. Also plot u versus 
t for several values of x. 
C. Compare the plots you obtained in this problem with those 
from Problem 16. Explain how the change in the boundary 
condition at x = causes the observed differences in the 
behavior of the temperature in the bar. 


18. Consider the boundary value problem 


X"+AX=0, X(0)50, X(L)=0. (46) 


Let \ = p?, where p = v + ic with v and ø real. Show that if 
o # 0, then the only solution of equations (46) is the trivial solution 
X(x) = 0. Hint: Use an argument similar to that in Problem 23 of 
Section 10.1. 


19. The right end of a bar of length a with thermal conductivity « 
and cross-sectional area A, is joined to the left end of a bar of thermal 
conductivity x and cross-sectional area A>. The composite bar has a 
total length L. Suppose that the end x = 0 is held at temperature zero, 
while the end x = L is held at temperature T. Find the steady-state 
temperature in the composite bar, assuming that the temperature and 
rate of heat flow are continuous at x = a. Hint: See equation (2) of 
Appendix A. 


20. Consider the problem 


O<a< bet 0 
t0; 
O<x<L. 


(47) 


ou, = lt, 
u(0,t) =0, ux(L,t) + yu(L, t) — 0, 
u(x,0) = f(x), 
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a. Letu(x,t) = X(x)T(t), and show that 


X”"+\X=0, X(0)=0, X(L)+7X(L) =0, (48) 


and 
T' + Ao?T =0, 


where A is the separation constant. 

b. Assume that is real, and show that problem (48) has no 
nontrivial solutions if A < 0. 

C. If A > 0, let à = p? with jj > 0. Show that problem (48) 
has nontrivial solutions only if is a solution of the equation 


p cos(u L) + y sin( uL) = 0. (49) 
d. Rewrite equation (49) as tan(uL) = -—j/^. Then, by 
drawing the graphs of y = tan(uL) and y = —p/y for 


u > 0 on the same set of axes, show that equation (49) is 
satisfied by infinitely many positive values of p; denote these 
by H1, H2. --- , Hn, e , Ordered in increasing size. 
e. Determine the set of fundamental solutions 
corresponding to the values ju, found in part d. 


un(x, t) 


An External Heat Source. Consider the heat conduction problem in 
a bar that is in thermal contact with an external heat source or sink. 
Then the modified heat conduction equation is 


Uy = Oxy +5(X), (50) 


where the term s(x) describes the effect of the external agency; s(x) is 
positive for a source and negative for a sink. Suppose that the boundary 
conditions are 


u(0,t) =T, u(L,t) 2 T; (51) 
and the initial condition is 
u(x,0) = f(x). (52) 
Problems 21 through 23 deal with this kind of problem. 
21. Write u(x,t) = v(x) + w(x,t), where v and w are the 


steady-state and transient parts of the solution, respectively. State the 
boundary value problems that v(x) and w(x, t), respectively, satisfy. 
Observe that the problem for w is the fundamental heat conduction 
problem discussed in Section 10.5, with a modified initial temperature 
distribution. 


22. a. Suppose that a? = 1 and s(x) = k, a constant, in equation 
(50). Find v(x). 
b. Assume that T; = 0, T; = 0, L = 20, k = 1/5, and that 
f(x) = 0for0 < x < L. Determine w(x, t). 
Q c. Plot u(x,t) versus x for several values of t; on the same 
axes. 
Q d. Plot the steady-state part of the solution v(x). 


23. a. Let o? = | and s(x) = kx/L, where k is a constant, in 
equation (50). Find v(x). 
b. Assume that T, — 10, T; — 30, L — 20, k — 1/2, and that 
f(x) = 0for0 < x < L. Determine w(x, t). 
Q c. Plot u(x,t) versus x for several values of t; on the same 
axes. 
Q d. Plot the steady-state part of the solution v(x). 
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10.7 The Wave Equation: Vibrations 
of an Elastic String 


A second partial differential equation that occurs frequently in applied mathematics is the 
wave equation. Some form of this equation, or a generalization of it, almost inevitably arises in 
any mathematical analysis of phenomena involving the propagation of waves in a continuous 
medium. For example, the studies of acoustic waves, water waves, electromagnetic waves, 
and seismic waves are all based on this equation. 

Perhaps the easiest situation to visualize occurs in the investigation of mechanical 
vibrations. Suppose that an elastic string of length L is tightly stretched between two supports 
at the same horizontal level so that the x-axis lies along the string (see Figure 10.7.1). 


x20 x-L 


fil vale!) A vibrating string. 


The elastic string may be thought of as a guitar string, a guy wire, or possibly an electric 
power line. Suppose that the string is set in motion (for example, by plucking) so that it vibrates 
in a vertical plane, and let u(x, t) denote the vertical displacement experienced by the string 
at the point x at time f. If damping effects, such as air resistance, are neglected, and if the 
amplitude of the motion is not too large, then u(x, t) satisfies the partial differential equation 


a us, = Ug (1) 


in the domain 0 < x < L,t > 0. Equation (1) is known as the one-dimensional wave 
equation!? and is derived in Appendix B at the end of the chapter. The constant coefficient 
a? appearing in equation (1) is given by 


=L, (2) 


where T is the tension (force) in the string, and p is the mass per unit length of the string 
material. It follows that a has the units of length/time —that is, of velocity. In Problem 14 it is 
shown that a is the velocity of propagation of waves along the string. 

To describe the motion of the string completely, it is necessary also to specify suitable 
initial and boundary conditions for the displacement u(x, t). The ends are assumed to remain 
fixed, and therefore the boundary conditions are 


u(0,t) 20, u(L,t)=0, t>0. (3) 


Since the differential equation (1) is of second order with respect to f, it is plausible to prescribe 
two initial conditions. These are the initial position of the string 


u(x, 0) = f(x), O<x<L (4) 
and its initial velocity 


u;(x, 0) — g(x), O<x<L, (5) 


"The solution of the wave equation was one of the major mathematical problems of the mid-eighteenth century. The 
wave equation was first derived and studied by d’Alembert in 1746. It also attracted the attention of Euler (1748), 
Daniel Bernoulli (1753), and Lagrange (1759). Solutions were obtained in several different forms, and the merits 
of, and relations among, these solutions were argued, sometimes heatedly, in a series of papers extending over more 
than 25 years. The major points at issue concerned the nature of a function and the kinds of functions that can be 
represented by trigonometric series. These questions were not resolved until the nineteenth century. 
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where f and g are given functions. In order for equations (3), (4), and (5) to be consistent, it 
is also necessary to require that 


f(0) = f(L) =0, g(0) = g(L) = 0. (6) 


The mathematical problem then is to determine the solution of the wave equation (1) that 
also satisfies the boundary conditions (3) and the initial conditions (4) and (5). Like the heat 
conduction problem of Sections 10.5 and 10.6, this problem is an initial value problem in the 
time variable t and a boundary value problem in the space variable x. Alternatively, it can be 
considered as a boundary value problem in the semi-infinite strip 0 < x « L,t > 0 of the 
xt-plane (see Figure 10.7.2). One condition is imposed at each point on the semi-infinite sides, 
and two are imposed at each point on the finite base. 


t 


didus Boundary value problem for the wave equation. 


It is important to realize that equation (1) governs a large number of other wave problems 
besides the transverse vibrations of an elastic string. For example, it is only necessary to 
interpret the function u and the constant a appropriately to have problems dealing with water 
waves in an ocean, acoustic or electromagnetic waves in the atmosphere, or elastic waves in a 
solid body. If more than one space dimension is significant, then equation (1) must be slightly 
generalized. The two-dimensional wave equation is 


a?(Uxx + Uyy) = un. (7) 


This equation would arise, for example, if we considered the motion of a thin elastic sheet, 
such as a drumhead. Similarly, in three (spatial) dimensions the wave equation is 


a’ (us, F Uyy + ul.) = ug. (8) 


In connection with the latter two equations, the boundary and initial conditions must also be 
suitably generalized. 

We now solve three boundary value problems involving the wave equation in one 
dimension. 


Elastic String with Nonzero Initial Displacement. First suppose that the string is 
disturbed from its equilibrium position and then released at time t = O with zero velocity 
to vibrate freely. Then the vertical displacement u(x, t) must satisfy the wave equation (1) 


A Ux, = ug, ÜD«xwe d. 5c 
the boundary conditions (3) 
u(0,1) 20, u(L,t) =0, t > 0; 
and the initial conditions 


u(x, 0) = f(x), u(x, 0) = 0, O<x aL, (9) 


where f is a given function describing the configuration of the string at t = 0. 
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The method of separation of variables can be used to obtain the solution of equations (1), 
(3), and (9). Assuming that 


u(x,t) = X(x)T(t) (10) 


and substituting for u in equation (1), we obtain 


x" 1. T" 
X æT 


X; (11) 


where A is a separation constant. Thus we find that X(x) and T(t) satisfy the ordinary 
differential equations 


X" - AX =0, (12) 
T" c a?AT =0. (13) 


Further, by substituting from equation (10) for u(x, t) in the boundary conditions (3), we find 
that X(x) must satisfy the boundary conditions 


X(0) =0, X(L)-0. (14) 


Finally, by substituting from equation (10) into the second of the initial conditions (9), we also 
find that T(t) must satisfy the initial condition 


T'(0) — 0. (15) 


Our next task is to determine X(x), T(t), and A by solving equation (12) subject to 
the boundary conditions (14) and solving equation (13) subject to the initial condition 
(15). 

The problem of solving the differential equation (12) subject to the boundary conditions 
(14) is precisely the same problem that arose in Section 10.5 in connection with the heat 
conduction equation. Thus we can use the results obtained there and at the end of Section 
10.1: the problem (12), (14) has nontrivial solutions if and only if À is an eigenvalue 


nn 
Ay WH 1,25 305-3 (16) 


and X(x) is proportional to the corresponding eigenfunction sin(n7 x / L). 
Using the values of A given by equation (16) in equation (13), we obtain 


j n?ra? 
T aT =n (17) 
Therefore, 
T(t) k nmat dd nm at (18) 
= kı cos| ——— sin| — |J, 
f L A L 


where kı and k3 are arbitrary constants. The initial condition (15) requires that k2 = 0, so T(t) 
must be proportional to cos(n7 at / L). 
Thus the functions 


. (nux nit at 
us(x,t) = sin(*) cos 5 W2L2,. (19) 
L E 


satisfy the partial differential equation (1), the boundary conditions (3), and the second initial 
condition (9). These functions are the fundamental solutions for the given problem. 

To satisfy the remaining (nonhomogeneous) initial condition (9), we will consider a 
superposition of the fundamental solutions (19) with properly chosen coefficients. Thus we 
assume that u(x, t) has the form 


u(x,t) = 3 CUl at) = » sin(“) es( 27%), (20) 


n=1 n=1 
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where the constants c, remain to be chosen. The initial condition u(x,0) = f(x) requires 
that 


u(x,0) = a si( 77.) = f(x). (21) 


n=1 


Consequently, the coefficients c, must be the coefficients in the Fourier sine series of period 
2L for f; hence 


n f( ) sin( 75 )a , = 1 2 (22) 
n — T 5 Sy tee n 
C =A X X n 


Thus the formal solution of the problem of equations (1), (3), and (9) is given by equation (20) 
with the coefficients calculated from equation (22). 

For a fixed value of n, the expression sin(n7 x / L) cos(nxat/ L) in equation (19) is 
periodic in time t with the period 2L/na; it therefore represents a vibratory motion of 
the string having this period, or having the frequency n7a/ L. The quantities nza/L for 
n — 1,2, ... are the natural frequencies of the string—that is, the frequencies at which 
the string will freely vibrate. The factor sin(n7 x/ L) represents the displacement pattern 
occurring in the string when it is executing vibrations of the given frequency. Each 
displacement pattern is called a natural mode of vibration and is periodic in the space variable 
x; the spatial period 2L / n is called the wavelength of the mode of frequency n7 a / L. Thus the 
eigenvalues n?r?/ L? of the problem (12), (14) are proportional to the squares of the natural 
frequencies, and the eigenfunctions sin(7 x / L) give the natural modes. The first three natural 
modes are sketched in Figure 10.7.3. The total motion of the string, given by the function 
u(x, t) of equation (20), is thus a combination of the natural modes of vibration and is also a 
periodic function of time with period 2L /a. 


u 
1 


(a) (b) (c) 


First three fundamental modes of vibration of an elastic 
string. (a) Frequency = 7 a/ L, wavelength = 2L; (b) frequency = 27a/L, 
wavelength = L; (c) frequency = 3za/ L, wavelength = 2L /3. 


EXAMPLE 1 


Consider a vibrating string of length L — 30 that satisfies the wave equation 


Au, — ug, O<x< 30, t0. (23) 


Assume that the ends of the string are fixed and that the string is set in motion with no initial 
velocity from the initial position 
x/10, 0 € x x 10, 


u(x,0) = f(x) = s —x)/20, 10< x x 30. di 


Find the displacement u(x, t) of the string, and describe its motion through one period. 


Solution: 


The solution is given by equation (20) with a — 2 and L — 30; that is, 


oo 
2 
u(x, t) =X o (^2) cos( x) (25) 


n-l 


508 


CHAPTER 10 Partial Differential Equations and Fourier Series 


where c, is calculated from equation (22). Substituting from equation (24) into equation (22), we 


obtain 
2 10 X c nux 2 30 30—x . (nmx 
c= sin dx + sin dx. (26) 
30 Jy 10 30 30 Jip 20 30 
By evaluating the integrals in equation (26), we find that 
9 [nm 
mox m 3 " n-o1,2, RE (27) 


The solution (25), (27) gives the displacement of the string at any point x at any time t. The motion 
is periodic in time with period 30, so it is sufficient to analyze the solution for 0 < t < 30. 

The best way to visualize the solution is by a computer animation showing the dynamic behavior 
of the vibrating string. Here, we indicate the motion of the string in Figures 10.7.4, 10.7.5, and 
10.7.6. Plots of u versus x for t — 0, 4, 7.5, 11, and 15 are shown in Figure 10.7.4. Observe that the 
maximum initial displacement is positive and occurs at x = 10, while at t = 15, a half-period later, 
the maximum displacement is negative and occurs at x — 20. The string then retraces its motion and 
returns to its original configuration at t — 30. Figure 10.7.5 shows the behavior of the points x — 0, 5, 
10, 15, and 25 by plots of u versus t for these fixed values of x. The plots confirm that the motion is 
indeed periodic with period 30 and illustrate that each point remains motionless for a substantial part 
of each period. Figure 10.7.6 shows a three-dimensional plot of u versus both x and f, from which 


Plots of u versus x for t = 0, 4, 7.5, 11, and 15 using the first 
25 terms of the series solution (25) of the string in Example 1. 


Plots of u versus t for x — 0, 5, 10, 15, and 25 using the first 
25 terms of the series solution of the string in Example 1. 
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the overall nature of the solution is apparent. Of course, the curves in Figures 10.7.4 and 10.7.5 lie 
on the surface shown in Figure 10.7.6. 


u 


Plot of u versus x and ¢ using the first 25 terms of 
the series solution (25) of the string in Example 1. 


Justification of the Solution. As in the heat conduction problem considered earlier, 
equation (20) with the coefficients c, given by equation (22) is only a formal solution of 
equations (1), (3), and (9). To determine whether equation (20) actually represents the solution 
of the given problem requires some further investigation. As in the heat conduction problem, it 
is tempting to try to show this directly by substituting equation (20) for u(x, t) in equations (1), 
(3), and (9). However, upon formally computing uxx, for example, we obtain 


oo 
nn \2 nix nnat 
Uxx(x, t) = -Ma(T) sin( 77.) cos( L i 


1— 


due to the presence of the n? factor in the numerator, this series may not converge. This 
would not necessarily mean that the series (20) for u(x,t) is incorrect, but only that the 
series (20) cannot be used to calculate uxx and us. A basic difference between solutions 
of the wave equation and those of the heat conduction equation is that the latter contain 
negative exponential terms that approach zero very rapidly with increasing n, which ensures 
the convergence of the series solution and its derivatives. In contrast, series solutions of the 
wave equation contain only oscillatory terms that do not decay with increasing n. 

However, there is an alternative way to establish the validity of equation (20) indirectly. 
At the same time, we will gain additional information about the structure of the solution. First 
we will show that equation (20) is equivalent to 


u(x,t) = 5 (hG = ar) + h(x +at)), (28) 


where A is the function obtained by extending the initial data f into (— L, 0) as an odd function, 
and to other values of x as a periodic function of period 2L. That is, 


f fi), 0zxzL, 
Ke -L< x< 0; 


h(x +2L) = h(x). 


To establish equation (28), note that / has the Fourier series 


h(x) = i si( 7), (30) 


n=1 


(29) 
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where c, is given by equation (22). Then, using the trigonometric identities for the sine of a 
sum or difference, we obtain 


h( t) > sin(77—) cos nuar cos (777) sin _ 
x —at) = es —— 2 
L L L È 


n=1 


Wtan » sin(“™*) Us nm at eos (777) sin nv at 
X cat) = Ch —— dichas ! 
L L L L 


n=1 


and equation (28) follows immediately upon adding the last two equations. From equation (28), 
we see that u(x, t) is continuous for 0 < x < L,t > 0, provided that h is continuous on the 
interval (—oo, oo). This requires f to be continuous on the original interval [0, L]. Similarly, 
u is twice continuously differentiable with respect to either variable in 0 < x < L,t > 0, 
provided that h is twice continuously differentiable on (—oo, co). This requires f’ and f" to 
be continuous on [0, L]. Furthermore, since h” is the odd extension of f", we must also have 
f"(0) = f'"(L) = 0. However, since h’ is the even extension of f’, no further conditions 
are required on f’. Provided that these conditions are met, ux» and up can be computed 
from equation (28), and it is an elementary exercise to show that these derivatives satisfy the 
wave equation. Some of the details of the argument just indicated are given in Problems 19 
and 20. 

If some of the continuity requirements stated in the last paragraph are not met, then u is 
not differentiable at some points in the semi-infinite strip 0 < x < L,t > O0, and thus is 
a solution of the wave equation only in a somewhat restricted sense. Àn important physical 
consequence of this observation is that if there are any discontinuities present in the initial 
data f, then they will be preserved in the solution u(x, f) for all time. In contrast, in heat 
conduction problems, initial discontinuities are instantly smoothed out (Section 10.6). Suppose 
that the initial displacement f has a jump discontinuity at x = x9, 0 < xo < L. Since A is the 
odd periodic extension of f, the same discontinuity is present in h(€) at £ = xo + 2nL and at 
€ = —xg + 2nL, where n is any integer. Thus A(x —at) is discontinuous at points (x, f) when 
x — at = xo + 2nL or when x — at = —xo + 2nL. For a fixed x in [0, L], the discontinuity 
that was originally at xo will reappear in h(x — at) att = (x + xo — 2nL) /a. Similarly, 
h(x -- at) is discontinuous at the point x at f = (—x + xo +2mL) /a, where m is any integer. 
If we refer to equation (28), it then follows that the solution u(x, t) is also discontinuous at the 
given point x at these values of t. Since the physical problem is posed for t > 0, only those 
values of m and n that yield positive values of t are of interest. 


Elastic String with Nonzero Initial Velocity. Let us modify the problem just considered 
by supposing that the string is set in motion from its equilibrium position with a given velocity. 
Then the vertical displacement u(x, t) must satisfy the wave equation 


Guy, = ug, 0<x< L, t>0; 
the boundary conditions 
u(0,t) 20, u(L,t) =0, t > 0; 
and the initial conditions 
u(x,0) 20, u(x,0) =g(x), OK<xK<L, (31) 


where g(x) is the initial velocity at the point x of the string. 

The solution of this new problem can be obtained by following the procedure described 
above for the problem (1), (3), and (9). Upon separating variables, we find that the problem 
for X(x) is exactly the same as before. Thus, once again, A = na? [ L? and X(x) is 
proportional to sin(n7 x / L). The differential equation for T(t) is also the same as before 
(see equation (17)), 


nena? 


L? 


T" + T —0, (32) 


but the associated initial condition is now 


T(0) — 0, (33) 
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corresponding to the first of the initial conditions (31). Recall that the general solution of 


equation (32) is 
T(t) =k nn at ky si nu at 34 
= K] COS UE + Ka SiN tX š (34) 


but now the initial condition (32) requires that kı = 0. Therefore, T(t) is now proportional to 
sin(n7 at / L), and the fundamental solutions for the problem (1), (3), and (31) are 


. (nixN , (nrat 
us(x,t) = sin(*7—) sin ; ~- WH 1,2, Bed es (35) 
L L 


Each of the functions u,,(x, t) satisfies the wave equation (1), the boundary conditions (3), and 
the first of the initial conditions (31). The main consequence of using the initial conditions (31) 
rather than (9) is that the time-dependent factor in u,(x, t) involves a sine rather than a cosine. 

To satisfy the remaining (nonhomogeneous) initial condition, we assume that u(x, t) can 
be expressed as a linear combination of the fundamental solutions (33); that is, 


u(x,t) = RU c ja si (777) sin( 7). (36) 


n=1 n=1 


To determine the values of the coefficients k,, we differentiate equation (34) with respect to 
t, set t = 0, and use the second initial condition (31); this gives the equation 


u(x, = > kn sin(—) = g(x). (37) 


n=l 


Hence the quantities ky are the coefficients in the Fourier sine series of period 2L for g. 
Therefore, 
iiis et zr g(x) sin(77— Jas, je os (38) 
L L Jo L 
Thus equation (34), with the coefficients given by equation (36), constitutes a formal solution 
to the problem of equations (1), (3), and (31). The validity of this formal solution can be 


established by arguments similar to those previously outlined for the solution of equations (1), 
(3), and (9). 


General Problem forthe Elastic String. Finally, we turn to the problem consisting of the 
wave equation (1), the boundary conditions (3), and the general initial conditions (4), (5): 


u(x,0) = f(x), ui(x,0) —-g(x), 0<x< L, (39) 


where f(x) and g(x) are the given initial position and velocity, respectively, of the string. 
Although this problem can be solved by separating variables, as in the cases discussed 
previously, it is important to note that it can also be solved simply by adding together the 
two solutions that we obtained above. To show that this is true, let v(x, t) be the solution of 
the problem (1), (3), and (9), and let w(x, t) be the solution of the problem (1), (3), and (31). 
Thus v(x, t) is given by equations (20) and (22), and w(x, t) is given by equations (34) and 
(36). Now let u(x, t) = v(x, t) -- w(x, t). What problem does u(x, t) satisfy? First, observe 
that 


a ^us, — ug = (avg — Vu) + (A? Wax — wu) 204-0 — 0, (40) 
so u(x, t) satisfies the wave equation (1). Next, we have 
u(0,t) = v(0,t) +w(0,t) =0+0=0, u(L,t) =v(L,t)+w(L,t) 2040-0, (4D 
so u(x, t) also satisfies the boundary conditions (3). Finally, we have 
u(x, 0) = v(x,0) 4-w(x,0) = f(x) +0= f(x) (42) 
and 
u(x, 0) = v(x, 0) 4-w,(x,0) 20-4 g(x) = g(x). (43) 
Thus u(x, t) satisfies the general initial conditions (37). 
We can restate the result we have just obtained in the following way. To solve the wave 
equation with the general initial conditions (37), you can solve instead the somewhat simpler 


problems with the initial conditions (9) and (31), respectively, and then add together the two 
solutions. This is another use of the principle of superposition. 
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Problems 


Consider an elastic string of length L whose ends are held fixed. The 
string is set in motion with no initial velocity from an initial position 
u(x,0) = f(x). In each of Problems 1 through 4, carry out the 
following steps. Let L = 10 and a = 1 in parts (b) through (d). 

a. Find the displacement u(x, t) for the given initial position 

f(x). 

@ b. Plot u(x, t) versus x for0 < x < 10 and for several values 

of t between t = 0 and t = 20. 

Q c. Plot u(x,t) versus t for 0 < t < 20 and for several values 


of x. 
Ga d. Construct an animation of the solution in time for at least 
one period. 
e. Describe the motion of the string in a few sentences. 
2x/L, 0 <x <L/2, 
1. f(x) = 
AL—x)/L, L/2<x<L 
4x/L, 0<x<L/4, 
2. f(x) =4¢1, L/4< x « 3L/4, 
4(L—x)/L, 3L/4<x<L 


3. f(x) = 8x(L — x)?/L 
0 0<x<L/2-1 

f(x)=<4 1, L/2-1<x< L/2+1 (assume L > 2), 
0 L/2+1<x<1 


S 


Consider an elastic string of length L whose ends are held fixed. The 
string is set in motion from its equilibrium position with an initial 
velocity u;(x, 0) — g(x). In each of Problems 5 through 8, carry out 
the following steps. Let L — 10 and a — 1 in parts (b) through (d). 
a. Find the displacement u(x, t) for the given g(x). 
Q b. Plot u(x, t) versus x for0 < x < 10 and for several values 
of t between t = 0 and t = 20. 
O c. Plot u(x,t) versus t for 0 < t < 20 and for several values 
of x. 
@ d. Construct an animation of the solution in time for at least 
one period. 
e. Describe the motion of the string in a few sentences. 


2x/L, O<x<L/2, 
5. g(x) = 

XL-x)L, L/2<x<L 

4x/L, 0<x<L/4, 
6. g(x) = 41, L/4< x « 3L/4, 

4(L—x)/L, 3L/4<x<L 


7. g(x) = 8x(L —x)?/D? 
0 O<x<L/2-1 

g(x) = <1, L/2-1<x< L/2+1 (assume L > 2), 
0, L/2+1<x<1 


e 


9. If an elastic string is free at one end, the boundary condition 
to be satisfied there is that ux = 0. Find the displacement u(x, t) 
in an elastic string of length L, fixed at x = O and free at x = 
L, set in motion with no initial velocity from the initial position 
u(x,0) = f(x), where f is a given function. Hint: Show that 
the fundamental solutions for this problem, satisfying all conditions 
except the nonhomogeneous initial condition, are 


Un(x,t) = sin(A nx) cos( ànat), 


where A, 2 (2n — 1)7/(2L),n = 1,2, ... . Compare this problem 
with Problem 15 of Section 10.6; pay particular attention to the 
extension of the initial data out of the original interval [0, L]. 


10. Consider an elastic string of length L. The end x = 0 is held 
fixed, while the end x — L is free; thus the boundary conditions are 
u(O,t) = 0 and u,(L,t) = 0. The string is set in motion with no 
initial velocity from the initial position u(x, 0) — f(x), where 


0 O<x<L/2-1 
f(x) =< 1, L/2—-1«x « L/2+1 (assume L > 2), 
0, L/2+1<x<1 


a. Find the displacement u(x, t). 

[G] b. With L = 10 and a = 1, plot u versus x for0 < x < 10 
and for several values of t. Pay particular attention to values of t 
between 3 and 7. Observe how the initial disturbance is reflected 
at each end of the string. 

O c. With L = 10 anda = 1, plot u versus t for several values 
of x. 

(3 d. Construct an animation of the solution in time for at least 
one period. 

e. Describe the motion of the string in a few sentences. 


© 11. Suppose that the string in Problem 10 is started instead from 
the initial position f(x) = 8x(L — x)?/ L?. Follow the instructions 
in Problem 10 for this new problem. 
12. Dimensionless variables can be introduced into the wave 
equation ayy = u; in the following manner: 

a. Lets = x/L and show that the wave equation becomes 


Y 
aug, = Lys. 


b. Show that L/a has the dimensions of time and therefore can 
be used as the unit on the time scale. Let 7 = at/L and show 
that the wave equation then reduces to 


Uss = Urr. 


Problems 13 and 14 indicate the form of the general solution of the 
wave equation and the physical significance of the constant a. 


13. a. Show that the wave equation 
a^, = Utt 


can be reduced to the form uz; = 0 by the change of variables 
€=x-at,n =x +at. 
b. Show that u(x,t) can be written as 


u(x,t) = d(x — at) + v(x +a), 


where ó and v are arbitrary functions. 


14. Qa. Plot the value of d(x — at) for t = 0, 1/a, 2/a, and 
to/a if ¢(s) = sins. Note that for any £ 4 0, the graph of 
y = @(x — at) is the same as that of y = (x) when t = 0, 
but displaced a distance at in the positive x direction. Thus a 
represents the velocity at which a disturbance moves along the 
string. 
b. What is the interpretation of ó (x + at)? 
15. A steel wire 5 ft in length is stretched by a tensile force of 50 Ib. 
The wire has a weight per unit length of 0.026 Ib/ft. 
a. Find the velocity of propagation of transverse waves in the 
wire. 
b. Find the natural frequencies of vibration. 
C. If the tension in the wire is increased, how are the natural 
frequencies changed? Are the natural modes also changed? 


16. Consider the wave equation 
A ugs = d 
in an infinite one-dimensional medium subject to the initial conditions 
u(x,0) = f(x), u(x, 0) = 0, 


—0 < x < OO. 


a. Using the form of the solution obtained in Problem 13, show 
that ó and w must satisfy 


$(x) - 9G) = fO). 
—ó'(X) + 0x) — 0. 


b. Solve the equations of part a for @ and 7, and thereby show 
that 


u(x,t) = je — at) + f(x +at)). 


This form of the solution was obtained by d’Alembert in 1746. 
Hint: Note that the equation w/(x) = dó'(x) is solved by 
choosing (x) = @(x) +c. 


ĉi Let 
fo) 2, -l<x<l, 
= 
0, otherwise. 

Show that 

2, —l+at< x< l+at, 

f(x—at) = dios i 
0, otherwise. 


Also determine f(x + at). 

d. Sketch the solution found in part b at t = 0, t = 1/2a, 
t = l/a, and t = 2/a, obtaining the results shown in Figure 
10.7.7. Observe that an initial displacement produces two waves 
moving in opposite directions away from the original location; 
each wave consists of one-half of the initial displacement. 


didus tura Propagation of an initial disturbance in an infinite 
one-dimensional medium. 
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17. Consider the wave equation 
2 
a Uyy = Ug 
in an infinite one-dimensional medium subject to the initial conditions 


u(x,0) 20, u;(x,0 —g(x), —oo«x « oo. 


a. Using the form of the solution obtained in Problem 13, show 
that 


$(x) +¥(x) — 0, 
—aó'(x) c av'(x) = g(x). 


b. Use the first equation of part a to show that 
w'(x) = —o'(x). Then use the second equation to show that 
—2a@'(x) = g(x) and therefore that 


1 X 
(x) =--> f g(E)dE + $ (x0), 
a J, 


where x is arbitrary. Finally, determine 7 (x). 
c. Show that 


1 x-rat 
u(x,t) = x] 8E) dE. 


x—at 


18. By combining the results of Problems 16 and 17, show that the 
solution of the problem 


2 
a Uyy = Utt, 


u(x,0) = f(x), u:(x,0) = g(x), —oo«x« oo 
is given by 
1 1 x+at 
ua) = (fo -an e fo an) e f g(E)d€. 


Problems 19 and 20 indicate how the formal solution (20), (22) of 
equations (1), (3), and (9) can be shown to constitute the actual 
solution of that problem. 


1 
19. Use the identity sin A cos B = 5 (sin( A + B) + sin( A — B)), 


to show that solution (20) of the problem of equations (1), (3), and (9) 
can be written in the form (28). 


20. Let h(£) represent the initial displacement in [0, L], extended 
into ( —L, 0) as an odd function and extended elsewhere as a periodic 
function of period 2L. Assuming that h, h’, and h” are continuous, 
show by direct differentiation that u(x,t) as given in equation (28) 
satisfies the wave equation (1) and also the initial conditions (9). Note 
also that since equation (20) clearly satisfies the boundary conditions 
(3), the same is true of equation (28). Comparing equation (28) 
with the solution of the corresponding problem for the infinite string 
(Problem 16), we see that they have the same form, provided that the 
initial data for the finite string, defined originally only on the interval 
0 < x < L, are extended in the given manner over the entire x-axis. 
If this is done, the solution for the infinite string is also applicable to 
the finite one. 


21. The motion of a circular elastic membrane, such as a 


drumhead, is governed by the two-dimensional wave equation in polar 
coordinates 
1 1 E 
Urr + -Ur + -z400 5A Uy. 
r r 


Assuming that u(r, 0, t) = R(r)O(0)T(t), find ordinary differential 
equations satisfied by R(r), O (0), and T(t). 
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22. The total energy E(t) of the vibrating string is given as a 
function of time by 


Eo bs s (RA 
E(t) — zpu;(x,t) + -Tui(x,t) } dx; (44) 
0 X4 2 


the first term is the kinetic energy due to the motion of the string, and 
the second term is the potential energy created by the displacement 
of the string away from its equilibrium position. 
For the displacement u(x, t) given by equation (20) —that is, for 
the solution of the string problem with zero initial velocity — show that 
rT © 
Ej) = — ma. (45) 


n=1 


Note that the right-hand side of equation (45) does not depend on t. 
Thus the total energy E is a constant and therefore is conserved during 
the motion of the string. Hint: Use Parseval's equation (Problem 37 
of Section 10.4 and Problem 17 of Section 10.3), and recall that 
a)? =T/p. 

23. Dispersive Waves. Consider the modified wave equation 


a? ug 4-4?u = uy, O<x<L,t>0 (46) 


with the boundary conditions 


u(0,t) =0, u(L,t) = 0, t>0 (47) 
and the initial conditions 
u(x,0) = f(x), uj(x,0 20, Oc x « L. (48) 


a. Show that the solution can be written as 


99 [19.0 
«ns Ye T +2 «) sin( 77). 


n-l 


where 


10.8 


b. By using trigonometric identities, rewrite the solution as 


u(x,t) = 23»! (sin(“ tas) 


n-l 
. (NT 
+sin(“ (x — aJ). 


Determine a, the speed of wave propagation. 

C. Observe that a,, found in part b, depends on n. This means 
that components of different wave lengths (or frequencies) 
are propagated at different speeds, resulting in a distortion of 
the original wave form over time. This phenomenon is called 
dispersion. Find the condition under which a; is independent of 
n — that is, there is no dispersion. 


24. Consider the situation in Problem 23 with a? = 1, L = 10, 
and 

xX—4, 4<x<5, 

6—x, 5<x<6, 


0, otherwise. 


f(x) = 


a. Determine the coefficients c, in the solution of Problem 23a. 


Q b. Plot 


N 


. [nTx 
1 c,Sin| —— | for O x x < 10, 
10 


n-l 


choosing N large enough so that the plot accurately displays the 
graph of f(x). Use this value of N for the remaining plots called 
for in this problem. 

Oc. Let ^y — 0. Plot u(x, t) versus x for t — 60. 

O d. Lety = 1/8. Plot u(x, t) versus x for t = 20, 40, and 60. 
(G) e. Lety = 1/4. Plot u(x, t) versus x for t = 20, 40, and 60. 


Laplace’s Equation 


One of the most important of all partial differential equations occurring in applied mathematics 
is that associated with the name of Laplace:!! in two dimensions 


and in three dimensions 


Uxx + Uyy = 0, (1) 


Uxx + Uyy + uz; = 0. 2) 


For example, in a two-dimensional heat conduction problem, the temperature u(x, y, t) must 
satisfy the differential equation 


o Uxx + Uyy) = Uş, (3) 


!! Laplace's equation is named for Pierre-Simon de Laplace, who, beginning in 1782, studied its solutions extensively 
while investigating the gravitational attraction of arbitrary bodies in space. However, the equation first appeared in 
1752 in a paper by Euler on hydrodynamics. 


where a is the thermal diffusivity. If a steady state exists, then the time derivative vanishes 
and u is a function of x and y only; in this case equation (3) reduces to equation (1). 
Similarly, for the steady-state heat conduction problem in three dimensions, the temperature 
must satisfy the three-dimensional form of Laplace's equation. Equations (1) and (2) also 
occur in other branches of mathematical physics. In the consideration of electrostatic fields, 
the electric potential function in a dielectric medium containing no electric charges must satisfy 
either equation (1) or equation (2), depending on the number of space dimensions involved. 
Similarly, the potential function of a particle in free space acted on only by gravitational 
forces satisfies the same equations. Consequently, Laplace's equation is often referred to as 
the potential equation. Another example arises in the study of the steady (time-independent), 
two-dimensional, inviscid, irrotational motion of an incompressible fluid. This study centers 
on two functions, known as the velocity potential function and the stream function, both of 
which satisfy equation (1). In elasticity, the displacements that occur when a perfectly elastic 
bar is twisted are described in terms of the so-called warping function, which also satisfies 
equation (1). 

Since there is no time dependence in any of the problems just mentioned, there are no 
initial conditions to be satisfied by the solutions of equation (1) or (2). They must, however, 
satisfy certain boundary conditions on the boundary of the region in which the differential 
equation is to be solved. Since Laplace's equation is of second order, it might be plausible to 
expect that two boundary conditions would be required to determine the solution completely. 
This, however, is not the case. Recall that in the heat conduction problem for the finite bar 
(Sections 10.5 and 10.6), it was necessary to prescribe one condition at each end of the 
bar—the bar is represented as a (one-dimensional) interval and one condition is specified at 
each endpoint of the interval, that is, at each boundary point. If we generalize this observation 
to multidimensional problems, it is then natural to prescribe one condition on the function 
u at each point on the boundary of the region in which a solution of equation (1) or (2) is 
sought. When the region is a subset of the (two-dimensional) plane, the boundary is a (one- 
dimensional) curve; if the region is a subset in (three-dimensional) space, the boundary is a 
(two-dimensional) surface. 

The most common boundary condition occurs when the value of u is specified at each 
boundary point; in terms of the heat conduction problem, this corresponds to prescribing 
the temperature on the boundary. In some problems the value of the derivative, or rate of 
change, of u in the direction normal to the boundary is specified instead; the condition 
on the boundary of a thermally insulated body, for example, is of this type. It is entirely 
possible for more complicated boundary conditions to occur; for example, u might be 
prescribed on part of the boundary and its normal derivative specified on the remainder. The 
problem of finding a solution of Laplace's equation that takes on given boundary values 
is known as a Dirichlet problem, in honor of P.G.L. Dirichlet.? On the other hand, if 
the values of the normal derivative are prescribed on the boundary, the problem is said 
to be a Neumann problem, in honor of C.G. Neumann. The Dirichlet and Neumann 
problems are also known as the first and second boundary value problems of potential theory, 
respectively. 

Physically, it is plausible to expect that the types of boundary conditions just mentioned 
will be sufficient to determine the solution completely. Indeed, it is possible to establish the 
existence and uniqueness of the solution of Laplace's equation under the boundary conditions 
mentioned, provided that the shape of the boundary and the functions appearing in the 
boundary conditions satisfy certain very mild requirements. However, the proofs of these 
theorems, and even their accurate statement, are beyond the scope of the present book. Our 
only concern will be solving some typical problems by means of separation of variables and 
Fourier series. 


P Peter Gustav Lejeune Dirichlet (1805-1859) was born in Germany of Belgian ancestry. He was a professor at Berlin 
and, after the death of Gauss, at Góttingen. In 1829 he provided the first set of conditions sufficient to guarantee 
the convergence of a Fourier series. The definition of function that is usually used today in elementary calculus is 
essentially the one given by Dirichlet in 1837. Although he is best known for his work in analysis and differential 
equations, Dirichlet was also one of the leading number theorists of the nineteenth century. 


I3 Carl Gottfried Neumann (1832-1925), a German mathematician and professor at Leipzig for more than forty years, 
made contributions to differential equations, integral equations, and complex variables. 
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While the problems chosen as examples have interesting physical interpretations (in terms 
of electrostatic potentials or steady-state temperature distributions, for instance), our purpose 
here is primarily to point out some of the features that may occur during their mathematical 
solution. Itis also worth noting again that more complicated problems can sometimes be solved 
by expressing the solution as the sum of solutions of several simpler problems (see Problems 
3 and 4). 


Dirichlet Problem for a Rectangle. Consider the mathematical problem of finding the 
function u satisfying Laplace's equation 


Uxx + Uyy = 0, (4) 
in the rectangle 0 < x < a, 0 < y < b, and also satisfying the boundary conditions 
u(x,0) =0, u(x; b) = 0, 0< x < a; 
u(0, y) =0, u(a, y) — f(y), Oc y< b, 


(5) 


where f is a given function on 0 < y < b (see Figure 10.8.1). 


u(x,b) 20 


u(x,0) 20 a x 


Dirichlet problem for a rectangle. 


To solve this problem, we wish to construct a fundamental set of solutions satisfying the 
partial differential equation and the homogeneous boundary conditions; then we will superpose 
these solutions so as to satisfy the remaining boundary condition. Let us assume that 


u(x, y) = X(x)¥(y) (6) 


and substitute for u in equation (4). This yields 


where A is the separation constant. Thus we obtain the two ordinary differential equations 
X'"—AXZÉ0, O<x<a (7) 
Y"+rAY=0, O<y<b. (8) 
If we now substitute for u from equation (6) in each of the homogeneous boundary conditions 
(5), we find that 
X(0) =0 (9) 
and 
Y(0) 20, Y(b) =0. (10) 
We will first determine the solution of the differential equation (8) subject to the boundary 
conditions (10). Notice that this problem is essentially identical to one encountered previously 


in Sections 10.1, 10.5, and 10.7. We conclude that there are nontrivial solutions if and only if 
À is an eigenvalue, namely, 


r= (E. n=1,2 a, (11) 


and Y( y) is proportional to the corresponding eigenfunction sin(nz y / b). Next, we substitute 
from equation (11) for A in equation (7), obtaining 


ii nr \2 
x"- (Z) x=0. 


To solve this equation subject to the boundary condition (9), it is convenient to write the general 
solution as 


nux . nux 
xh cosh( 72} +k sinn (77). (12) 


The boundary condition (9) then requires that k; = 0. Therefore, X(x) must be proportional 
to sinh(n7 x / b). Thus we obtain the fundamental solutions 


] NTX\ , (ny 
us(x, y) = sin (77) sin( =), [area (13) 


These functions satisfy the differential equation (4) and all the homogeneous boundary 
conditions for each value of n. 

To satisfy the remaining nonhomogeneous boundary condition at x = a, we assume, as 
usual, that we can represent the solution u(x, y) in the form 


u(x, y) = 5 aiat, y= a sinh( 755) sin( Z2), (14) 


n=1 n=l 


The coefficients c,, are determined by the boundary condition 


oo 


u(a, y) = Y cn sin (777) sin(777.) = FO). (15) 


n=l 


Therefore, the quantities c, sinh(nmra/b) must be the coefficients in the Fourier sine series of 
period 2b for f and are given by 


c, sinh (7) = TEOG (16) 


Thus the solution of the partial differential equation (1) satisfying the boundary conditions (5) 
is given by equation (14) with the coefficients c, computed from equation (16). 

From equations (14) and (16), we can see that the n™ term of the solution contains the 
factor sinh(nz x / b) / sinh(n7za/b). To estimate this quantity for large n, we can use the 


approximation sinh € = zd and thereby obtain 


nx 
sinh ( ) 
b ^ 


nca Lu 
sinh ( ) 
b 


exp( 57) u e (- 58 — 2), 


1 

2 

1 nra — b 
sap) 


Thus this factor has the character of a negative exponential; consequently, the series (14) 
converges quite rapidly unless a — x is very small. 


EXAMPLE 1 


Find the solution to the Dirichlet problem for Laplace’s equation (4), (5) with a = 3, b = 2, and 


y, O<y<l, 
fy) = (17) 
2-y, l<y<2. 


Display the solution both as a three-dimensional plot and as a contour plot. (Use enough terms in the 
solution to see the true profile of the boundary conditions.) 
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Solution: 
Evaluating c, from equation (16), we find that 
8sin(n7/2) 


E n?z? sinh(3n7 /2) i 


Then u(x, y) is given by equation (14). Keeping 20 nonzero terms in the series, we can plot u versus 
x and y, as shown in Figure 10.8.2. Alternatively, we can construct a contour plot showing level 
curves of u(x, y); Figure 10.8.3 is such a plot, showing the eleven level curves from 0 to 1 with 
increments of 0.1. 


y 
u=0.1 
Eu =0.3 
u=0.5 
w= O07 
u-09 
u=0.8 
u=0.6 
u=0.4 
~u=0.2 
x 


Level curves of u(x, y) for Example 1. 


Dirichlet Problem for a Circle. Consider the problem of solving Laplace’s equation in a 
circular region r < a subject to the boundary condition 


u(a,0) = f(@), 0«0 «2n, (19) 


where f is a given function on 0 < 0 < 27 with f(0) = f(27) (see Figure 10.8.4). In polar 
coordinates (r, 0), Laplace's equation has the form (see Problem 15) 


1 1 
Urr + -Ur + Ugo = 0, O0<r<a, 0«0 «2m. (20) 
r r 


To complete the statement of the problem, we note that the assumption that f(0) = f(27) 
ensures that the periodic extension of f to the entire real line will be, at least, continuous. 
Therefore, u(r, 0) must also be periodic in 0 with period 27. Moreover, we state explicitly 
that u(r, 0) must be bounded for r < a; the reason for this boundedness assumption will 
become clear later. 


u(a, 0) 


Dirichlet problem for Laplace's 


equation in a circular region. 


To apply the method of separation of variables to this problem, we assume that 
u(r,0) = R(r)O(0) (21) 


and substitute for u in the differential equation (20). This yields 


[| 1 
R"O 4 --R'O + —RO" =0, 
r r2 


or 
R” R’ [9] n 
2 

- — 57 
d TE R 6 (22) 

where À is the separation constant. Thus we obtain the two ordinary differential equations 
r?R" E rR' - AR —0, Q3) 
Q"-F AO =0. (24) 


In this problem there are no homogeneous boundary conditions; recall, however, that solutions 
must be bounded and also periodic in 0 with period 277. It is possible to show (see Problem 
9) that the periodicity condition requires A to be real. We will consider, in turn, the cases in 
which À is negative, zero, and positive. 


Casel: IfA < 0,let A = —y?, where u > 0. Then equation (24) becomes ©” — 420 = 0, 
and consequently, 


O(0) = cie"? + cee ™?. (25) 

Thus © (0) can be periodic only if c; = c2 = 0, and we conclude that A cannot be negative. 
Case Il: If A = 0, then equation (24) becomes ©” = 0, and thus 

0(0) =c +020. (26) 


For © (0) to be periodic, we must have c2 = 0 so that O (0) is a constant. Further, for A = 0, 
equation (23) becomes 


r?R" +rR' =0. (27) 
This equation is of the Euler type and has the solution 
R(r) =k, + ko Inr. (28) 
The logarithmic term cannot be accepted if u(r,@) is to remain bounded as r — 0; 
hence k = 0. Thus, corresponding to A = 0, we conclude that u(r, 0) must be a 
constant— that is, proportional to the solution 
uo(r, 0) = 1. (29) 


Case Ill: Finally, if A > 0, we let A = u?, where u > 0. Then equations (23) and (24) 
become 


r?R" E rR' — p?R=0 (30) 
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and 
e" + 70 — 0, (31) 
respectively. Equation (30) is an Euler equation and has the solution 
R(r) = kar" +k”, (32) 


while equation (31) has the solution 
O(0) = cı sin( u0) + c2 cos( u0). (33) 


In order for O to be periodic with period 27, it is necessary for u to be a positive integer n. 
Since u = n is a positive integer, it follows that the solution r~™ in equation (32) becomes 
unbounded as r — 0 and so must be discarded. Consequently, k2 = 0 and the appropriate 
solutions of equation (20) are 


un(r,0) —r'cos(n0), v,(r,0) —r'sinnü), n=1,2,.... (34) 


These functions, together with u(r, 0) = 1, form a set of fundamental solutions for the present 
problem. 
In the usual way, we now assume that u can be expressed as a linear combination of the 
fundamental solutions; that is, 
co = 
u(r, 8) = > + Y Lr" Ce cos(n0) + kn sin(n0)). (35) 


n=1 


The boundary condition (19) then requires that 


u(a,0) = S t S a" (on cos(n0) + kn sin(n0)) = f(0) (36) 


n=l 


for0 < 0 < 2m. The function f may be extended outside this interval so that it is periodic 
with period 27 and therefore has a Fourier series of the form (36). Since the extended function 
has period 27 , we may compute its Fourier coefficients by integrating over any period of the 
function. In particular, it is convenient to use the original interval (0, 277); then 


1 2n 
ac, = — f(0)cos(n0)d0O, n=0,1,2,...; (37) 
T Jo 


2n 
a" k, = - f(0)sininO)dO0, n=1,2,.... (38) 
0 

With this choice of the coefficients, equation (35) represents the solution of the boundary 
value problem of equations (19) and (20). Note that in this problem we needed both sine and 
cosine terms in the solution. This is because the boundary data were given on 0 < 0 < 27 
and have period 27. As a consequence, the full Fourier series is required, rather than sine or 
cosine terms alone. 


Problems 


1. a. Find the solution u(x, y) of Laplace's equation in the of y and also plot u versus y for several values of x. (Use 
rectangle 0 < x « a,0 « y « b, that satisfies the boundary enough terms in the Fourier series to accurately approximate the 
conditions nonhomogeneous boundary condition.) 


Q d. Plot u versus both x and y in three dimensions. Also draw 
a contour plot showing several level curves of u(x, y) in the xy- 
u(x,0) =0, u(x,b) = g(x), O<x <a. plane. 


u(0,y) =0, u(a,y) =0, 0< y<b, 


b. Find the solution if 2. Find the solution u(x, y) of Laplace's equation in the rectangle 
! 0«x«a,0 « y< b, that satisfies the boundary conditions 


X5 0<x <a/2, 
g(x) = a—x, a/2<x<a. u(0, y) = 0, u(a,y) 20, O«yc«b, 


u(x,0) — h(x), u(x,b) 20, O<x <a. 
G C. Fora = 3 and b = 1, plot u versus x for several values 


3. a. Find the solution u(x, y) of Laplace's equation in the 
rectangle 0 < x < a, 0 < y < b, that satisfies the boundary 


conditions 
u(0, y) = 0, u(a,y) = f(y), O< y«b, 
u(x,0) = h(x), u(x, b) = 0, O<x<a. 


Hint: Consider the possibility of adding the solutions of two 
problems, one with homogeneous boundary conditions except 
for u(a, y) = f(y), and the other with homogeneous boundary 
conditions except for u(x, 0) = A(x). 

b. Find the solution if h(x) = (x/a)? and f(y) = 1 — y/b. 
O c. Let a = 2 and b = 2. Plot the solution in several ways: 
u versus x (for a uniform sample of y values), u versus y (for a 
uniform sample of x values), u versus both x and y, and a contour 
plot. 


4. Show how to find the solution u(x, y) of Laplace's equation in 
the rectangle 0 < x < a, 0 < y < b, that satisfies the boundary 
conditions 

u(0, y) = KCy), 
u(x,0) = h(x), 


u(a, y) = f(y), 
u(x, b) = g(x), 


0< y<b, 
O< x <a: 


Hint: See Problem 3. 


5. Find the solution u(r, 0) of Laplace’s equation 


1 1 
Urr + -Ur + —ugp = 0, r>a, 0«0«2m, 
r r 


outside the circle r = a, that satisfies the boundary condition 
u(a, 0) = f(0), 


on the circle. Assume that u(r, 0) is single-valued and bounded for 
ra. 


0x0 «2m, 


6. a. Find the solution u(r,@) of Laplace's equation in the 
semicircular region r < a, 0 < 0 < v, that satisfies the 
boundary conditions 


u(r,0) =0, u(r,7) =0, O<r<a, 
u(a,0) = f(A), 0c0«m. 


Assume that u is single-valued and bounded in the given region. 
b. Find the solution if £(0) = 0(m — 0). 
O c. Let a = 2 and plot the solution in several ways: u versus 
r, u versus 0, u versus both r and 0, and a contour plot. 
7. Find the solution u(r, 0) of Laplace's equation in the circular 
sector0 < r < a, 0 < 0 < a, that satisfies the boundary conditions 
u(r,0) =0, u(r,a) =0, O<r<a, 
u(a,0) = f(A), 0«0«2a. 
Assume that u is single-valued and bounded in the sector and that 
0co «2m. 


8. a. Find the solution u(x, y) of Laplace's equation in the semi- 
infinite strip 0 < x < a, y > 0, that satisfies the boundary 
conditions 


u(0, y) — 0, 
u(x, 0) = f(x). 


and the additional condition that u(x, y) — 0 as y > oo. 
b. Find the solution if f(x) = x(a — x). 


u(a,y) 20, y>O, 


O<x<a 
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@c. Let a = 5. Find the smallest value of yọ for which 
u(x, y) < 0.1 forall y > yo. 
9. Show that equation (24) has periodic solutions only if A is real. 
Hint: Let A = —pu?, where u = v +io with v and ø real. 
10. Consider the problem of finding a solution u(x, y) of Laplace's 
equation in the rectangle 0 < x < a, 0 « y « b, that satisfies the 
boundary conditions 


ux (0, y) = 0, 
uy(x,0) = 0, 


ux(a, y) = f(y), 
uy(x, b) = 0, 


O0<y<b, 
O<x<a. 


This is an example of a Neumann problem. 
a. Show that Laplace’s equation and the homogeneous 
boundary conditions determine the fundamental set of solutions 


ug(x, y) = C0, 


nux nm y 
Un(X, y) = Cy cosh (772.) cos ( b ) nel123.3,... 


b. By superposing the fundamental solutions of part (a), 
formally determine a function u satisfying the nonhomogeneous 
boundary condition uy(a, y) = f(y). Note that when us(a, y) 
is calculated, the constant term in u(x, y) is eliminated, and there 
is no condition from which to determine cp. Furthermore, it must 
be possible to express f by means of a Fourier cosine series of 
period 2b, which does not have a constant term. This means that 


b 
n f(y)dy «0 
0 


is a necessary condition for the given problem to be solvable. 
Finally, note that cg remains arbitrary, and hence the solution is 
determined only up to this additive constant. This is a property of 
all Neumann problems. 


11. Find a solution u(r, 0) of Laplace's equation inside the circle 
r — a that satisfies the boundary condition on the circle 


u,(a,0) —g(0), 0«0 « 2m. 

Note that this is a Neumann problem and that its solution is determined 
only up to an arbitrary additive constant. State a necessary condition 
on g(@) for this problem to be solvable by the method of separation 
of variables (see Problem 10). 


12. a. Find the solution u(x, y) of Laplace's equation in the 
rectangle 0 < x < a,0 < y < b, that satisfies the boundary 


conditions 
u(0,y) =0, u(a,y) —0, 0cyc«b, 
uy(x,0) —0, u(x,b) —g(x) O<xK<a. 


Note that this is neither a Dirichlet nor a Neumann problem, but 
a mixed problem in which u is prescribed on part of the boundary 
and its normal derivative on the rest. 

b. Find the solution if 


X; 
g(x) = { 
a-x, 


O c. Let a = 3 and b = 1. By drawing suitable plots, compare 
this solution with the solution of Problem 1. 


0<x <a/2, 


a/2<x<a. 
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13. a. Find the solution u(x, y) of Laplace's equation in the 
rectangle 0 < x < a, 0 < y < b, that satisfies the boundary 
conditions 

u(0,y) 20, u(a,y) 2 fly), O<y<b, 
u(x,0) =0, uy(x,b) =0, 0 «€ x «€ a. 


Hint: Eventually, it will be necessary to expand f(y) in a 
series that makes use of the functions sin( 7 y/ 2b), sin( 3 y / 2b), 
sin(5- y/2b), ... (see Problem 39 of Section 10.4). 

b. Find the solution if f(y) = y(2b — y). 

O c. Let a = 3 and b = 2; plot several different views of the 
solution. 


14. a. Find the solution u(x, y) of Laplace's equation in the 
rectangle 0 < x < a, 0 < y < b, that satisfies the boundary 
conditions 


ux(0, y) = 0, 
u(x,0) — 0, 


0cyc«b, 
O<x<a. 


ux(a, y) = 0, 
u(x, b) = g(x), 
b. Find the solution if g(x) = 1 + x?(x — a)?. 


O c. Let a = 3 and b = 2; plot several different views of the 
solution. 


15. Show that Laplace’s equation in polar coordinates is 
1 1 
Urr + -ur + -z400 = 0. 
r r 


Hint: Use x = r cos 0 and y =r sin@ and the chain rule. 
16. Show that Laplace’s equation in cylindrical coordinates is 


1 1 
Urr + -Uur + 3400 +uz =0. 
r i 


Hint: Use x = r cos 0, y = r sin 0, z = z, and the chain rule. 


17. Show that Laplace’s equation in spherical coordinates is 


cot @ 
—53 U00 + —5 
p? sin^ à r 
Hint: Use x = p sin ó cos 0, y = psing sinÜ, z = pcos 0, and the 
chain rule. 


2 1 
i eas gap se ug = 0. 
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18. a. Laplace's equation in cylindrical coordinates was found in 
Problem 15. Show that axially symmetric solutions (i.e., solutions 
that do not depend on 0) satisfy 


1 
Urr + -Ur + uz; = 0. 
r 


b. Assuming that u(r,z) = R(r)Z(z), show that R and Z 
satisfy the equations 


rR'LR'LXMrRA20, Z'—XMzzog. 


Note: The equation for R is Bessel's equation of order zero with 
independent variable Ar. 


19. Flow in an Aquifer. Consider the flow of water in a porous 
medium, such as sand, in an aquifer. The flow is driven by the 
hydraulic head, a measure of the potential energy of the water above 
the aquifer. Let R : 0 < x < a,0 « z « b be a vertical section 
of an aquifer. In a uniform, homogeneous medium, the hydraulic head 


u(x, z) satisfies Laplace's equation 
Uxx du; =0 inR. (39) 


If water cannot flow through the sides and bottom of R, then the 
boundary conditions there are 


ux(0,z) = 0, 
ü-(x,0) = 0, 


ux(4,z)=0, O<z<b (40) 
0 « x « a. (41) 


Finally, suppose that the boundary condition at z = b is 


u(x,b)=b+ax, O<x<a, (42) 


where a is the slope of the water table. 
a. Solve the given boundary value problem for u(x, z). 
O b. Leta = 1000, b = 500, anda = 0.1. Draw a contour plot 
of the solution in R; that is, plot some level curves of u(x, z). 
O c. Water flows along paths in R that are orthogonal to the 
level curves of u(x, z) in the direction of decreasing u. Plot some 
of the flow paths. 


Derivation of the Heat Conduction Equation In this section we derive the linear partial 
differential equation that, to a first approximation at least, governs the conduction of heat 
in solids. It is important to understand that the mathematical analysis of a physical situation 
or process such as this ultimately rests on a foundation of empirical knowledge of the 
phenomenon involved. The mathematician must have a place to start, so to speak, and this 
place is furnished by experience. 

Consider a uniform rod insulated on the lateral surfaces so that heat can flow only in 
the axial direction. It has been demonstrated many times that if two parallel cross sections of 
the same area A and different temperatures T, and 75, respectively, are separated by a small 
distance d, an amount of heat per unit time will pass from the warmer section to the cooler one. 
Moreover, this amount of heat is proportional to the area A and to the temperature difference 
IT — 7i| and is inversely proportional to the separation distance d. Thus 


Amount of heat per unit time — 


KA|T> — T; | 


1 
d , (1) 


where the positive proportionality factor & is called the thermal conductivity and depends 
primarily on the material!^ of the rod. The relation (1) is often called Fourier’s law of heat 
conduction. We repeat that equation (1) is an empirical, not a theoretical, result and that it 
can be, and often has been, verified by careful experiment. It is the basis of the mathematical 
theory of heat conduction. 

Now consider a straight rod of uniform cross section and homogeneous material, oriented 
so that the x-axis lies along the axis of the rod (see Figure 10.A.1). Let x = Oand x = L 
designate the ends of the bar. 


H = -kAu, —L ~|—a- KAU, 


X-2X9 Skot Ax 


Conduction of heat in an 
element of a rod. 


We will assume that the sides of the bar are perfectly insulated so that there is no passage 
of heat through them. We will also assume that the temperature u depends only on the axial 
position x and the time ¢, and not on the lateral coordinates y and z. In other words, we assume 
that the temperature remains constant on any cross section of the bar. This assumption is 
usually satisfactory for thin rods, that is, rods where the lateral dimensions of the rod are 
small compared to its length. 

The differential equation governing the temperature in the bar is an expression of a 
fundamental physical balance; the rate at which heat flows into any portion of the bar is equal 
to the rate at which heat is absorbed in that portion of the bar. The terms in the equation are 
called the flux (flow) term and the absorption term, respectively. 

We will first calculate the flux term. Consider an element of the bar lying between the 
cross sections x = xo and x = xg + Ax, where xo is arbitrary and Ax is small. The 
instantaneous rate of heat transfer H (xo, t) from left to right across the cross section x = xo 
is given by 


. u(Xxo + d/2, t) — u(xo — d/2, t) 
— lim KA 
d—0 d 


H (xo, t) 


= —KAu,(X9, t). (2) 


The minus sign appears in this equation because there will be a positive flow of heat from 
left to right only if the temperature is greater to the left of x = x than to the right; in this 
case u, (xo, f) is negative. In a similar manner, the rate at which heat passes from left to right 
through the nearby cross section x = xo + Ax is given by 


H(xo + Ax,t) = —&Auy(xo + Ax,t). (3) 


The net rate at which heat flows into the segment of the bar between x = x9 and x = xy» + Ax 
is thus given by 


Q = H(xo, t) — H(xo + Ax, t) = &A(us(xo-F Ax, t) — ux(x0, f)), (4) 
and the amount of heat entering this bar element in a short time interval of length At is 
QAt = kA(ux(xo + Ax, t) — ux(xo, t)) At. (5) 


Let us now calculate the absorption term. The average change in temperature Au, in 
the time interval At, is proportional to the amount of heat QA t introduced and is inversely 
proportional to the mass A m of the segment. Since the rod is assumed to be homogeneous, 


14 Actually, x also depends on the temperature, but if the temperature range is not too great, it is satisfactory to assume 
that & is independent of temperature. 
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with density p and cross-sectional area A, the mass of the segment between xo and xo + Ax 
is Am — pAA x. Thus 
1QAt QAt 

~ sAm spAAx’ 
where the constant of proportionality s is known as the specific heat of the material of the 
bar.!> The average temperature change A u in the bar element under consideration is the actual 
temperature change at some intermediate point x = xo + 0Ax, where 0 < 0 < 1. Thus 
equation (6) can be written as 


Au 


(6) 


A 
u(xo + 0A x,t + At) —u(xo + 0Ax,t) = QAt (7) 
spAAx 
or as 
QAt= (u(xo +0Ax,t+At) —u(xo -0Ax, t))spAAx. (8) 


To balance the flux and absorption terms, we equate the two expressions for QA t: 
& A (us (xo + Ax, t) — ux( xo, t))At 
= spA(u(xo + Ax, t+ At) — u(xo + Ax, t))Ax. (9) 
Upon dividing equation (9) by sp AAx At, we obtain 
K Ux (xo + Ax, t) — Ux (xo, t) u u (xo +0Ax,t+ At) — u (xo + 0Ax, t) 
Sp Ax At 
Now, letting Ax — 0 and At — 0 leads us to the heat conduction or diffusion equation 


a us = ür, (10) 
where the quantity œ? defined by 
2 K 
E 
is called the thermal diffusivity and is a parameter that depends only on the material of the 
bar. The units of o? are (length)? / time. Typical values of œ? are given in Table 10.5.1. 
Several relatively simple conditions may be imposed at the ends of the bar. For example, 
the temperature at an end may be maintained at some constant value T. This might be 
accomplished by placing the end of the bar in thermal contact with some reservoir of sufficient 
size so that any heat that flows between the bar and the reservoir does not appreciably alter 
the temperature of the reservoir. At an end where this is done, the boundary condition is 


Q 


(11) 


u — T. (12) 


Another simple boundary condition occurs if the end is insulated so that no heat passes 
through it. The amount of heat crossing the cross section of the bar at x = xp is given 
by expression (2): —K Au,(Xxo, t). At an insulated end, no heat crosses the boundary. The 
boundary condition that enforces an insulated end is 


u, =0. (13) 


A more general type of boundary condition occurs if the rate of flow of heat through an 
end of the bar is proportional to the temperature there. Let us consider the end x = 0, where 
the rate of flow of heat from left to right is given by —K Au,(0, t); see equation (2). Hence 
the rate of heat flow out of the bar (from right to left) at x = 0 is x Au,(0, t). If this quantity 
is proportional to the temperature u(0, t), then we obtain the boundary condition 


u,(0,t) —hyu(0,t) 20, +t > 0, (14) 


where h; is a nonnegative constant of proportionality. Note that ^; = 0 corresponds to an 
insulated end and that A, — oo corresponds to an end held at zero temperature. 

If heat flow is taking place at the right end of the bar (x — L), then in a similar way we 
obtain the boundary condition 


where again h, is a nonnegative constant of proportionality. 


15The dependence of the density and specific heat on temperature is relatively small and will be neglected. Thus both 
p and s will be considered as constants. 


Finally, to determine completely the flow of heat in the bar, it is necessary to state the 
temperature distribution at one fixed instant, usually taken as the initial time £ = 0. This 
initial condition is of the form 


u(x,0) = f(x), O<x<L. (16) 


The problem then is to determine the solution of the differential equation (10) subject to 
one of the boundary conditions (12) to (15) at each end and to the initial condition (16) at 
t=0. 

Several generalizations of the heat equation (10) also occur in practice. First, the bar 
material may be nonuniform, and the cross section may not be constant along the length of the 
bar. In this case, the parameters &, p, s, and A may depend on the axial variable x. Going back 
to equation (2), we see that the rate of heat transfer from left to right across the cross section 
at x = xo is now given by 


H (X90, t) = —K (xo) A(X) Ux (Xo, t) (17) 


with a similar expression for H ( xo + A x, t). If we introduce these quantities into equation (4) 
and eventually into equation (9), and proceed as before, we obtain the partial differential 
equation 


(KAU), = sp Aus. (18) 
We will usually write equation (18) in the form 
r(x)u; = (p(x) ux) x, (19) 


where p(x) = K(x) A(x) and r(x) = s(x) p(x) A(x). Note that both of these quantities are 
intrinsically positive. 

A second generalization occurs if there are other ways in which heat enters or leaves the 
bar. Suppose that there is a source that adds heat to the bar at a rate G(x, t, u) per unit time 
per unit length, where G(x, t, u) > 0. In this case we must add the term G(x, t, u) Ax At to 
the left-hand side of equation (9), and this leads to the differential equation 


r(x)u, = (p(x)ux)x + G(x, t, u). Q0) 


If G(x, t, u) < 0, then we speak of a sink that removes heat from the bar at the rate G(x, t, u) 
per unit time per unit length. To make the problem tractable, we must restrict the form of the 
function G. In particular, we assume that G is linear in u and that the coefficient of u does not 
depend on t. Thus we write 


G(x,t,u) = F(x,t) —q(x)u. (21) 


The minus sign in equation (21) has been introduced so that certain equations that appear later 
will have their customary forms. Substituting from equation (21) into equation (20), we obtain 


r(x)u, = (p(x)ux)x — q(x)u + F(x, t). (22) 


This equation is sometimes called the generalized heat conduction equation. Boundary value 
problems for equation (22) will be discussed to some extent in Chapter 11. 

Finally, if instead of a one-dimensional bar, we consider a body with more than one 
significant space dimension, then the temperature is a function of two or three space 
coordinates rather than of x alone. Considerations similar to those leading to equation (10) 
can be employed to derive the heat conduction equation in two dimensions 


2 
a (Uxx s uyy) = lt, (23) 
or in three dimensions 


o Uxx + Uyy + ul.) = Uy. (24) 


The boundary conditions corresponding to equations (12) and (13) for multidimensional 
problems correspond to a prescribed temperature distribution on the boundary, or to an 
insulated boundary. Similarly, the initial temperature distribution will, in general, be a function 
of x and y for equation (23) and a function of x, y, and z for equation (24). 
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B APPENDIX 


Derivation of the Wave Equation In this appendix we derive the wave equation in one space 
dimension as it applies to the transverse vibrations of an elastic string or cable; the elastic 
string may be thought of as a guitar string, a guy wire, or possibly an electric power line. The 
same equation, however, with the variables properly interpreted, occurs in many other wave 
problems having only one significant space variable. 

Consider a perfectly flexible elastic string stretched tightly between supports fixed at 
the same horizontal level (see Figure 10.B.1a). Let the x-axis lie along the string with the 
endpoints located at x = 0 and x = L. If the string is set in motion at some initial time 
t = O (for example, by plucking) and is thereafter left undisturbed, it will vibrate freely 
in a vertical plane, provided that damping effects, such as air resistance, are neglected. To 
determine the differential equation governing this motion, we will consider the forces acting 
on a small element of the string of length A x lying between the points x and x + Ax (see 
Figure 10.B.1b). We assume that the motion of the string is small and that, as a consequence, 
each point on the string moves solely in a vertical line. We denote by u(x,t) the vertical 
displacement of the point x at the time f. Let the tension in the string, which always acts in the 
tangential direction, be denoted by T(x, t), and let p denote the mass per unit length of the 
string. 


T (x + Ax, t) 


— 8 $0 A0 


a4 —-—-—---- 


| 
x=0 x-L l 
‘ x + Ax 
(a) (b) 
" — 
= V=T sing 
a7 H =T cos 0 


FAAEA (a) An elastic string under tension. (b) An element of the displaced string. (c) 
Resolution of the tension T into components. 


Newton's law, as it applies to the element A x of the string, states that the net external 
force, due to the tension at the ends of the element, must be equal to the product of the mass of 
the element and the acceleration of its mass center. Since there is no horizontal acceleration, 
the horizontal components must satisfy 


T(x + Ax,t) cos(0 + A0) — T(x,t) cos0 = 0. (1) 


If we denote the horizontal component of the tension (see Figure 10.B.1c) by H, then 
equation (1) states that H is independent of x. 
On the other hand, the vertical components satisfy 


T(x + Ax,t) sin(0 + A0) — T(x,t) sinü = p Axuqg(X,t), (2) 


where x is the coordinate of the center of mass of the element of the string under consideration. 
Clearly, x lies in the interval x < x < x+ Ax. The weight of the string, which acts vertically 
downward, is assumed to be negligible and has been neglected in equation (2). 


Let the vertical component of T be denoted by V; then equation (2) can be written as 


V(x + Ax,t) — V(x, ft) 
Ax 


= pug(x,t). 
Passing to the limit as Ax — 0 gives 
V(x, t) = pugu(x, t). (3) 
To express equation (3) entirely in terms of u, we note that 
V(x,t) = A(t) tand = H(t)u,(x,t). 
Hence equation (3) becomes 
(Huy), = pur, 
or, since H is independent of x, 
Huy, = pug. (4) 


For small motions of the string, the angle theta is small, and H = T cos is accurately 
approximated by T'. Then equation (4) takes its customary form 


aUx = Ur, (5) 
where 
T 
v=. (6) 
p 


We will assume further that a? is a constant, although this is not required in our derivation, even 
for small motions. Equation (5) is called the wave equation for one space dimension. Since 
T has the dimension of force, and p that of mass/length, it follows that the constant a has the 
dimension of velocity. Itis possible to identify a as the velocity with which a small disturbance 
(wave) moves along the string. According to equation (6), the wave velocity a varies directly 
with the tension in the string, but inversely with the density of the string material. These facts 
are in agreement with experience. 

As in the case of the heat conduction equation, there are various generalizations of the 
wave equation (5). One important equation is known as the telegraph equation and has the 
form 


Un A cu; +ku — a?uy, + F(x, t), (7) 


where c and k are nonnegative constants. The terms cu;, ku, and F(x, t) arise from a viscous 
damping force, an elastic restoring force, and an external force, respectively. Note the 
similarity of equation (7), except for the term a?uxx, to the equation for the spring—mass 
system derived in Section 3.7; the additional term a?uxx arises from a consideration of internal 
elastic forces. 

The telegraph equation also governs the flow of voltage, or current, in a transmission line 
(hence its name); in this case the coefficients are related to electrical parameters in the line. 

For a vibrating system with more than one significant space coordinate, it may be 
necessary to consider the wave equation in two dimensions 


2 
a (Uxx + uyy) = un, (8) 
or in three dimensions 


a (nus + Uyy + lizz) = ug. (9) 
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CHAPTER 11 


Boundary Value 
Problems and 
Sturm-Liouville Theory 


As aresult of separating variables in a partial differential equation in Chapter 10, we repeatedly 
encountered the differential equation 


X"”+AX=0, O<x<L 
with the boundary conditions 
X(0) 20, X(L)-0. 


This boundary value problem is the prototype of a large class of problems that are important in 
applied mathematics. These problems are known as Sturm-Liouville boundary value problems. 
In this chapter we discuss the fundamental properties of Sturm-Liouville problems, including 
existence and uniqueness of solutions; in the process we are able to generalize somewhat the 
method of separation of variables for partial differential equations. 


1i. The Occurrence of Two-Point 
Boundary Value Problems 


In Chapter 10 we described the method of separation of variables as a means of solving certain 
problems involving partial differential equations. The heat conduction problem consisting of 
the partial differential equation 


Q Ury = Uy, 0<x< L, t>0 (1) 
subject to the boundary conditions 
u(0,t) 20, u(L,t) 20, t>0 (2) 
and the initial condition 
u(x,0 = f(x), O<sx<L Q) 


is typical of the problems considered there. A crucial part of the process of solving such 
problems is to find the eigenvalues and eigenfunctions of the differential equation 


X"+r4X =0, O< x<L (4) 
with the boundary conditions 
X(0 20, X(L)=0 (5) 
or perhaps 
X'(0)—0, X'(L)-0. (6) 
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The sine or cosine functions that result from solving equation (4) subject to the boundary 
conditions (5) or (6), respectively, are used to expand the initial temperature distribution f(x) 
in an appropriate Fourier series. 

In this chapter we extend and generalize the results of Chapter 10. Our main goal is to 
show how the method of separation of variables can be used to solve problems somewhat 
more general than that of equations (1), (2), and (3). We are interested in three types of 
generalizations. 

First, we wish to consider more general partial differential equations—for example, the 
equation 


r(x)u; = (p(x)ux), — q (x)u + F(x, t). (7) 


This equation can arise, as indicated in Appendix A of Chapter 10, in the study of heat 
conduction in a bar of variable material properties in the presence of heat sources. If p(x) 
and r(x) are constants, and if the source terms g(x)u and F(x, t) are zero, then equation (7) 
reduces to equation (1). The partial differential equation (7) also occurs in the investigation of 
other phenomena of a diffusive character. 

A second generalization is to allow more general boundary conditions. In particular, we 
wish to consider boundary conditions of the form 


ux(0, t) — hu(0, t) = 0, ux(L, t) + hou(L, t) = 0. (8) 


Such conditions occur when the rate of heat flow through an end of the bar is proportional to 
the temperature there. Usually, ^; and h, are nonnegative constants, but in some cases they 
may be negative or depend on f£. The boundary conditions (2) are obtained in the limit as 
hı — oo and hy — co. The other important limiting case, hı = hz = 0, gives the boundary 
conditions for insulated ends. 

The final generalization that we discuss in this chapter concerns the geometry of the 
region in which the problem is posed. The results of Chapter 10 are adequate only for a rather 
restricted class of problems, chiefly those in which the region of interest is rectangular or, in 
a few cases, circular. Later in this chapter we consider certain problems posed in a few other 
geometrical regions. 

Let us consider the equation 


r(x)u, = (p(x)ux), — q(x)u (9) 


obtained by setting the term F(x, t) in equation (7) equal to zero. To separate the variables, 
we assume that 


u(x,t) = X(x)T(t) (10) 
and substitute for u in equation (9). We obtain 
r(x) XT! = (p(x) X) T — q(x) XT, (11) 
or, upon dividing by r(x) XT, 
T' (pCGOX) aw) - 


T r(x)X r(x) a 


(12) 


We have denoted the separation constant by —A in anticipation of the fact that usually it will 
turn out to be real and negative. From equation (12) we obtain the following two ordinary 
differential equations for X and T: 


(p(x) X)! = q(x) X  Ar(x) X — 0, (13) 

T' -AT =0. (14) 

If we substitute from equation (10) for u in equations (8) and assume that h; and hz are 
constants, then we obtain the boundary conditions 

X'(0) —h,X(0 =0, X'(L)-h;X(L) =0. (15) 

To proceed further we need to solve equation (13) subject to the boundary conditions 

(15). Although this is a more general linear homogeneous two-point boundary value problem 


than the problem consisting of the differential equation (4) and the boundary conditions (5) or 
(6), the solutions behave in very much the same way. For every value of A, the two-point 
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boundary value problem (13), (15) has the trivial solution X(x) = 0. For certain values 
of A, called eigenvalues, there are also other, nontrivial solutions called eigenfunctions. 
These eigenfunctions form the basis for series solutions of a variety of problems in partial 
differential equations, such as the generalized heat conduction equation (9) subject to the 
boundary conditions (8) and the initial condition (3). 

In this chapter we discuss some of the properties of solutions of two-point boundary 
value problems for second-order linear equations. Sometimes we consider the general linear 
homogeneous equation 


P(x)y" + QG) y + R(x)y 20, (16) 


investigated in Chapter 3. However, for most purposes it is better to discuss equations in which 
the first and second derivative terms are related as in equation (13). It is always possible to 
transform the general equation (16) so that the derivative terms appear as in equation (13) (see 
Problem 11). 

Boundary value problems with higher-order differential equations can also occur; in them 
the number of boundary conditions must equal the order of the differential equation. As a rule, 
the order of the differential equation is even, and half the boundary conditions are given at 
each end of the interval. It is also possible for a single boundary condition to involve values 
of the solution and/or its derivatives at both boundary points; for example, 


y(0) — y(L) = 0. (17) 


The following example involves one boundary condition of the form (15) and is therefore 
more complicated than the problems in Section 10.1. 


EXAMPLE 1 


Find the eigenvalues and the corresponding eigenfunctions of the boundary value problem 


y! Ay =0, (18) 
y(0) 20, y(1)+y(1)=0. (19) 


One place where this problem occurs is in the heat conduction problem in a bar of unit length. The 
boundary condition at x = 0 corresponds to a zero temperature there. The boundary condition at 
x = 1 corresponds to a rate of heat flow that is proportional to the temperature there, and units are 
chosen so that the constant of proportionality is 1 (see Appendix A of Chapter 10). 

Solution: 


The solution of the differential equation may have one of several forms, depending on A, so it is 
necessary to consider several cases. 


Casel: First, if A = 0, the general solution of the differential equation (18) is 
y — c1x +c. (20) 
The two boundary conditions (19) require that 
€? —0, 2c) +c. =0, (21) 


respectively. The only solution of equations (21) is c; = c» = 0, so the boundary value problem has 
no nontrivial solution in this case. Hence \ = 0 is not an eigenvalue. 


Case ll: If A > O, then the general solution of the differential equation (18) is 
y —c,sin (vax) + c2 cos (vax), (22) 


where / > 0. The boundary condition at x = 0 requires that c; = 0; from the boundary condition 
at x = 1 we then obtain the equation 


Cy (sin A + V/A cos VX) = 0. 


For a nontrivial solution y we must have c, Z 0, and thus A must satisfy 


sin VA + Vi cos VA = 0. (23) 
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Note that if À is such that cos VA = 0, then sin VÀ + 0, and equation (23) is not satisfied. Hence 
we may assume that cos VÀ # 0; dividing equation (23) by cos VÀ, we obtain 


VA = —tan A. (24) 


The solutions of equation (24) cannot be determined exactly; they must be approximated numerically. 
They can also be estimated by sketching the graphs of f( VA) = VX and g( VA) = —tan VÀ 
for VÀ > 0 on the same set of axes and identifying the points of intersection of the two curves 
(see Figure 11.1.1). The point VX = Ois specifically excluded from this argument because the 
solution (22) is valid only for VÀ Æ 0. Despite the fact that the curves intersect there, A = 0 
is not an eigenvalue, as we have already shown. The first four positive solutions of equation (24) 
are ,/À, X 2.02876, \/Az = 4.91318, \/A3 = 7.97867, and /A4 = 11.08554. Figure 11.1.1 
shows that the roots can be approximated with reasonable accuracy by /\, = (2n — 1)7/2 for 


n = 5,6, ... , the precision of this estimate improving as n increases. Hence the eigenvalues are 
A; 2 4.11586, Az = 24.13934, Az = 63.65911, Aq = 122.88916, 
2n —] 2 2 (25) 
An S ear, forn =S; Gede 


Finally, since c? = 0, the eigenfunction corresponding to the eigenvalue A, is 
fati An) = ke sin(/Anx)s no 12... (26) 


where the constant k, remains arbitrary. 


| FIGURE 11.1.1 | Graphical solution of JX = — tan VÀ. 


Case Ill: Next consider A < 0. In this case let A = —j so that j > 0. Then equation (14) 
becomes 


y" — py =0, (27) 
and its general solution is 


y =c, sinh( YH x) + c2 cosh( VH x), (28) 


where „/p > 0. Proceeding as in the previous case, we find that jj must satisfy the equation 


Jp = —tanh p. (29) 


From Figure 11.1.2 it is clear that the graphs of f (JE) = ,/p and e( Ju) = — tanh Vp intersect 
only at the origin. Hence there are no positive values of ,/j that satisfy equation (29), and hence the 


boundary value problem (18), (19) has no negative eigenvalues. 
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Finally, it is necessary to consider the possibility that 4 may be complex. It is possible to 
show by direct calculation that the problem (18), (19) has no complex eigenvalues. However, in 
Section 11.2 we consider in more detail a large class of problems that includes this example. One of 
the things we show there is that every problem in this class has only real eigenvalues. Therefore, we 
omit the discussion of the nonexistence of complex eigenvalues here. Thus all the eigenvalues and 
eigenfunctions of the problem (18), (19) are given by equations (25) and (26). 


| FIGURE 11.1.2 | Graphical representation of y/u = — tanh Ju. 


Problems 


In each of Problems 1 through 6, state whether the given boundary 
value problem is homogeneous or nonhomogeneous. 
1. y’+4y=0, y(-1) =0, y(1) 20 
2. ((1+x7)y’)’+4y =0, y(0) =0, y(1) =1 
3. y”’+4y=sinx, y(0)20, y(1) =0 
4. -y"+x?y= dy, y(0) —y(0) =0, y'(1) + (1) = 0 
& (1 +x°y) =ry+1, y(-0 =0, 70) =0 
6. —y”=(1+x7)y,  y(0) =0, y(1)-3y(1) =0 
In each of Problems 7 through 10, assume that all eigenvalues are real. 
a. Determine the form of the eigenfunctions and the 
determinantal equation satisfied by the nonzero eigenvalues. 
b. Determine whether \ = 0 is an eigenvalue. 
c. Find approximate values for A, and A5, the nonzero 
eigenvalues of smallest absolute value. 
d. Estimate A, for large values of n. 


7. y"+Ay=0, y(0) =0, y(m) + y'(1) =0 
8. y’+Ay=0, y(0) =0, y(1) +y/(1) 20 
9. y"+Ay=0, y(0) —y(0) = 0, yx) +y) =0 
10. y'—Ay 20, y(0) + y(0) =0, y() =0 
11. Consider the general linear homogeneous second-order equation 


P(x)y" + QG)y' + R(x)y = 0. (30) 


We seek an integrating factor u(x) such that, upon multiplying 
equation (30) by u(x), we can write the resulting equation in the form 


Gu GO P(x) y) + ux) RGO y = 0. (31) 


a. By equating coefficients of y' in equations (30) and (31), 
show that jz must be a solution of 


Pu' =(O- P)p. (32) 


b. Solve equation (32) and thereby show that 


* Q(s) 
o( E P) is) (33) 


Compare this result with that of Problem 31 in Section 3.2. 


1 
u(x) = Py” 


In each of Problems 12 through 15, use the method of Problem 11 to 
transform the given equation into the form ( p(x) y’)’ + q(x)y = 0. 


12. y”—2xy’+Ay =0 
13. x?y" +xy' + (x3? — 72)y =0 
14. xy”+(1—x)y’+Ay =0 

15. (1—x2)y" — xy' c a?y =0 


(Hermite equation) 
(Bessel equation) 
(Laguerre equation) 
(Chebyshev equation) 
16. Theequation 


Un + cu; +ku = a2uy, + F(x, t), (34) 


where à? > 0,c > 0, and k > O are constants, is known as 


the telegraph equation. It arises in the study of an elastic string 
under tension (see Appendix B of Chapter 10). Equation (34) also 
occurs in other applications. Consider equation (34) with F(x, t) = 0. 
Let u(x,t) = X(x)T(t), separate variables, and derive ordinary 
differential equations for X and T. 


17. Consider the boundary value problem 


y'—2y' -(1--A)y 20, y(0 20, y(1)=0. 


a. Introduce a new dependent variable u by the relation 
y — s(x)u. Find the equivalent boundary value problem in terms 
of u. Determine s(x) so that the differential equation for u has 
no u’ term. 

b. Solve the boundary value problem for u and thereby 
determine the eigenvalues and eigenfunctions of the original 
problem. Assume that all eigenvalues are real. 

c. Solve the given problem directly (without introducing u). 
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18. Consider the boundary value problem 


y"+4y' -(449))y 20, x0) =0, y(L) —0. 


a. Determine, at least approximately, the real eigenvalues and 
the corresponding eigenfunctions by proceeding as in Problem 
17a and b. 

b. Also solve the given problem directly (without introducing 
a new variable). Hint: In part (a), be sure to pay attention to the 
boundary conditions as well as the differential equation. 


The differential equations in Problems 19 and 20 differ from those 
in previous problems in that the parameter A multiplies the y’ term 
as well as the y term. In each of these problems, determine the real 
eigenvalues and the corresponding eigenfunctions. 
19. y” +y +A + y) 20 y(0-20, y(1) =0 
20. x?" — (xy - y) 20, yO) =0, y2-y(Q2) 20 
21. Consider the problem 

y" + Ay = 0, 


2y(0) +y(0) 20, y(1) =0. 


a. Find the determinantal equation satisfied by the positive 
eigenvalues. 

b. Show that there is an infinite sequence of such eigenvalues. 
c. Find A, and A5. Then show that A, = ((2n + 2/2)? for 
large n. 

d. Find the determinantal equation satisfied by the negative 
eigenvalues. 

e. Show that there is exactly one negative eigenvalue and find 
its value. 


22. Consider the problem 


y"+Ay=0, ay(0)+y'(0) —0, y(1)—0 

where a is a given constant. 
a. Show that for all values of a there is an infinite sequence of 
positive eigenvalues. 
b. If o < 1, show that all (real) eigenvalues are positive. Show 
that the smallest eigenvalue approaches zero as o approaches 1 
from below. 
c. Show that A = 0 is an eigenvalue only if a = 1. 
d. Ifa > 1, show that there is exactly one negative eigenvalue 
and that this eigenvalue decreases as a increases. 

23. Consider the problem 

y(L) — 0. 


y”+Ay=0, y(0) =0, 


Show that if ọm and ¢, are eigenfunctions corresponding to the 
eigenvalues Am and Àn, respectively, with Am Æ An, then 


L 
f Pm(X) bn(x)dx =0. 
0 


Hint: Note that 
" Anm = 9, or + Andn — O. 


Multiply the first of these equations by $,, the second by ¢,,, and 
integrate from 0 to L, using integration by parts. Finally, subtract one 
equation from the other. 


24. Inthis problem we consider a higher-order eigenvalue problem. 
The analysis of transverse vibrations of a uniform elastic bar is based 
on the differential equation 


y? — Ay 20, 


where y is the transverse displacement and à = mw?/EI; m is the 
mass per unit length of the rod, E is Young's modulus, / is 
the moment of inertia of the cross section about an axis through 
the centroid perpendicular to the plane of vibration, and w is the 
frequency of vibration. Thus, for a bar whose material and geometrical 
properties are given, the eigenvalues determine the natural frequencies 
of vibration. Boundary conditions at each end are usually one of the 
following types: 


y=y’=0, clamped end, 
y =y” =0, simply supported or hinged end, 
y" =y" =0, free end. 


For each of the following three cases, find the form of the 
eigenfunctions and the equation satisfied by the eigenvalues of this 
fourth-order boundary value problem. Determine À; and A5, the two 
eigenvalues of smallest magnitude. Assume that the eigenvalues are 
real and positive. 

a. y(0) = y"(0 20, y(L) = y"(L) =0 

b. y(0) =y"(0) =0, y(L)=y(L) =0 

€. y(0) = y(0) = 0, y"(L) = y"(L) = 0 (cantilevered 

bar) 
25. This problem illustrates that the eigenvalue parameter 
sometimes appears in the boundary conditions as well as in the 
differential equation. Consider the longitudinal vibrations of a uniform 
straight elastic bar of length L and cross-sectional area A. It can 
be shown that the axial displacement u(x,t) satisfies the partial 
differential equation 


E 
—Hu ug; O< x<_L, ftl 0, (35) 
where E is Young's modulus and p is the mass per unit volume. If the 


end x = 0 is fixed, then the boundary condition there is 


ulot) =0, r0. (36) 


Suppose that the end x = L is rigidly attached to a mass m but is 
otherwise unrestrained. We can obtain the boundary condition here by 
writing Newton’s law for the mass. From the theory of elasticity, it 
can be shown that the force exerted by the bar on the mass is given 
by — E Au,( L, t). Hence the boundary condition is 


EAus(L,t) 4 mug(L,t) =0, t» 0. (37) 


a. Assume that u(x,t) = X(x) T(t), and show that X(x) and 
T(t) satisfy the differential equations 


X"+rXK = 0, (38) 


E 
T" +A—T =0. (39) 
p 


b. Show that the boundary conditions are 


X(0 20, X'(L)—w«ALX(L) —0, (40) 


where y = is a dimensionless parameter that gives the 


ratio of the end mass to the mass of the bar. 

Hint: Use the differential equation for T(t) in simplifying the 
boundary condition at x = L. 

c. Determine the form of the eigenfunctions and the equation 
satisfied by the real eigenvalues of equations (38) and (40). 

d. Find the first two eigenvalues A, and \ if y = 0.5. 
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12. Sturm-Liouville Boundary Value 
Problems 


We now consider two-point boundary value problems of the type obtained in Section 11.1 by 
separating the variables in a heat conduction problem for a bar of variable material properties 
and with a source term proportional to the temperature. This kind of problem also occurs in 
many other applications. 

These boundary value problems are commonly referred to as Sturm-Liouville! boundary 
value problems. They consist of a differential equation of the form 


(p(x)y')’ — qG)y + Ar(x) y 20 (1) 
on the interval 0 < x < 1, together with the boundary conditions 
o1y(0) +azy'(0) =0, B1y(1) + 82y'(1) 20 Q) 


at the endpoints. It is often convenient to introduce the linear homogeneous differential 
operator L defined by 


L[y] = -(pGO y) + qG2y. (3) 
Then the differential equation (1) can be written as 
Liy] = Ar(x) y. (4) 


We assume that the functions p, p', q, and r are continuous on the interval 0 < x < 1 
and, further, that p(x) > Oandr(x) > Oatall points inO < x < 1. In this case the boundary 
value problem is said to be regular. These assumptions are necessary to render the theory as 
simple as possible while retaining considerable generality. It turns out that these conditions 
are satisfied in many significant problems in mathematical physics. For example, the equation 
y"-FAy = 0, which arose repeatedly in the preceding chapter, is of the form (1) with p(x) = 1, 
q(x) =0, andr(x) = 1. 

The boundary conditions (2) are said to be separated; that is, each involves only one of the 
boundary points. These are the most general separated boundary conditions that are possible 
for a second-order differential equation. Note that for the first boundary condition to impose 
any restriction on y, it is necessary for at least one of the numbers a and o? to be nonzero. 
Similarly, at least one of the numbers (3; and 8; must be nonzero. 

Before proceeding to establish some of the properties of the Sturm-Liouville problem (1), 
(2), it is necessary to derive an identity, known as Lagrange’s identity, that is basic to the 
study of linear boundary value problems. Let u and v be functions having continuous second 
derivatives on the interval 0 < x < 1. Then? 


1 1 
n L|u]vdx = | (—(pu')'v + quv)dx. 
0 0 
Integrating the first term on the right-hand side twice by parts, we obtain 


n Ludt = -pGowG)voo |  pooucov'Go |, + f (—u( pv')' + uqv)dx 
0 0 


i 1 
-plau G)v() -uovo f uL[v]dz. 


'Charles-Francois Sturm (1803-1855) and Joseph Liouville (1809-1882), in a series of papers in 1836 and 1837, set 
forth many properties of the class of boundary value problems associated with their names, including the results stated 
in Theorems 11.2.1 to 11.2.4. Sturm was born in Geneva, Switzerland, but spent almost his entire adult life in Paris. 
He is also famous for a theorem on the number of real zeros of a polynomial and, in addition, did extensive work 
in physics and mechanics. Liouville was a French mathematician who did notable research in analysis, algebra, and 
number theory. One of his most important results was the proof (in 1844) of the existence of transcendental numbers. 
He was also the founder, and for 39 years the editor, of the influential Journal de mathématiques pures et appliquées. 


; ! z 1 1 A ! 
?For brevity, we sometimes use the notation Í 0 f dx rather than i " f(x)dx in this chapter. 
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Hence, upon transposing the integral on the right-hand side, we have 
1 1 
n (L(u)v —uL(v))dx = — p(x) (ula) v(x) — uG)v'Q))|- (5) 
0 


Equation (5) is known as Lagrange’s identity. 

Now let us suppose that the functions u and v in equation (5) also satisfy the boundary 
conditions (2). Then, if we assume that a. + 0 and 85 + 0, the right-hand side of equation (5) 
becomes 


1 
— p(x) (u(x)v(x) — u(x) v'(x)) l 


= —p(D)'(Dv(1) — u(1)v(1)) + p(0)(u(0)v(0) — u(0)v'(0)) 

_ By By Q1 Q1 

= —p(1) | —zu(1)v(1) + z-u(1)v(1) } + p(0)| ——u(0)v(0) + —u(0)v(0) 
Bo Bo az a 

= 0. 


The same result holds if either o? or 95 is zero; the proof in this case is even simpler 
and is left for you. Thus, if the differential operator L is defined by equation (3), and if the 
functions u and v satisfy the boundary conditions (2), then Lagrange's identity reduces to 


1 
f (L[u]v — uL[v] dx = 0. (6) 
0 


Let us now write equation (6) in a slightly different way. In equation (4) of Section 10.2 
we introduced the inner product (u, v) of two real-valued functions u and v on a given interval; 
using the interval 0 € x < 1, we have 


1 
(u, v) = u(x)v(x)dx. (7) 
0 


In this notation, equation (6) becomes 
(L[u], v) — (u, L[v]) = 0. (8) 


In proving Theorem 11.2.1 below, it is necessary to deal with complex-valued functions. 
By analogy with the definition in Section 7.2 for vectors, we define the inner product of two 
complex-valued functions on 0 € x < 1 as 


1 
(u, v) Es u(x)v(x)dx, (9) 
0 


where v is the complex conjugate of v. Clearly, equation (9) coincides with equation (7) if 
u(x) and v(x) are real. It is important to know that equation (8) remains valid under the 
stated conditions if u and v are complex-valued functions and if the inner product is defined 


as in equation (9). To see this, you can start with the quantity L[u]v dx and retrace the 


steps leading to equation (6), making use of the fact that p(x), g(x), o1, o», 61, and 83 are 
all real-valued quantities (see Problem 22). 

We now consider some of the implications of equation (8) for the Sturm-Liouville 
boundary value problem (1), (2). We assume without proof? that this problem actually has 
eigenvalues and eigenfunctions. In Theorems 11.2.1 to 11.2.4 below, we state several of their 
important, but relatively elementary, properties. Each of these properties is illustrated by the 
basic Sturm-Liouville problem 


y’+rAy=0, y(0)=0, y(1) 2-0 (10) 


whose eigenvalues are A, = n?n’, with corresponding eigenfunctions $,(x) = sin(nm x). 


Theorem 11.2.1 


All the eigenvalues of the Sturm-Liouville problem (1), (2) are real-valued. 


3For a proof of this statement, see the books by Sagan (Chapter 5) or by Birkhoff and Rota (Chapter 10) listed in the 
References at the end of this chapter. 
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To prove this theorem, let us suppose that A is a (possibly complex) eigenvalue of the 
problem (1), (2) and that @ is a corresponding eigenfunction, also possibly complex-valued. 
Let us write A = u + iv and d(x) = U(x) +iV(x), where u, v, U(x), and V(x) are 
real-valued. Then, if we let u = $ and also v = à in equation (8), we have 


(L[6], 6) = Có. LI). (11) 
However, we know that L[ó] = Ard, so equation (11) becomes 
(Arg, $) — (6, Ard). (12) 


Writing out equation (12) in full, using the definition (9) of the inner product; we obtain 
f E m f PITTS a3) 
Since r(x) is real, r(x) = r(x), and equation (13) reduces to 
(=D [ r(x)b(x) O(a) dx =0. 
However, ¢(x)¢(x) = U?(x) + V?(x), so we have 
A-J) [ r(x) (Pe + V(x) dx =0. (14) 


The integrand in equation (14) is nonnegative and not identically zero. Since the integrand is 
also continuous, it follows that the integral is positive. Therefore, the factor A — A = 2iv must 
be zero. Hence v = 0 and A is real-valued, so the theorem is proved. 

An important consequence of Theorem 11.2.1 is that in finding eigenvalues and 
eigenfunctions of a Sturm-Liouville boundary value problem, we need look only for real 
eigenvalues. Recall that this is what we did in Chapter 10. It is also possible to show that the 
eigenfunctions of the boundary value problem (1), (2) are real-valued. A proof is sketched in 
Problem 23. 


Theorem 11.2.2 | Orthogonality of Sturm-Liouville 


Eigenfunctions 


Eigenfunctions of the Sturm-Liouville boundary value problem (1), (2) from different eigenvalues 
are orthogonal with respect to the weight function r. That is, if ¢,, and 9, are two eigenfunctions of 
the Sturm-Liouville problem (1), (2) corresponding to eigenvalues A,, and ,,, respectively, and if 
Am Æ Xn, then 


ji 
I r(x) @m(X) @n(x)dx = 0. (15) 
0 


To prove Theorem 11.2.2, we note that m and à, satisfy the differential equations 


Llom] = AnrÓm (16) 
and 

Ll onl = Anrons (17) 
respectively. If we let u = @m, let v = dy, and substitute for L[u] and L[v] in equation (8), 
we obtain 


(Agr m, Q5) um (Om; AnT Pn) = 0, 


or, using equation (9), 
1 1 
rw f r(x) bm(x)b,(x)dx -%, f bm(x)F(x) b,(x)dx = 0. 


Because An, r(x), and ¢,(x) are real, this equation becomes 


jl 
(Am — An) n r(X)Om(X) On(x)dx = 0. (18) 
0 
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Since by hypothesis Am Æ An, it follows that m and ¢, must satisfy equation (15), and the 
theorem is proved. 


Theorem 11.2.3 


The eigenvalues of the Sturm-Liouville problem (1), (2) are all simple; that is, to each eigenvalue there 
corresponds only one linearly independent eigenfunction. Further, the eigenvalues form an infinite 
sequence and can be ordered according to increasing magnitude so that 


Ay < Ad < AZ x AQX cn. 


Moreover, À, — oo as n — oc. 


The proof of this theorem is somewhat more advanced than those of the two previous 
theorems and will be omitted. However, a proof that the eigenvalues are simple is outlined in 
Problem 20. 

Again, we note that all the properties stated in Theorems 11.2.1 to 11.2.3 are exemplified 
by the eigenvalues A,, = n?7? and eigenfunctions ó,(x) = sin(n x) of the example problem 
(10). Clearly, the eigenvalues are real-valued. The eigenfunctions satisfy the orthogonality 
relation 


1 1 
I r(x)ós(x)ós(x)dx = » sin(mmx) sin(nrx)dx =0, m zn, (19) 
0 0 


which was established in Section 10.2 by direct integration. Further, the eigenvalues can be 
ordered so that Àj; < A» < --- , and A, — œ as n — oo. Finally, to each eigenvalue there 
corresponds a single linearly independent eigenfunction. 

We will now assume that the eigenvalues of the Sturm-Liouville problem (1), (2) are 
ordered as indicated in Theorem 11.2.3. Associated with the eigenvalue A, is a corresponding 
eigenfunction $,, determined up to a multiplicative constant. It is often convenient to choose 
the arbitrary constant multiplying each eigenfunction so as to satisfy the condition 


1 
| Axe (jaral,y nal? 2... (20) 
0 

Equation (20) is called a normalization condition, and eigenfunctions satisfying this 
condition are said to be normalized (with respect to the weight function r). Indeed, in this case, 
the eigenfunctions are said to form an orthonormal set (with respect to the weight function 
r) since they already satisfy the orthogonality relation (15). It is sometimes useful to combine 
equations (15) and (20) into a single equation. To this end we introduce the symbol 6,,,, which 
is known as the Kronecker^ delta and is defined by 


0, ifm zn, 
bmn = š 21 
s ifm=n. QD 
Making use of the Kronecker delta, we can write equations (15) and (20) as 
1 
f 1(X) Om(X) On(x) dx = 6 mn- (22) 
0 


EXAMPLE 1 


Determine the normalized eigenfunctions of the problem (10): 


y”+Ay=0, (0) =0, y(1 =0. 


4The German mathematician Leopold Kronecker (1823-1891), a student of Dirichlet, was associated with the 
University of Berlin for most of his life, although (since he was independently wealthy) he held a faculty position 
only from 1883 onward. He worked in number theory, elliptic functions, algebra, and their interconnections. 


11.2 Sturm-Liouville Boundary Value Problems 


Solution: 
The eigenvalues of this problem are A; = 1, A2 = 4n7,...,An = nT’, ... , and the 
corresponding eigenfunctions are kı sin(7 x), ko sin(2z x), ... , k, sin(nz x), ... , respectively. In 


this case the weight function is r(x) = 1. To satisfy equation (20), we must choose k, so that 
1 
n (ky sin(nzx))?dx 21 (23) 
0 
for each value of n. Since 


sf aa ad i. ly 
ki sin (nz x)dx = k; z-—zcos(2nazx) |dx = =k}, 
0 o \2 2 2 


Equation (23) is satisfied if k, is chosen to be y2 for each value of n. Hence the normalized 
eigenfunctions of the given boundary value problem are 


on(x) = y2sin(nzx), n=1,2,3,.... (24) 


EXAMPLE 2 


Determine the normalized eigenfunctions of the problem 
y"+Ay=0, y(0)=0, y(D) +y) =0. (25) 


Solution: 


In Example 1 of Section 11.1, we found that the eigenvalue A, satisfies the equation 


sin NA + VAn cos V As =0 (26) 


and that the corresponding eigenfunction is 
n(x) = ka sin( VAn x), Q7) 


where kn is arbitrary. We can determine kn from the normalization condition (20). Since r(x) = 1 
in this problem, we have 


i 1 
7 r(x) ó2(x)dx xd sin ( VAn x)ax 
0 0 


17I c x  sin2 VAn x) 
-g Pele J epp. eve) 
=g f (; 5 cos(2 a) Jas «(i FR ) 


E 1 sin(2 VAn) 2/1 sin / As cos VAn 
OE 2. e J "N2 NEM 


1 


0 


where in the last step we have used equation (26). Hence, to normalize the eigenfunctions ¢,, we 
must choose 


2 1/2 
ky = ——3 7 . (28) 
1 + cos VAn 
The normalized eigenfunctions of the given problem are 
2sin/A, x 
n(x) = Vasin A, 02:152. does (29) 


1/2’ 
(1 + cos? Vn) 


We now turn to the question of expressing a given function f as a series of eigenfunctions 
of the Sturm-Liouville problem (1), (2). We have already seen examples of such expansions 
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in Sections 10.2 to 10.4. For example, it was shown in Section 10.4 that if a function f is 
continuous and has a piecewise continuous derivative on0 < x < 1, and satisfies the boundary 
conditions f(0) = f(1) = 0, then f can be expanded in a Fourier sine series of the form 


oo 
f(x) = M b, sin(nnx). (30) 
n=1 
The functions sin(nz x), n = 1,2, ... , are precisely the eigenfunctions of the boundary 
value problem (10). The coefficients b, are given by 
1 
b, — 2 | f(x) sin(ntx)dx (31) 
0 


and the series (30) converges for each x in0 < x < 1. In a similar way, f can be expanded in 
a Fourier cosine series using the eigenfunctions cos(n7 x), n = 0,1,2, ... , of the boundary 
value problem y" + Ay = 0, y'(0) 20, y(1) =0. 

Now suppose that a given function f, satisfying suitable conditions, can be expanded in 
an infinite series of eigenfunctions of the more general Sturm-Liouville problem (1), (2). If 
this can be done, then we have 


oo 
f(x) = Do enbal x), (32) 
n=1 
where the functions ¢,(x) satisfy equations (1), (2) and also the orthonormality condition 
(22). To compute the coefficients in the series (32), we multiply equation (32) by r(x) m(x), 
where m is a fixed positive integer, and integrate from x = 0 to x = 1. Assuming that the 
series can be integrated term by term, we obtain 


1 oo 1 oo 
n (x) FOod ads ec cn f r(x)dm(x)on(x)dx =X cnm 83) 


n=1 n=1 


By the definition of mn, the last sum in equation (33) reduces to the single term cm. Hence, 
using the definition of the inner product, we have 


1 
Cm = r(x) f(x) bm(x)dx = (frm) m=1,2,.... GA 


The coefficients in the series (32) have thus been formally determined. Equation (34) has the 
same structure? as the Euler— Fourier formulas for the coefficients in a Fourier series, and the 
eigenfunction series (32) also has convergence properties similar to those of Fourier series. 

The following theorem is analogous to the Fourier Convergence Theorem (Theorem 
10.3.1). 


Theorem 11.2.4 


Let 1, 95, ... , Ọn, ... be the normalized eigenfunctions of the Sturm-Liouville problem (1), (2): 
(pG) y)! — q(x)y + Ar(x)y = 0, 
o1X(0) c a2y'(0) =0, 81y(1) + Boy’) — 0. 
Let f and f' be piecewise continuous on 0 < x < 1. Then the series (32) whose coefficients 
Cm are given by equation (34) converges to K f(x*) + f(x7)) at each point in the open interval 


VEE 

Moreover, the series (32) converges at each point in the closed interval 0 < x < 1, provided f is 
continuous (and f’ is piecewise continuous) on 0 < x < 1 and f satisfies the following boundary 
conditions: 


(i) If $,(0) = 0 for all n, that is, a2 = 0 in the first of boundary conditions (2), then assume that 


f(0) = 0, and 
(ii) If $,(1) = 0 for all n, that is, 9; = O in the second of boundary conditions (2), then assume 
that f(1) — 0. 


5The alert reader will have noticed that equation (34) does not reduce to equation (31) when ¢,, = sin( mr x). The 
extra factor of 2 appears because these eigenfunctions are orthogonal but not orthonormal. 
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EXAMPLE 3 


Expand the function 
f@)=x, 0zxzl (35) 


in terms of the normalized eigenfunctions ¢,(x) of the problem (25). 


Solution: 


In Example 2, where r(x) = 1, we found the normalized eigenfunctions to be 
n(x) = k, sin( V Àn X), (36) 


where k, is given by equation (28) and A, satisfies equation (26). To find the expansion for f in 


terms of the eigenfunctions $,, we write 
oo 


f= > edata), (37) 


n-l 


where the coefficients are given by equation (34). Thus 


1 1 
o | OSOD = hy f xsin( v) ax 
0 0 


Integrating by parts, we obtain 


-—— sin/À, | cos / A, -k 2sin /An 
n n Nn IAS n An EJ 


where we have used equation (26) in the last step. Upon substituting for k, from equation (28), we 
obtain 


24/2sin /An 
1/2° 
An (1 + cos? JX) 


(38) 


Cg — 


Thus 


f(x) 2437 PAGAR. aiibi (39) 


n=l An (1 + cos? X.) 
In Example 1 of Section 10.4, we found the Fourier sine series for f(x) — x (with L — 1) to be 


-pr 


2 x- 
f(x) = = 3 ———— sin(nma). 


While equation (39) is a series of sine functions, it is not a Fourier sine series. This means its 


convergence cannot be ascertained from the Fourier Convergence Theorem (Theorem 10.3.1). But 
from Theorem 11.2.4, we conclude that series (39) converges pointwise to f(x) = x for0 <x < 1. 


Self-Adjoint Problems. Sturm-Liouville boundary value problems are of great importance 
in their own right, but they can also be viewed as belonging to a much more extensive class 
of problems that have many of the same properties. For example, there are many similarities 
between Sturm-Liouville problems and the algebraic system 


AX — Ax, (40) 


where the n x n matrix À is real symmetric or Hermitian. Comparing the results mentioned 
in Section 7.3 with those of this section, we note that in both cases the eigenvalues are real, 
and the eigenfunctions or eigenvectors form an orthogonal set. Further, the eigenfunctions 
or eigenvectors can be used as the basis for expressing an essentially arbitrary function or 
vector, respectively, as a sum. The most important difference is that a matrix has only a finite 
number of eigenvalues and eigenvectors, while a Sturm-Liouville problem has infinitely many. 
It is interesting and of fundamental importance in mathematics that these seemingly different 
problems —the matrix problem (40) and the Sturm-Liouville problem (1), (2) — which arise in 
different ways, are actually parts of a single underlying theory. This theory is usually referred 
to as linear operator theory and is part of the subject of functional analysis. 

We now point out some ways in which Sturm-Liouville problems can be generalized, 
while still preserving the main results of Theorems 11.2.1 to 11.2.4—the existence of a 
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sequence of real eigenvalues tending to infinity, the orthogonality of the eigenfunctions, 
and the possibility of expressing an arbitrary function as a series of eigenfunctions. These 
generalizations depend on the continued validity of the crucial relation (8). 

Let us consider the boundary value problem consisting of the differential equation 


Liy] =Ar(x)y, O<x <1, (41) 


where 


d"y dy 
Liy] = P,(x) —— + +++ + Pi(x) + Po(x)y, (42) 
dx" dx 


and n linear homogeneous boundary conditions at the endpoints. If equation (8) is valid for 
every pair of sufficiently differentiable functions that satisfy the boundary conditions, then the 
given problem is said to be self-adjoint. It is important to observe that equation (8) involves 
restrictions on both the differential equation and the boundary conditions. The differential 
operator L must be such that the same operator appears in both terms of equation (8). This 
requires L to be of even order. Further, a second-order operator must have the form (3), a 
fourth-order operator must have the form 


Lily] = (p(x)y")" — (q(x) y) + s(x) y, (43) 


and higher-order operators must have an analogous structure. In addition, the boundary 
conditions must be such as to eliminate the boundary terms that arise during the integration 
by parts used in deriving equation (8). For example, in a second-order problem, this is true for 
the separated boundary conditions (2) and also in certain other cases, one of which is given in 
Example 4 below. 

Let us suppose that we have a self-adjoint boundary value problem for equation (41), 
where L[y] is given now by equation (43). We assume that p, g, 7, and s are continuous on 
0 < x < 1 and that the derivatives of p and q indicated in equation (43) are also continuous. 
If in addition p(x) > O and r(x) > O for 0 < x < 1, then there is an infinite sequence of 
real eigenvalues tending to +-oo, the eigenfunctions are orthogonal with respect to the weight 
function r, and an arbitrary function can be expressed as a series of eigenfunctions. However, 
the eigenvalues may not be simple in these more general problems. 

We turn now to the relation between Sturm-Liouville problems and Fourier series. 
We have noted previously that Fourier sine and cosine series can be obtained by using 
the eigenfunctions of certain Sturm-Liouville problems involving the differential equation 
y’+Ay = 0 with boundary conditions y(0) = y(1) = Oand y'(0) = y'(1) = 0, respectively. 
This raises the question of whether we can obtain a full Fourier series, including both sine and 
cosine terms, by choosing a suitable set of boundary conditions. The answer is provided by the 
following example, which also serves to illustrate the occurrence of nonseparated boundary 
conditions. 


EXAMPLE 4 


Find the eigenvalues and eigenfunctions of the boundary value problem 


y' +Ay =0, (44) 
y(-D—»(1) 20, y(—1)-—y(1) = 0. (45) 


Solution: 


This is not a Sturm-Liouville problem because the boundary conditions are not separated. The 
boundary conditions (45) are called periodic boundary conditions since they require that y and 
y’ assume the same values at x = 1 as at x = —1. Nevertheless, it is straightforward to show that 
the problem (44), (45) is self-adjoint. A simple calculation establishes that Ao = 0 is an eigenvalue 
and that the corresponding eigenfunction is @9(x) = 1. Further, there are additional eigenvalues 
A, = 1, Ar = (27)*, ...,An = (n7)?, ... . To each of these nonzero eigenvalues there 
correspond two linearly independent eigenfunctions; for example, corresponding to A, are the two 
eigenfunctions @,(x) = cos(mmx) and v,(x) = sin(nz x). This illustrates that the eigenvalues 
may not be simple when the boundary conditions are not separated. Further, if we seek to expand a 
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given function f of period 2 in a series of eigenfunctions of the problem (44), (45), we obtain the 


series 


= E + Y (aq cos(nax) + by sinn), 


n=1 


which is just the Fourier series for f on the interval —1 < x < 1. 


We will not give further consideration to problems that do not have separated boundary 
conditions, nor will we deal with problems of higher than second order, except in a few 
problems. There is, however, one other kind of generalization that we do wish to discuss. 
That is the case in which the coefficients p, g, and r in equation (1) do not quite satisfy the 
rather strict continuity and positivity requirements laid down at the beginning of this section. 
Such problems are called singular Sturm-Liouville boundary value problems and are the 
subject of Section 11.4. 


Problems 


In each of Problems 1 through 5, determine the normalized 
eigenfunctions of the given problem. 


l. y”’+Ay=0, y(020, y(1)=0 
2. y"+dAy=0, y(0)=0, y(1) =0 
3. y’+Ay=0, y(0)=0, y(120 
4. y"+rAy=0, y(0 =0, y«(1-4»y»(D-0; 


see Section 11.1, Problem 8. 


5. y"—2y' F (1 A)y «0, 
see Section 11.1, Problem 17. 


x0) =0, y(1) =0; 


In each of Problems 6 through 9, find the coefficients in the 
oo 


eigenfunction expansion X a, Q4 (x) of the given function, using the 
n-l 
normalized eigenfunctions of Problem 1. 


6. f(x) 21, O<xK<1 
7. f(x) 2x, O<x<1 
1 
1, Sees S 
8. f(x) = 1 
0, —~<x<l 
2 
1 
2x, D SUR G 
9. f(x) = i 
1, ~<x<l 
2 


In each of Problems 10 through 13, find the coefficients in the 
oo 


eigenfunction expansion Y` an¢n(x) of the given function, using the 
n=1 
normalized eigenfunctions of Problem 4. 


10. f(x) =1, Oxxxl 
11. f(x) =x, OK<x<1 
12. f(x)21-x, O<x<1 
1, O<x< = 
13. f(x) = 1 
0, fe 


In each of Problems 14 through 18, determine whether the given 
boundary value problem is self-adjoint. 


14. y"+y'+2y=0, y(0)=0, y(1) =0 
15. (1+x?)y”+2xy'+y=0, y(0) —0, y(1)+2y’(1) =0 
16. y’+y=Ay, y(0-y(120, y/(0) —y(1) =0 


17. (14+x*)y" + 2xy’ + y 2 (14 x2)y, 
XO — y'(1) 20, y'(0) +2y(1) 20 
18. y”+Ay=0, y(0)=0, yr)+y(r) =0 
19. Show that if the functions u and v satisfy equations (2), and 
either a = 0 or 85 = 0, or both, then i 
p(x) u(x)v(x) — u(x)v'(x)) a 0. 
20. In this problem we outline a proof of the first part of Theorem 
11.2.3: that the eigenvalues of the Sturm-Liouville problem (1), (2) 
are simple. The proof is by contradiction. 
a. Suppose that a given eigenvalue A is not simple. Then 
there exist two corresponding eigenfunctions o and ó» that are 
linearly independent— that is, not multiples of each other. 
b. Compute the Wronskian W[ó;,$5](x), and use the 
boundary conditions (2) to show that W[¢,, $5](0) = 0. 
C. Use Theorem 3.2.7 to reach a contradiction, which 
establishes that the eigenvalues must be simple, as asserted in 
Theorem 11.2.3. 


21. Consider the Sturm-Liouville problem 


—(G)»y)' -q(x)y = Ar(x)y, 0€ x «1 
ayy(0) + a2y'(0) 20, Bi y(1) + 85y'(1) = 0, 


where p, q, and r satisfy the conditions stated in the text. 


a. Show that if A is an eigenvalue and ọ a corresponding 
eigenfunction, then 


1 1 
af rds = | Qo tagd + Epe 
0 0 B5 


- 21p(0) 9X0), 
a2 


provided that œ) Z 0 and 85 ¥ 0. How must this result be 
modified if o. = 0 or 8; = 0? 

b. Show that if g(x) > 0 and if 81/085 and —a,/az are 
nonnegative, then the eigenvalue A is nonnegative. 

C. Under the conditions of part b, show that the eigenvalue A is 
strictly positive unless a; = 3; = 0 and q(x) = 0 for each x in 
O<x<l. 


22. Derive equation (8) using the inner product (9) and assuming 
that u and v are complex-valued functions. Hint: Consider the quantity 


1 
n L[u]vdx, split u and v into real and imaginary parts, and proceed 
0 


as in the text. 
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23. Inthis problem we outline a proof that the eigenfunctions of the 
Sturm-Liouville problem (1), (2) are real-valued. 
a. Let A be an eigenvalue and let ó(x) = U(x) + iV(x) 
be a corresponding eigenfunction. Show that U and V are also 
eigenfunctions corresponding to A. 
b. Using Theorem 11.2.3, or the result of Problem 20, show that 
U and V are linearly dependent. 
C. Show that ¢ must be real-valued, apart from an arbitrary 
multiplicative constant that may be complex. 


24. Consider the problem 


xy" 2 A(xy' - y), y) =0, y(2) =0. 


Note that À appears as a coefficient of y' as well as of y itself. It is 
possible to extend the definition of self-adjointness to this type of 
problem and to show that this particular problem is not self-adjoint. 
Show that the problem has eigenvalues but that none of them is real. 
This illustrates that in general, nonself-adjoint problems may have 
eigenvalues that are not real. 

Buckling of an Elastic Column. An investigation of the buckling of a 
uniform elastic column of length L by an axialload P (Figure 11.2.1a) 
leads to the differential equation 


yO + Ay'20 O<x<L. (46) 
y 
P 
——| — 
x 
| | 
x=0 =L 


column 


(a A 
under compression. (b) Shape of 
the buckled column. 


The parameter À is equal to P/(EI), where E is Young's 
modulus and / is the moment of inertia of the cross section about an 
axis through the centroid perpendicular to the x y-plane. The boundary 
conditions at x — 0 and x — L depend on how the ends of the column 
are supported. Typical boundary conditions are 


yy =0, clamped end, 


y=y"=0, simply supported (hinged) end. 


The bar shown in Figure 11.2.1a is simply supported atx = 0 
and clamped at x = L. It is desired to determine the eigenvalues 
and eigenfunctions of equation (46) subject to suitable boundary 
conditions. In particular, the smallest eigenvalue A; gives the load 
at which the column buckles, or can assume a curved equilibrium 
position, as shown in Figure 11.2.15. The corresponding eigenfunction 
describes the configuration of the buckled column. Note that the 
differential equation (46) does not fall within the theory discussed in 
this section. It is possible to show, however, that in each of the cases 
given here all the eigenvalues are real and positive. Problems 25 and 
26 deal with column-buckling problems. 


25. For each of the following boundary conditions, find the smallest 
eigenvalue (the buckling load) of y^? + Ay" = 0, and also find the 
corresponding eigenfunction (the shape of the buckled column). 

a. y(0) =y"(0) =0, y(L) = y"(L) 20 

b. y(0) 2 y"(0) =0, y(L) = y'(L) 20 

€. y(0) 2 0 =0,  y(L) 2y'(L) 20 
26. In some buckling problems the eigenvalue parameter appears 
in the boundary conditions as well as in the differential equation. One 
such case occurs when one end of the column is clamped and the other 
end is free. In this case the differential equation y? + Ay" = 0 must 
be solved subject to the boundary conditions 


y(0) —0, y(0)=0, y'(L)-—0, y"(L)-Ay(L) =0. 


Find the smallest eigenvalue and the corresponding eigenfunction. 


27. Solutes in an aquifer are transported by two separate 
mechanisms. The process by which a solute is transported by the bulk 
motion of the flowing groundwater is called advection. In addition, 
the solute is spread by small-scale fluctuations in the groundwater 
velocity along the tortuous flow paths within individual pores, a 
process called mechanical dispersion. The one-dimensional form of 
the advection -dispersion equation for a nonreactive dissolved solute 
in a saturated, homogeneous, isotropic porous medium under steady, 
uniform flow is 


c; tvecy = Dey, O«x«L, t»0, (47) 


where c(x,t) is the concentration of the solute, v is the average 
linear groundwater velocity, D is the coefficient of hydrodynamic 
dispersion, and L is the length of the aquifer. Suppose that the 
boundary conditions are 


c(0, ft) 20, cx(L,t) =0, t0 (48) 
and that the initial condition is 
c(x,0) = f(x), O<x<L, (49) 


where f(x) is the given initial concentration of the solute. 
a. Assume that c(x,t) =  X(x)T(t), use the method of 
separation of variables, and find the equations satisfied by X(x) 
and T(t), respectively. Show that the problem for X(x) can be 
written in the Sturm-Liouville form 


(p(x)X)' + Ar(x)X 20, 0<x< L, (50) 


X(0)=0, X(L) =0, (51) 


where p(x) = r(x) = exp(—vx/D). Hence the eigenvalues 
are real, and the eigenfunctions are orthogonal with respect to 
the weight function r(x). 

b. Let u? = A — v?/(4D?). Show that the eigenfunctions are 


X,4(x) = e"/ OD sin( nx), (52) 


where ,1, satisfies the equation 
tan( u L) = ——— (53) 


Q c. Show graphically that equation (53) has an infinite 
sequence of positive roots and that jj, = (2n — 1)r /(2L) for 
large n. 

d. Show that 


x 2 L V ee) 
r(x) X;(x)dx = — + —, sin (nL). 
0 2. 4Dun 


e. Find a formal solution of the problem (47), (48), (49) in terms 
of a series of the eigenfunctions X,(x). 


Of. Letv = 1, D = 0.5, L = 10, and f(x) = 6(x — 3), 
where 6 is the Dirac delta? function. Using the solution found 
in part (e), plot c(x,t) versus x for several values of t, such 
ast = 0.5, 1, 3, 6, and 10. Also plot c(x,t) versus t for 
several values of x. Note that the number of terms that are needed 
to obtain an accurate plot depends strongly on the values of t 
and x. 

g. Describe in a few words how the solution evolves as time 
advances. 
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cx(L,t) = 0, £i 0; 


c(x, 0) = 0, «XL, 


where c(x, t), v, and D are as in Problem 27. 


a. Assuming that c(x,t) = co + u(x, t), find the boundary 
value problem satisfied by u(x, t). 

b. Proceeding as in Problem 27, find u(x,t) in terms of an 
eigenfunction expansion. 

Qc. Letv = 1, D = 0.5, co = 1, and L = 10. Plot c(x, t) 


28. A nonreactive tracer at concentration co is continuously 
introduced into a steady flow at the upstream end of a column of length 
L packed with a homogeneous granular medium. Assuming that the 
tracer concentration in the column is initially zero, the boundary value 


versus x for several values of t, and also plot c(x, t) versus t for 
several values of x. 

d. Describe in a few words how the solution evolves with time. 
For example, about how long does it take for the steady-state 


problem that models this process is 


€; c vér = Dey, 0 cx «c Lb, t> 0, 


c(0, t) = co, t>0, 


1.3. Nonhomogeneous Boundary Value 
Problems 


In this section we discuss how to solve nonhomogeneous boundary value problems for both 
ordinary and partial differential equations. Most of our attention is directed toward problems 
in which the differential equation alone is nonhomogeneous, while the boundary conditions 
are homogeneous. We assume that the solution can be expanded in a series of eigenfunctions 
of a related homogeneous problem, and then we determine the coefficients in this series so 
that the nonhomogeneous problem is satisfied. We first describe this method as it applies to 
boundary value problems for second-order linear ordinary differential equations. Later we 
illustrate its use for partial differential equations by solving a heat conduction problem in a 
bar with variable material properties and in the presence of source terms. 


Nonhomogeneous Sturm-Liouville Problems. Consider the boundary value problem 
consisting of the nonhomogeneous differential equation 


Liy] = -(pG)y) + q(x) y = ur(x)y + f(x), (1) 


where u is a given constant and f is a given function on 0 < x < 1, and the boundary 
conditions 


Biy(1) + B5y'(1) 2 0. Q) 


As in Section 11.2, we assume that p, p', q, and r are continuous on 0 < x < 1 and 
that p(x) > O and r(x) > 0 there. We will solve the problem (1), (2) by making use of 
the eigenfunctions of the corresponding homogeneous problem consisting of the differential 
equation 


o1X(0) + a2y'(0) — 0, 


Liy] = Ar(x)y (3) 


and the boundary conditions (2). Let Àj < A5 < +++ < An < --+ be the eigenvalues of this 
problem, and let 61, $5. ... , @n, ... be the corresponding normalized eigenfunctions. 

We now assume that the solution y = @(x) of the nonhomogeneous problem (1), (2) can 
be expressed as a series of the form 


(x) = M bósQ). (4) 


n=l 


ÓSee Section 6.5, especially equation (16) of that section. 


solution to be essentially attained? 


546 


CHAPTER 11 Boundary Value Problems and Sturm-Liouville Theory 


From equation (34) of Section 11.2, we know that 
1 
b, -| r(x)o(x)ó,(x)dx, n=1,2,.... (5) 
0 


However, since we do not know $ (x), we cannot use equation (5) to calculate b,,. Instead, we 
will try to determine b, so that the problem (1), (2) is satisfied and then use equation (4) to 
find o (x). Note first that o as given by equation (4) always satisfies the boundary conditions 
(2) since each ġ,„ does. 

The differential equation that ó must satisfy is just equation (1) with y replaced by 9: 


LII = ur(x)ó(x) + f(x). (6) 


We substitute the series (4) into the differential equation (6) and attempt to determine b, so that 
the differential equation is satisfied. The term on the left-hand side of equation (6) becomes 


L[6Yx) = L |X bnon |) = M br Llena) = So bn Ant (x) bala), (7) 


n=1 n=1 n=1 


where we have assumed that we can interchange the operations of summation and 
differentiation. 

Note that the function r appears in equation (7) and also in the term pur(x)@(x) in 
equation (6). This suggests that we rewrite the nonhomogeneous term in equation (6) as 
r(x)Cf Go) / r(x)) so that r(x) also appears as a multiplier in this term. If the function f/r 
satisfies the conditions of Theorem 11.2.4, then 


fO) x 
"S d (8) 
where, using equation (5) with ¢ replaced by f/r, 
1 1 
ae] roo e coax f OCDE: scd uus ©) 
0 r(x) 0 


Upon substituting for o (x), L[ó ]( x), and f(x) in equation (6) from equations (4), (7), and 
(8), respectively, we find that 


oo oo [oe] 
XO bnAnt (x) bnl) = ur(x) X bnbn(X) +r(x) Sia: 
n=1 n=1 n=1 
After collecting terms and canceling the common nonzero factor r(x), we have 
oo 
Y XO = ib. — en) Onl x) = 0. (10) 
n=1 


If equation (10) is to hold for each x in the interval 0 < x < 1, then the coefficient of ,(x) 
must be zero for each n; see Problem 14 for a proof of this fact. Hence 


(A,—p)b,—c,—-50, n=1,2,.... (11) 


We must now distinguish two main cases, one of which also has two subcases. 


Case |: First suppose that jj 4 A, for n = 1,2,3, ... ; that is, jj is not equal to any 
eigenvalue of the corresponding homogeneous problem. Then 
Boe aeque uu (12) 
Àn = 
and 
tx 8 
y=% =)> gr. (13) 


n=1 


Equation (13), with c, given by equation (9), is a formal solution of the nonhomogeneous 
boundary value problem (1), (2). Our argument does not prove that the series (13) converges. 
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However, any solution of the boundary value problem (1), (2) clearly satisfies the conditions of 
Theorem 11.2.4; indeed, it satisfies the more stringent conditions given in the second paragraph 
of that theorem. Thus it is reasonable to expect that the series (13) does converge at each point, 
and this fact can be established, provided, for example, that f is continuous. 


Case Il: The second main case is when jz is equal to one of the eigenvalues of the 
corresponding homogeneous problem, say, jj = Am; then the situation is quite different. In this 
event, forn = m, equation (11) has the form 0-5, —c,, = 0. Again we must consider two cases. 


Casella: If jg = A, and c, + 0, then there is no value of b,, that satisfies equation (11), 
and therefore the nonhomogeneous problem (1), (2) has no solution. 


Case llb: If u = A, and c, = 0, then equation (11) is satisfied regardless of the value 
of bm; in other words, bm remains arbitrary. In this case the boundary value problem (1), (2) 
does have a solution, but it is not unique, since it can include an arbitrary multiple of the 
eigenfunction 4. 

Since Cm is given by equation (9), the condition cm = 0 means that 


1 
n f(x) bm(x)dx = 0. (14) 


Thus, if j| = Am, the nonhomogeneous boundary value problem (1), (2) can be solved only if 
f is orthogonal to the eigenfunction corresponding to the eigenvalue Am. 
The results we have formally obtained are summarized in the following theorem. 


Theorem 11.3.1 


The nonhomogeneous boundary value problem (1), (2) has a unique solution for each continuous f 
whenever p is different from all the eigenvalues of the corresponding homogeneous problem; the 
solution is given by equation (13), and the series converges for each x in 0 < x < 1. 

If u is equal to an eigenvalue A, of the corresponding homogeneous problem, then the 
nonhomogeneous boundary value problem has no solution unless condition (14) holds; that is, unless 
f is orthogonal to m. In that case, the solution is not unique and contains an arbitrary multiple of 


Pm(X). 


The main part of Theorem 11.3.1 is sometimes stated in the following way: 


Theorem 11.3.2 


For a given value of u, either the nonhomogeneous problem (1), (2) has a unique solution for each 
continuous f (if p is not equal to any eigenvalue Àm of the corresponding homogeneous problem), 
or else the homogeneous problem (3), (2) has a nontrivial solution (the eigenfunction corresponding 
to Àj). 


This latter form of the theorem is known as the Fredholm’ alternative theorem. This 
is one of the basic theorems of mathematical analysis and occurs in many different contexts. 
You may be familiar with it in connection with sets of linear algebraic equations where the 
vanishing or nonvanishing of the determinant of coefficients replaces the statements about ji 
and Am. See the discussion in Section 7.3. 


EXAMPLE 1 


Solve the boundary value problem 


y"+2y=—-x, y(0)=0, y(D-y(D)-0. 


7The Swedish mathematician Erik Ivar Fredholm (1866-1927), professor at the University of Stockholm, established 
the modern theory of integral equations in a fundamental paper in 1903. Fredholm’s work emphasized the similarities 
between integral equations and systems of linear algebraic equations. There are also many interrelations between 
differential and integral equations; for example, see Section 2.8 and Problem 22 of Section 6.6. 
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Solution: 
This particular problem can be solved directly in an elementary way and has the solution 
sin ( /2 x) " 


y= . (15) 


sin \/2 + v/2.cos \/2 2 


The method of solution described below illustrates the use of eigenfunction expansions, a method 
that can be employed in many problems not accessible by elementary procedures. 
We begin by rewriting the differential equation as 


—y”’ =2y+x (16) 


so that it will have the same form as equation (1). We seek the solution of the given problem as a 
series of normalized eigenfunctions $, of the corresponding homogeneous problem 


y"+Ay=0, y(0) =0, yD) +y) =0. (17) 


These eigenfunctions were found in Example 2 of Section 11.2 and are 


dubi) es ks sin( nx), (18) 


where 


> 1/2 
kn = (=) (19) 
locos A, 


sin VAn + VAn cos V/A, — 0. (20) 
Recall that in Example 1 of Section 11.1, we found that 


Ai Š 4.11586, A> & 24.13934, Az 63.65911, Aq = 122.88916, 


2 — 1)? 2 
aa ODT, for n = 5,6, .... 


and the eigenvalue A, satisfies 


We assume that y is given by equation (4) 


y= 3 bbs), 
n-l 


and it follows that the coefficients b, are found from equation (12) 
Cn 


bn = > 
An —2 


where the c, are the expansion coefficients of the nonhomogeneous term f(x) = x in equation (16) 
in terms of the eigenfunctions @,,. These coefficients were found in Example 3 of Section 11.2 and 
are 


24/2sin VAn 
An(l+ cos? V Xn) 1 


Putting everything together, we finally obtain the solution 


(21) 


Cn 


oo 


y-4M sin VAn sin( VAnx). (22) 


AnCAn — 2)(1 + cos? VAn) 


n=1 


Although equations (15) and (22) are quite different in appearance, they are actually two 
different expressions for the same function. This follows from the uniqueness part of Theorem 11.3.1 
or 11.3.2, since À = 2 is not an eigenvalue of the homogeneous problem (17). Another alternative 
to show the equivalence of equations (15) and (22) is to expand the right-hand side of equation (15) 
in terms of the eigenfunctions @,(x). 

The series (22) converges rapidly to the exact solution. Figure 11.3.1(a) shows the exact solution 
and the first term of series (22). (With two or more terms of the series the two solution curves are 
indistinguishable.) Figures 11.3(b) and (c) show the difference between the exact solution (15) and 
the first three and fifteen terms of the series solution (22). 
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For this problem, it is fairly obvious that equation (15) is a more convenient expression for the 
solution than equation (22). However, we emphasize again that in other problems we may not be 
able to obtain the solution except by series (or numerical approximation) methods. 


x Exact 
solution (15) 


First term of 
series solution (22) 


(a) 


(b) (c) 


Graphical comparison of the exact solution (15) and partial sums of the series (22). 
(a) The exact solution (red) and the first term of the series solution (blue). The difference between the 
exact solution and the first three (5) and fifteen (c) terms of the series solution. 


Nonhomogeneous Heat Conduction Problems. To show how eigenfunction expansions 
can be used to solve nonhomogeneous problems for partial differential equations, let us 
consider the generalized heat conduction equation 


r(x)u, = (p(x)ux)x — q(x)u + F(x, t) Q3) 


with the boundary conditions 
ux(0, 1) — hiu(0, t) =0, ux(1,t) - hou(1,1) 20, 120 Q4) 
and the initial condition 
u(x,0) = f(x), O< x« I. (25) 
This problem was previously discussed in Appendix A of Chapter 10 and in Section 11.1. In 
the latter section we let u(x, t) = X(x)T (t) in the homogeneous equation obtained by setting 
F(x, t) = 0 and showed that X(x) must be a solution of the boundary value problem 
—( p(x) X^)! + q(x) X = Ar(x) X, (26) 
X'(0) —hyX(0) 20, X'(l) +42 X(1) = 0. (27) 
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If we assume that p, q, and r satisfy the proper continuity requirements and that p(x) and 
r(x) are always positive, the problem (26), (27) is a Sturm-Liouville problem as discussed in 
Section 11.2. Thus we obtain a sequence of eigenvalues A1 < Ao < + < A, < +++ and 
the corresponding normalized eigenfunctions $1(x), Qa(x), .... ó4(x), .... 

We will solve the given nonhomogeneous boundary value problem (23), (24), and (25) 
by assuming that u(x, t) can be expressed as a series of eigenfunctions 


u(x,t) =X bó) (28) 


n=1 


and then showing how to determine the coefficients b,(t). The procedure is basically the 
same as that used in the nonhomogeneous Sturm-Liouville problem (1), (2) that we considered 
earlier, although it is more complicated in certain respects. For instance, the coefficients b, 
must now depend on f, because otherwise u would be a function of x only. Note that the 
boundary conditions (24) are automatically satisfied by an expression of the form (28) because 
each (x) satisfies the boundary conditions (27). 

Next we substitute from equation (28) for u in equation (23). From the first two terms on 
the right-hand side of equation (23), we formally obtain 


fa) oo oo 
(P(x) = qGOu = F— | PC) D7 baw | 7460 Y bul 0.0 


n-l n=1 
= Y^ 50 (PPD 7 4G) 64()). (29) 
m 

Since (p(x) $/,G)) — q(x) On(X) = —Agar(x) ó4(x), we obtain finally 

(Gu), — qG)u = —r(x) S ossi (30) 
"m 
Now consider the term on the left-hand side of equation (23). We have 
r(x)u, — ray Y bal bax) = r(x) Y (bax). GI) 
n=1 n=1 


We must also express the nonhomogeneous term in equation (23) as a series of 
eigenfunctions. Once again, it is convenient to look at the ratio F(x, t)/r(x) and to write 


F(x, t) » 
= Yn(t) onl), (32) 
r(x) n=1 
where the coefficients are given by 
: F(x,t) 1 
Ynt) = r(x) es Pn(x)dx x F(x, t)@n(x)dx, n=1,2,.... (33) 
0 0 


Since F(x, t) is given, we can consider the functions y„(t) to be known. 
Gathering all these results together, we substitute from equations (30), (31), and (32) in 
equation (23) and find that 


r(x) M Bt) dn(x) = —r(x) M dat) AnD Hr) 3o a aa). GA 


n-l n=1 n=1 
To simplify equation (34), we cancel the common nonzero factor r(x) from all terms and write 
everything in one summation: 


ob.) + Anbalt) — W(t) bax) = 0. (35) 


n=1 
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Once again, if equation (35) is to hold for all x in O < x < 1, it is necessary for the quantity 
in square brackets to be zero for each n (again, see Problem 14). Hence b,(t) is a solution of 
the first-order linear ordinary differential equation 


bi (t) + Anbr(t) = Falt), n = 1, 2; e...’ (36) 


where y,,(t) is given by equation (33). 
To determine b,,(t) completely, we must have an initial condition 


b,(0) —B, n=1,2,... (37) 


for equation (36). This we obtain from the initial condition (25). Setting t = 0 in equation (28) 
and using equation (25), we have 


oo oo 
u(x,0) = M ba) pax) = M, B.ós(x) = f(x). (38) 
n=1 n=l 
Thus, for each n, the initial value B, is the coefficient of ¢„(x) in the (weighted) eigenfunction 
expansion for f(x), that is, 


1 
B, =l FCIS (arda; n=1,2,.... (39) 
0 


Note that everything on the right-hand side of equation (39) is known, so we can consider B, 
as known. 

The initial value problem (36), (37) is solved by the methods of Section 2.1. The 
integrating factor is u(t) = exp(A,1), and it follows that 


t 
ba(t) = Bye ^"! «f e U79.(s)ds, n=1,2,.... (40) 
0 


The details of this calculation are left to the reader. Note that the first term on the right-hand 
side of equation (40) depends on the function f through the coefficients B,, while the second 
depends on the nonhomogeneous term F through the coefficients ^, (5). 

Thus an explicit solution of the boundary value problem (23), (24), and (25) is given by 
equation (28): 


oo 
u(x, t) = 5 br(t)on(x), 
n=1 
where the coefficients b,(t) are determined from equation (40). The quantities B, and y,(s) 
in equation (40) are found, in turn, from equations (39) and (33), respectively. 
Summarizing, to use this method to solve a boundary value problem such as that given 
by equations (23), (24), and (25), we must 


1. Find the eigenvalues A, and the normalized eigenfunctions ¢, of the homogeneous 
problem (26), (27). 


2. Calculate the coefficients B, and y,(t) from equations (39) and (33), respectively. 


w 


. Evaluate the integral in equation (40) to determine an explicit expression for b„(t). 
4. Sum the infinite series (28). 
Since any or all of these steps may be difficult, the entire process can be quite formidable. 
One redeeming feature is that, as was seen in Example 1, often the series (28) converges 


rapidly, in which case only very few terms may be needed to obtain an adequate approximation 
to the solution. 


EXAMPLE 2 


Find the solution of the heat conduction problem 


Ut = Uyy FXE, <x <1, £530, (41) 


u(0,t) 20, u,(Lt)-cu(lt)-—-0, t0, (42) 
u(x,0) = 0, Ü« xx T (43) 
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Solution: 


Again, we use the eigenvalues A, and the normalized eigenfunctions ¢, of the problem (17) and 
assume that u is given by equation (28) 


oo 
u(x,t) = M b) ósQ). 
n-l 
The coefficients b, are determined from the differential equation 
bi, + Anbn = Yalt (44) 


where y(t) is the coefficient of the n" term in the eigenfunction expansion of the nonhomogeneous 
term xe™. Thus we have 


1 1 
Y(t) = n xe bn(x)dx = e] xbn(x)dx = cne“, (45) 
0 0 


1 
where c, = n X@n(x)dx is given by equation (21). The initial condition for equation (44) is 
0 


b,(0) 20 (46) 
since the initial temperature distribution (43) is zero everywhere. The solution of the initial value 
problem (44), (46) is 
gOn-Dt 1 

An — 1 


t 
b,(t) = as e"c,e ds = c,e t 
0 


e™ — egt 
2 —— ——. 47 
Ped (47) 


Thus the solution of the heat conduction problem (41) to (43) is given by 


uin) m (sin Xn) (e7* A ew!) sin( Ax) 
n-l An(An — 1) (1 + cos? Vn) 


(48) 


The solution given by equation (48) is exact but complicated. To judge whether a satisfactory 
approximation to the solution can be obtained by using only a few terms in this series, we must 
estimate its speed of convergence. First we split the right-hand side of equation (48) into two 
parts: 


is sin /À, sin( VAn x Se sin VÀ, sin( Ax 
u(x,t) = 4e 5 ( ( ) (49) 
vb a= »(1 + cos? Xn) ud tae 1) (1+ cos” X.) 


Recall from Example 1 in Section 11.1 that the eigenvalues A, are very nearly proportional to n?. In 
the first series on the right-hand side of equation (49), the trigonometric factors are all bounded as 
oo 


oo 
: , ae ; -2 ax 
n — oo; thus this series converges similarly to the series » An Or 5 n^. Hence at most two or 
n=l n=1 
three terms are required for us to obtain an excellent approximation to this part of the solution. The 
second series contains the additional factor e~*”’, so its convergence is even more rapid for t > 0; 
all terms after the first are almost surely negligible. 


Further Discussion. Eigenfunction expansions can be used to solve a much greater variety 
of problems than the preceding discussion and examples may suggest. For example, time- 
independent nonhomogeneous boundary conditions can be handled much as in Section 10.6. 
To reduce the problem to one with homogeneous boundary conditions, subtract from u 
a function v that is chosen to satisfy the given boundary conditions. Then the difference 
w = u — v satisfies a problem with homogeneous boundary conditions, but with a modified 
forcing term and initial condition. This problem can be solved by the procedure described in 
this section. 

One potential difficulty in using eigenfunction expansions is that the normalized 
eigenfunctions of the corresponding homogeneous problem must be found. For a differential 
equation with variable coefficients, this may be difficult, if not impossible. In such a case it is 
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sometimes possible to use other functions, such as eigenfunctions of a simpler problem, that 
satisfy the same boundary conditions. For instance, if the boundary conditions are 


u(0,1) 20, u(1,t) =0, (50) 


then it may be convenient to replace the functions ¢,(x) in equation (28) by sin(n7 x). 
These functions at least satisfy the correct boundary conditions, although, in general, they 
are not solutions of the corresponding homogeneous differential equation. Next we expand 
the nonhomogeneous term F(x, t) in a series of the form (32), again with ¢,(x) replaced 
by sin(n7 x), and then substitute for both u and F in equation (23). Upon collecting the 
coefficients of sin(n7 x) for each n, we have an infinite set of first-order linear differential 
equations from which to determine b(t), b(t), ... . The essential difference between 
this case and the one considered earlier is that now the equations for the functions b,„(t) 
are coupled. Thus they cannot be solved one by one, as before, but must be dealt with 
simultaneously. In practice, the infinite system is replaced by an approximating finite system, 
from which approximations to a finite number of coefficients are calculated. 

Boundary value problems for equations of higher than second-order can also often be 
solved by eigenfunction expansions. In some cases the procedure parallels almost exactly that 
for second-order problems. However, a variety of complications can also arise. 

Finally, we emphasize that the discussion in this section has been purely formal. Separate 
and sometimes elaborate arguments must be used to establish convergence of eigenfunction 
expansions or to justify some of the steps used, such as term-by-term differentiation of 
eigenfunction series. 

There are also other, altogether different methods for solving nonhomogeneous boundary 
value problems. One of these leads to a solution expressed as a definite integral rather than as 
an infinite series. This approach involves certain functions known as Green's functions and, 
for ordinary differential equations, is the subject of Problems 28 through 36. 


Problems 


In each of Problems 1 through 5, solve the given problem by means 14. Leti, 6», ...,@n, ... be the normalized eigenfunctions of 

of an eigenfunction expansion. the differential equation (3) subject to the boundary conditions (2). If 
oo 

1l. y"c2y-—-x, y(0)=0, y(D-0 XO cnbn(x) converges to f(x), where f(x) = 0 for all values of 


2. y" +2y=-x, y(0) =0, y 1) =0; n-l 
see Section 11.2, Problem 7. 


x inQ < x < 1, show that c, = 0 for each n. 
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Hint: Multiply by r(x)@(%), integrate, and use the orthogonality 


3. y"+2y=-x, y(0 20, y(1)=0; 


roperty of the eigenfunctions. 
see Section 11.2, Problem 3. pre ii 


15. Let L be a second-order linear differential operator. Show that 


4. y"-2yc-x, y(0 20, y(1)-4»y(1)20; 
see Section 11.2, Problem 11. 


5. y"+2y=—-14+ |1—2x|, y(0)=0, y(1)=0 


the solution y = o (x) of the problem 
Lily] = f(x), o1x(0) + a2y(0) =a, 


B1y(1) + B2y' (1) =b 


In each of Problems 6 through 9, determine a formal eigenfunction can be written as y = u + v, where u = (x) and v = (x) are 


series expansion for the solution of the given problem. Assume that f solutions of the problems 


satisfies the conditions of Theorem 11.3.1. State the values of ju for Lic. œ;u(0) + azu’(0) =a, 
which the solution exists. 

6. y"+py=—f(x), y(0)=0, y(1)=0 and 

7. y"+py=-f(x), y(0)=0, y(1) =0 L[v] 2 f(x), a1v(0) + a2v'(0) = 0, 


8. y"+uy=—f@), yO) =0, y(D0-0 


respectively. 
9. y"+py=—f(x), y(0)=0, y(D-c-»x10)20 


16. Show that the problem 
In each of Problems 10 through 13, determine whether there is any 


E P s n P a 
value of the constant a for which the problem has a solution. Find the y HT YST X, 
solution for each such value. 


10. y’+n?y=at+x, y(0)=0, y()20 
11. y"+4r?2y=a+x, y(0)=0, y(1) =0 
12. y"+a*y=a, y(0)=0, y(1)=0 


has the solution 


y-c,sin(-x) +cos(7x) +x. 


Biu(1) + Bou (1) =b 


Byv(1) + B5v'(1) = 0, 


Also show that this solution cannot be obtained by splitting the 
z problem as suggested in Problem 15, since neither of the two 
13. y’+7*y=a-—cos(rx), y(0)=0, y(1)=0 subsidiary problems can be solved in this case. 
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17. Consider the problem 


y" + p(x)y'+q(x)y=0, y(0) =a, yl) =b. 

Let y = u + v, where v is any twice-differentiable function satisfying 
the boundary conditions (but not necessarily the differential equation). 
Show that u is a solution of the problem 

u^ + p(x)u t q(x)u = g(x), u(0) —0, u(1) —0, 

where g(x) = —(v" + p(x)v' + q(x)v) and is known once v is 
chosen. Thus nonhomogeneities can be transferred from the boundary 
conditions to the differential equation. Find a function v for this 


problem. 


18. Using the method of Problem 17, transform the problem 


y'-2y22-4x, y(0)=1, y(1)+y(1) = -2 
into a new problem in which the boundary conditions are 
homogeneous. Solve the latter problem by reference to Example 1 of 


the text. 


In each of Problems 19 through 22, use eigenfunction expansions 
to find for value problem. Note: Each differential equation must be 
satisfied 0 < x < 1,1 > 0; the pair of boundary conditions holds for 
t > 0; and the initial condition holds on 0 < x < 1. 


19. us:=uxx—x, u(0,t)=0, ux(l,t)=0, 
u(x, 0) = sin(7x/2); see Problem 2. 


20. uj —u-ce', u,(001) 20, uwu(Lt) --u(1 t) 20, 
u(x,0) = 1 — x; see Section 11.2, Problems 10 and 12. 


21. u-u;u-c-1-|1-2x|l u(0,t) 20, u(L t) =0, 
u(x, 0) — 0; see Problem 5. 
22. u,2u,,-e'(1—x) u(0,020, wu,(10-20, 


u(x, 0) — 0; see Section 11.2, Problems 6 and 7. 

23. Consider the boundary value problem 
r(x)u; = (p(x)ux)xs —q(x)u + F(x), O< x <1, t>0, 
u(0,t) = Ti, u(1,t) = T», 
u(x,0) = f(x), 


t > 0, 
GLT 


a. Let v(x) be a solution of the problem 


(p(x)v)'—q(x)v = —F(x), v(0 2T, vI) =. 
If w(x, t) = u(x,t) — v(x), find the boundary value problem 
satisfied by w. Note that this problem can be solved by the 
method of this section. 

b. Generalize the procedure of part (a) to the case where u 
satisfies the boundary conditions 
ux(0, t) —hqiu(0, 0) = Tj, ux(1,t) + hu(1, t) = To. 

In each of Problems 24 and 25, use the method indicated in Problem 

23 to solve the given boundary value problem. 


24. Uy = Ux, — 2, OL xe< 1, - £0, 
u(0,t) =1, u(1,t)=0, t>0, 
u(x,0) =x* —2x +2, O< x < 1; 

25. Us = uy, — T? cos(mx), Ücx-«cl t > 0, 
uut 2) =0, u(1,t) = T; t» 0, 
u(x,0) —cos(37x/2) —cos(sx), O<x< 1. 


26. The method of eigenfunction expansions is often useful for 
nonhomogeneous problems related to the wave equation or its 


generalizations. Consider the problem 

r(x)uy = (p(x)uy), —4(x)u + F(x,t), 0< x< Il, t» 0 (51) 
(52) 
(53) 


ux(0,t) — hqu(0, t) =0, ux(1,t) +hou(1,t) 20, t > 0, 
u(x,0) = f(x), u,(x,0) = g(x), O< x <1. 


This problem can arise in connection with generalizations of the 
telegraph equation (Problem 16 in Section 11.1) or the longitudinal 
vibrations of an elastic bar (Problem 25 in Section 11.1). 
a. Let u(x,t) = X(x)T(t) in the homogeneous equation 
corresponding to equation (51), and show that X(x) satisfies 
equations (26) and (27) of the text. Let A, and o, (x) denote the 
eigenvalues and normalized eigenfunctions of this problem. 


oo 
b. Assume that u(x,t) = » ba(t) $4 (x), and show that b, (t) 


n-l 
must satisfy the initial value problem 


b, (t) + Anbn(t) = Yn(t), b,(0) = Qn, b, (0) — Bn, 


where Qn, Bn, and y,(t) are the (weighted) coefficients of 
f(x), g(x), and F(x, t) / r(x), respectively, in terms of the 
eigenfunctions Q1(x), .... ó4(x), .... 


27. In this problem we explore a little further the analogy between 
Sturm-Liouville boundary value problems and Hermitian matrices. 


Let A be an n x n Hermitian matrix with eigenvalues Aj, ... , An 
and corresponding orthonormal eigenvectors £(U, ... , €. 
Consider the nonhomogeneous system of equations 

AX — ux = b, (54) 


where ji is a given real number and b is a given vector. We will point 
out a way of solving equation (54) that is analogous to the method 
presented in the text for solving equations (1) and (2). 


n 
a. Show that b = Y b;£® , where b; = (b, £?)). 
i=l 


n 
b. Assume that x = $^ a;€. Show that for equation (54) to be 
i=l 
satisfied, it is necessary that a; = b; /(Ai — ji). Thus 
n (i) 
(BE) eO 
i—-H 


x= 


(55) 


provided that u is not one of the eigenvalues of A, that is, jj + A; 
fori =1,...,n. 
c. Compare this result with equation (13). 


Green's? Functions. Consider the nonhomogeneous system of 
algebraic equations 


Ax — ux = b, (56) 


where A is ann x n Hermitian matrix, pu is a given real number, and 
b is a given vector. Instead of using an eigenvector expansion as in 
Problem 27, we can solve equation (56) by computing the inverse 
matrix (A — I) ~!, which exists if jz is not an eigenvalue of A. Then 


x =(A—pD~'b. (57) 


8Green’s functions are named after George Green (1793-1841) of England. 
He was almost entirely self-taught in mathematics and made significant 
contributions to electricity and magnetism, fluid mechanics, and partial 
differential equations. His most important work was an essay on electricity 
and magnetism that was published privately in 1828. In this paper Green was 
the first to recognize the importance of potential functions. He introduced the 
functions now known as Green's functions as a means of solving boundary 
value problems and developed the integral transformation theorems, of which 
Green's theorem in the plane is a particular case. However, these results did 
not become widely known until Green's essay was republished in the 1850s 
through the efforts of William Thomson (Lord Kelvin). 


Problems 28 through 36 indicate a way of solving nonhomogeneous 
boundary value problems that is analogous to using the inverse matrix 
for a system of linear algebraic equations. The Green’s function plays 
a part similar to the inverse of the matrix of coefficients. This method 
leads to solutions expressed as definite integrals rather than as infinite 
series. Except in Problem 35, we will assume that jj = 0 for simplicity. 


28. a. Show by the method of variation of parameters that the 
general solution of the differential equation 


=y" = f(x) 


can be written in the form 


x 
y= (x) = c1 + cx — i (x — s) f(s)ds, 
0 
where c; and c are arbitrary constants. 
b. Let y = ó(x) also be required to satisfy the boundary 
conditions y(0) = 0, y(1) = 0. Show that in this case 


1 
ép =0, a= [ a- fous 
0 


C. Show that, under the conditions of parts a and b, o (x) can be 
written in the form 


x 1 
so f a-xfodi s | x(1— s) f(s)ds. 
0 x 


d. Defining 


show that the solution can be written as 


1 
on = f G(x, s) f(s)ds. 
0 


The function G(x,s) appearing under the integral sign is a 
Green's function. The usefulness of a Green's function solution 
rests on the fact that the Green's function is independent of the 
nonhomogeneous term in the differential equation. Thus, once 
the Green's function is determined, the solution of the boundary 
value problem for any nonhomogeneous term f(x) is obtained 
by a single integration. Note further that no determination of 
arbitrary constants is required, since ó(x) as given by the 
Green's function integral formula automatically satisfies the 
boundary conditions. 


29. By a procedure similar to that in Problem 28, show that the 
solution of the boundary value problem 


=" +y) = f(x), y0) =0, y) =0 


is 


1 
y= 9(x) = G(x, s) f(s)ds, 
0 


where 
sins sin(1 — x) 
— ~ Osx, 
sin 1 
G(x,s) = 
sinx sin( 1 — s) 
a <5 K< 1. 


sinl 
30. It is possible to show that the Sturm-Liouville problem 


Lily] = —(p(x)y')’ +(x) y = f(x), (58) 
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o1X(0) +a2y(0) 20, Bry) +foy'()=0 — (59) 


has a Green's function solution 
1 
y= (x) zi G(x, s) f(s)ds, (60) 
0 


provided that À = 0 is not an eigenvalue of L[y] = Ay subject to the 
boundary conditions (59). Further, G(x, s) is given by 


yis) y2(x) — 
E PWs yl C 07 
ME iG 3G) pa 


s #555 
P(x) W[yi, y2](x) 
where y; is a solution of L[y] = 0 satisfying the boundary condition at 


x = 0, y; is a solution of L[y] = 0 satisfying the boundary condition 
at x = 1, and W[y;, y2] is the Wronskian of y; and yp. 


a. Verify that the Green’s function obtained in Problem 28 is 
given by formula (61). 

b. Verify that the Green’s function obtained in Problem 29 is 
given by formula (61). 

€. Show that p( x) W[yi, y2](x) is a constant by showing that 
its derivative is zero. 

d. Using equation (61) and the result of part c, show that 
G(x,s) — G(s, x). 

e. Verify that y = ¢(x) from equation (60) with G(x, s) given 
by equation (61) satisfies the differential equation (58) and the 
boundary conditions (59). 


In each of Problems 31 through 34, solve the given boundary 
value problem by determining the appropriate Green's function and 
expressing the solution as a definite integral. Use equations (58) to 
(61) of Problem 30. 


31. —y"'— f(x), y(0 =0, y(D =0 

32. —y" = f(x), (0) =0, xX) - y (1) 20 
33. —(y"+y) = f(x), y(0)=0, y(1)=0 
34. —y"'— f(x) y(0) =0, y(D =0 


35. Consider the boundary value problem 
Liy] = -(pG)»' + qG)y = pr(x)y + f(x), (62) 


01X(0) +azy(0) =0, Biy) - 82y (1) =0. (63) 


In this section the solution y = ¢ (x) of the Sturm-Liouville boundary 
value problem (62), (63) is given by equation (13), where c, is 
defined by equation (9), provided that jz is not an eigenvalue of the 
corresponding homogeneous problem. In this case it can also be shown 
that the solution is given by a Green's function integral of the form 


1 
y= (x) =j G(x, s, u) f(s)ds. (64) 
0 


Note that in this problem the Green’s function also depends on the 
parameter ju. 
a. Show that if these two expressions for ¢(x) are to be 
equivalent, then 


G(x, s, u) = 5 pi) P:s) (65) 


where A; and ¢; are the eigenvalues and eigenfunctions, 
respectively, of equations (3), (2) of this section. Again, we see 
from equation (65) that u cannot be equal to any eigenvalue A;. 
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b. Derive equation (65) directly by assuming that G(x, s, 11) 
has the eigenfunction expansion 

oo 

GG, s, i) = X a(x, m Gis). (66) 

i-l 
Determine a;(x, 4) by multiplying equation (66) by r(5) ¢ ;(s) 
and integrating with respect to s from s = 0 to s = 1. 
Hint: Show first that A; and œ; satisfy the equation 


1 
$;(x) =O; - f GG. s, n)r(s) ói(s)ds. (67) 
0 


36. Consider the boundary value problem 


d?y 
——-6(s-x)» y(0 20 y(1) =0, 
ds 


where s is the independent variable, s — x is a definite point in the 
interval 0 < s < 1, and ô is the Dirac delta function (see Section 6.5). 
Show that the solution of this problem is the Green's function G(x, s) 
obtained in Problem 28. 

In solving the given problem, note that 6(s—x) = Oin the intervals 
0x s « x and x < s < 1. Note further that —dy/ds experiences a 
jump of magnitude 1 as s passes through the value x. 

This problem illustrates a general property, namely, that the 
Green's function G(x, s) can be identified as the response at the point 
s to a unit impulse at the point x. A more general nonhomogeneous 
term f on 0 < x < 1 can be regarded as a continuous 
distribution of impulses with magnitude f(x) at the point x. The 
solution of a nonhomogeneous boundary value problem in terms of 
a Green's function integral can then be interpreted as the result of 
superposing the responses to the set of impulses represented by the 
nonhomogeneous term f(x). 


11.4 Singular Sturm-Liouville Problems 


In the preceding sections of this chapter, we considered Sturm-Liouville boundary value 
problems: the differential equation 


L[y] = — 


(G)y)' +4(x)y =Ar(x)y, 0<x< l, (1) 


together with boundary conditions of the form 


a1y(0) -- a2y'(0) 20, Biy) + B2y'(1) = O. Q) 


Until now, we have always assumed that the problem is regular. That is, we have assumed that 
p is differentiable, that p', q, and r are continuous, and that p(x) > 0 and r(x) > O at all 
points in the closed interval 0 < x < 1. However, there are also equations of physical interest 
in which some of these conditions are not satisfied. 
For example, suppose that we wish to study Bessel's equation of order v on the interval 
0 « x « 1. This equation is sometimes written in the form? 
2 
-ay + Ly = Axy 6) 


so that p(x) = x, q(x) = vx and r(x) = x. Thus p(0) = 0,r(0) = 0, and g(x) is 
unbounded and hence discontinuous as x — 0. However, the conditions imposed on regular 
Sturm-Liouville problems are met elsewhere in the interval. 

Similarly, for Legendre's equation we have 


-((1-3)y) 2v, -]«x«l, (4) 


where A = a(a+1), p(x) «1 —x?, q(x) = 0, and r(x) = 1. Here the required conditions 
on p, q, and r are satisfied in the interval 0 < x < 1 except that p(1) = 0. 

We use the term singular Sturm-Liouville problem to refer to a certain class of boundary 
value problems for the differential equation (1) in which the functions p, q, and r satisfy the 
conditions stated earlier on the open interval 0 < x < 1, but at least one of these functions 
fails to satisfy them at one or both of the boundary points. We also prescribe suitable separated 
boundary conditions of a kind to be described in more detail later in this section. Singular 
problems also occur if the interval is unbounded, for example, O < x < oo. We do not 
consider this latter kind of singular problem in this book. 


11.4 Singular Sturm-Liouville Problems 


As an example of a singular problem on a finite interval, consider the equation 
xy" + y' + Axy =0, 6) 
or 
=y’) = Axy, (6) 


on the interval 0 < x < 1, and suppose that A > 0. This equation arises in the study of 
free vibrations of a circular elastic membrane and is discussed further in Section 11.5. If we 
introduce the new independent variable t defined by t = JX x, then 


dy dy d’y d*y 
— —XA : —A . 
dx vA dt? dx? dr? 


Hence equation (5) becomes 


or, if we cancel the common factor JX in each term, 


dy dy 
P EP tty = 0. (7) 


Equation (7) is Bessel's equation of order zero (see Section 5.7). The general solution of 
equation (7) for t > Ois 


y = c1 Jolt) + c2Yo(t); 
hence the general solution of equation (6) for x > 0 is 


y = eM ( Vx) tox (VAx). (8) 


where Jo and Y, denote the Bessel functions of the first and second kinds of order zero. From 
equations (6) and (12) of Section 5.7, we have 


qmm .,.2m 
Jo( Vrs) =14 Mone , x>0, (9) 


2 Ax (—1 "UH, AU x 2m 
YAVA x) = m ( +m =)) JVA x + 5 i 0x 2 0, 
m=1 ` 


(10) 
where H, = 1 + 1/2+ --- + 1/m andy =  lim(H, — Inm). The graphs of 


m—>> oo 


[y = Jo(x)] and y = Yo(x) are given in Figure 5.7.2. 
Suppose that we seek a solution of equation (6) that also satisfies the boundary conditions 


y(0) =0, y(1) =0, (11) 


which are typical of those we have met in other problems in this chapter. Since Jo(0) = 
and Yo(x) — -—oo as x — 0, the condition y(0) = O can be satisfied only by choosing 
€, = c? = Q in equation (8). Thus the boundary value problem (6), (11) has only the trivial 
solution. 

One interpretation of this result is that the boundary condition y(0) = 0 is too restrictive 
for the differential equation (6). This illustrates the general situation, namely, that at a singular 
boundary point it is necessary to consider a modified type of boundary condition. In the present 
problem, suppose that we require only that the solution (8) and its derivative remain bounded. 
In other words, we take as the boundary condition at x — 0 the requirement 


y and y'remain bounded as x — O°. (12) 
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This condition can be satisfied by choosing c; = 0 in equation (8), so as to eliminate the 
unbounded solution Yo. The second boundary condition, y(1) = 0, then yields 


JVA) = 0. (13) 


It is possible to show? that equation (13) has an infinite set of discrete positive roots, which 
yield the eigenvalues 0 < Ay < Ao < + < A, < --- of the given problem. The 
corresponding eigenfunctions 


bn(x) = Jo(Vrnx), (14) 


are determined only up to a multiplicative constant. 

The boundary value problem with differential equation (6) and boundary conditions 
y(1) = 0 and (12) is an example of a singular Sturm-Liouville problem. This example 
illustrates that if the boundary conditions are relaxed in an appropriate way, then a singular 
Sturm-Liouville problem may have an infinite sequence of eigenvalues and eigenfunctions, 
just as a regular Sturm-Liouville problem does. 

Because of their importance in applications, it is worthwhile to investigate singular 
boundary value problems a little further. There are two main questions that are of concern: 


1. Precisely what type of boundary conditions can be allowed in a singular Sturm-Liouville 
problem? 

2. To what extent do the eigenvalues and eigenfunctions of a singular problem share the 
properties of eigenvalues and eigenfunctions of regular Sturm-Liouville problems? In 
particular, are the eigenvalues real, are the eigenfunctions orthogonal, and can a given 
function be expanded as a series of eigenfunctions? 


Both of these questions can be answered by a study of the identity 
1 
f (L[u]v — uL[v])dx = 0, (15) 
0 


which played an essential part in the development of the theory of regular Sturm-Liouville 
problems. We therefore investigate the conditions under which this relation holds for singular 
problems, where the integral in equation (15) may now have to be examined as an improper 
integral. To be definite, we consider the differential equation (1) and assume that x = 0 is a 
singular boundary point but that x = 1 is not. The boundary condition 3; y(1) + 85y'(1) = 0 
is imposed at the nonsingular boundary point x — 1, but we leave unspecified, for the moment, 
the boundary condition at x = 0. Indeed, our principal objective is to determine what kinds of 
boundary conditions are allowable at a singular boundary point if equation (15) is to hold. 
Since the boundary value problem under investigation is singular at x — 0, we choose a 


1 1 
positive number e and consider the integral J L[u]vdx, instead of I L[u]vdx, asin Section 
€ 0 


11.2. Afterward we let e approach zero. Assuming that u and v have at least two continuous 
derivatives on e < x < 1, and integrating twice by parts, we find that 


1 1 
/ (L[u]v — uL[v dx = —p(x)(u'(x) v(x) — uG)v'G))] . (16) 


The boundary term at x = | is again eliminated if both u and v satisfy the second boundary 
condition in (2), and thus 


1 
/ (L[uJv — uL[v dx = p(e)(u'(e)v(€) — u(e)v'(e)). (17) 
Taking the limit as e — 0 through positive values yields 
1 
D (L[u]v — uL[v dx = lim p(e)(u'(e)v(e) — u(e)v'(e)). (18) 
0 c0* 


10The function Jo is well tabulated; the roots of equation (13) can be found in tables such as those in Abramowitz and 
Stegun. You can also use a computer algebra system to compute them quickly. The first three roots of equation (13) 
are / À = 2.405, 5.520, and 8.654, respectively, to four significant figures; /À; = (n — 1/4)7 for large n. 
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Hence equation (15) holds if and only if, in addition to the assumptions stated previously, 


lim p(e)(u'(e)v(e) — u(e)v'(e)) = 0 (19) 
ce—0* 
for every pair of functions u and v in the class under consideration. When x = 0 is a 


singular boundary point for L[y] — Ar(x)y, equation (19) is the criterion that determines 
what boundary conditions are allowable at x — 0. A similar condition applies at x — 1 if that 
boundary point is singular, namely, 


lim p(e)(u'(e)v(e) — u(e)v'(e)) = 0. (20) 


e>1- 


In summary, as in Section 11.2, a singular boundary value problem for equation (1) is said 
to be self-adjoint if equation (15) is valid, possibly as an improper integral, for each pair of 
functions u and v with the following properties: they are twice continuously differentiable on 
the open interval 0 < x < 1, they satisfy a boundary condition of the form (2) at each regular 
boundary point, and they satisfy a boundary condition sufficient to ensure equation (19) if 
x = Ois a singular boundary point, or equation (20) if x = 1 is a singular boundary point. 
If at least one boundary point is singular, then the differential equation (1), together with two 
boundary conditions of the type just described, are said to form a singular Sturm-Liouville 
problem. 


EXAMPLE 1 


Show that the singular Sturm-Liouville boundary value problem consisting of the differential 
equation 


xy" +y'+Axy =0 
with boundary conditions that both y and y’ remain bounded as x approaches 0 from the right and 


that 3; y(1) + 85y'(1) = 0 is self-adjoint. 


Solution: 


From the differential equation, we see that p(x) = x. If both u and v and their first derivatives are 
bounded as x — 07, it is clear that equation (19) will hold. Hence this singular boundary value 
problem, with any boundary condition of the form 3; y(1) + 85y'(1) = 0 at x = 1, is self-adjoint. 


The most striking difference between regular and singular Sturm-Liouville problems is 
that in a singular problem the eigenvalues may not be discrete. That is, the problem may have 
nontrivial solutions for every value of A, or for every value of À in some interval. In such a case 
the problem is said to have a continuous spectrum. It may happen that a singular problem 
has a mixture of discrete eigenvalues and also a continuous spectrum. When a problem has a 
continuous spectrum, the corresponding sets of eigenfunctions are therefore not denumerable, 
and series expansions of the type described in Theorem 11.2.4 do not exist; the infinite sum is 
replaced by an appropriate integral representation. 

Finally, it is possible that only a discrete set of eigenvalues exists, just as in the regular 
case discussed in Section 11.2. For example, this is true of the problem consisting of equations 
(6), y( 1) = 0, and (12). In general, it may be difficult to determine which case actually occurs 
in a given problem. 

A systematic discussion of singular Sturm-Liouville problems is quite sophisticated!! 
indeed, requiring a substantial extension of the methods presented in this book. We restrict 
ourselves to some examples related to physical applications; in each of these examples it is 
known that there is an infinite set of discrete eigenvalues. 


11 See, for example, Chapter 5 of the book by Yosida listed in the References at the end of this chapter. 
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If a singular Sturm-Liouville problem does have only a discrete set of eigenvalues and 
eigenfunctions, then equation (15) can be used, just as in Section 11.2, to prove that the 
eigenvalues of such a problem are real and that the eigenfunctions are orthogonal with 
respect to the weight function r. The expansion of a given function in terms of a series of 
eigenfunctions then follows, as in Section 11.2. 

Such expansions are useful, as in the regular case, for solving nonhomogeneous boundary 
value problems. The procedure is very similar to that described in Section 11.3. Some examples 
for ordinary differential equations are indicated in Problems 1 to 4, and some problems for 
partial differential equations appear in Section 11.5. 


EXAMPLE 2 


The singular Sturm-Liouville boundary value problem 


—(xy) = àxy, O<x<1, 


y and y'boundedas x — 0, y(1) =0 


has eigenfunctions ġ (x) = Jo( / Asx). 
(a) Show that the ġ„ satisfy the orthogonality relation 


1 
n XP m(X) Pn(x)dx =0, mn (21) 
0 


with respect to the weight function r(x) = x. 
(b) Given a function f with f and f’ piecewise continuous on 0 < x < 1, find the coefficients cn 
such that 


oo 


fe) = an(s): (22) 


n=1 


Solution: 


(a) From the differential equation, we see that p(x) = x, q(x) = 0, and r(x) = 0. Thus the 
orthogonality of the eigenfunctions with weight r(x) = x is a direct consequence of Theorem 11.2.2. 
(b) While Theorem 11.2.4 applies only when the eigenfunctions are normalized, multiplying 
equation (22) by x Jo Cm x) and integrating term by term from x = 0 to x = 1, we obtain 


[ sies ( vs) Yrs [ sn (oes) Jo( Van) dx. (23) 


Because of the orthogonality condition (21), the right-hand side of equation (23) collapses to a single 
term; hence 


[ sies) 


Cm = f (24) 


1 
| xJa(v An x) dx 


0 


which determines the coefficients in the series (22). 


The convergence of the series (22) is established by an extension of Theorem 11.2.4 to 
cover this case. This theorem can also be shown to hold for other sets of Bessel functions, 
which are solutions of appropriate boundary value problems; for Legendre polynomials; and 
for solutions of a number of other singular Sturm-Liouville problems of considerable interest. 

It must be emphasized that the singular problems mentioned here are not necessarily 
typical. In general, singular boundary value problems are characterized by continuous spectra, 
rather than by discrete sets of eigenvalues. 


Problems 


1. Find a formal solution of the nonhomogeneous boundary value 
problem 


-(xy)' = uxy + f(x), 


y and y’ bounded as x > 0*, y(1) =0, 


where f is a given continuous function on 0 < x < 1, and p is not an 
eigenvalue of the corresponding homogeneous problem. 
Hint: Use a series expansion similar to those in Section 11.3. 


2. Consider the boundary value problem 


—(xy)' = Axy, 


y and y'boundedasx > 0*,  y'(1) — 0. 


a. Show that Ay = O is an eigenvalue of this problem 
corresponding to the eigenfunction $o(x) = 1. If A > 0, show 
formally that the eigenfunctions are given by the Bessel functions 
on(x) = Jo (VA, x). where 4/À, is the n' positive root (in 
increasing order) of the equation Jg (VX ) = 0. It is possible to 
show that there is an infinite sequence of such roots. 

Q b. Create a graph supporting the claim that Jj (Vr ) = 0 has 
an infinite sequence of positive roots. 

€. Show that if m, n —0,1,2, ... , then 


1 
ri XPm(X) ón(x)dx —0, m zn. 
0 
d. Find a formal solution to the nonhomogeneous problem 
-Qy) = uxy + f(x), 


y(1) — 0, 


where f is a given continuous function on 0 < x < 1, and p is 
not an eigenvalue of the corresponding homogeneous problem. 


y and y'bounded as x > 0*, 


3. Consider the problem 


n k? _ 
—(xy) + Pes Axy, 


y and y’ bounded as x — 07, y(1) =0, 


where k is a positive integer. 

a. Using the substitution f = VÀ x, show that the given 
differential equation reduces to Bessel’s equation of order k (see 
Problem 9 of Section 5.7). One solution is J; (1); a second linearly 
independent solution, denoted by Y;(t), is unbounded as t > 0. 
b. Show formally that the eigenvalues \;, A», ... of the given 
problem are the squares of the positive zeros of J; (VX ) and that 
the corresponding eigenfunctions are a(x) = Jk( An xh It is 
possible to show that there is an infinite sequence of such zeros. 
C. Show that the eigenfunctions ¢,,(x) satisfy the orthogonality 
relation 


1 
n XOm(X) n(x) dx =0, m zn. 
0 


d. Determine the coefficients in the formal series expansion 


oo 


F(x) 2 3 anpa). 


n-l 
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e. Finda formal solution of the nonhomogeneous problem 


pa ke 
—(ay') + ZY = Hay + FO), 
y(1) =0, 


where f is a given continuous function on 0 < x < 1, and p is 
not an eigenvalue of the corresponding homogeneous problem. 


y and y’ bounded as x — 0*, 


4. Consider Legendre's equation (see Problems 22 through 24 of 
Section 5.3) 


-(( 7 x)Y) 23» 
subject to the boundary conditions 
y(0) = 0, 


y and y’ remain bounded as x > 17. 


The eigenvalues for this problem are 4; = 2, 45 = 4-3, ..., 
An = 2n(2n — 1), ... and the eigenfunctions are the corresponding 
odd Legendre polynomials 


pil) = PiQ) =x, Q(x) = Py(x) = (5x3 — 3x) /2, 
e...’ n(x) = Py, (x), 


a. Show that 
1 
J Om(X) Gn(x)dx = 0, m zn. 
0 


b. Find a formal solution of the nonhomogeneous problem 
-((1- x3») = uy + f(a), 


y(0) =0, 


where f is a given continuous function on 0 < x < 1, and p is 
not an eigenvalue of the corresponding homogeneous problem. 


y and y’ bounded as x > 17, 


5. Theequation 
(1— 3?) y" 2 xy! c Ay 20 Q5) 


is Chebyshev's equation; see Problem 10 of Section 5.3. 
a. Show that equation (25) can be written in the form 


1 = 
-((1 - x7)'"y') = A(1 — x’) ar -Il<x<l. 
(26) 
b. Consider the boundary conditions 


y and y’ remain bounded as x > —17, 


: . i Q7) 
y and y remain bounded as x > 1”. 


Show that the boundary value problem (26), (27) is self-adjoint. 
C. Itcan be shown that the singular Sturm-Liouville boundary 
value problem (26), (27) has eigenvalues Ay = 0, A; = 1, 


Ay =4,...,A, — n2, ... . The corresponding eigenfunctions 
are the Chebyshev polynomials T,,(x): To(x) = 1, T(x) = x, 
T(x) = 1 — 2x?, ... . Show that 
1 

T, T, 

J Fh A) a) ys 0, mn. (28) 
2/2 
2 (1-2) 


Note that this is a convergent improper integral. 
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11.5 Further Remarks on the Method of 
Separation of Variables: A Bessel Series 
Expansion 


In this chapter we are interested in extending the method of separation of variables developed 
in Chapter 10 to a larger class of problems—to problems involving more general partial 
differential equations, more general boundary conditions, or different geometrical regions. 
We indicated in Section 11.3 how to deal with a class of more general differential equations or 
boundary conditions. Here we concentrate on problems posed in various geometrical regions, 
with emphasis on those leading to singular Sturm-Liouville problems when the variables are 
separated. 

Because of its relative simplicity, as well as the considerable physical significance of 
many problems to which it is applicable, the method of separation of variables merits its 
important place in the theory and application of partial differential equations. However, this 
method does have certain limitations that should not be forgotten. In the first place, the problem 
must be linear so that the principle of superposition can be invoked to construct additional 
solutions by forming linear combinations of the fundamental solutions of an appropriate 
homogeneous problem. 

As a practical matter, we must also be able to solve the ordinary differential equations, 
obtained after separating the variables, in a reasonably convenient manner. In some problems 
to which the method of separation of variables can be applied in principle, it is of very limited 
practical value due to a lack of information about the solutions of the ordinary differential 
equations that appear. 

Furthermore, the geometry of the region involved in the problem is subject to rather 
severe restrictions. On the one hand, a coordinate system must be employed in which the 
variables can be separated, and the partial differential equation replaced by a set of ordinary 
differential equations. For Laplace's equation there are about a dozen such coordinate systems; 
only rectangular, polar, circular cylindrical, and spherical coordinates are likely to be familiar 
to most readers of this book. On the other hand, the boundary of the region of interest must 
consist of coordinate curves or surfaces— that is, curves or surfaces on which one variable 
remains constant. Thus, at an elementary level, we are limited to regions bounded by straight 
lines or circular arcs in two dimensions, or by planes, circular cylinders, circular cones, or 
spheres in three dimensions. 

In three-dimensional problems, applying the method of separation of variables to 
Laplace's operator uy, + Uyy + Uz, leads to the equation X" + AX = 0 in rectangular 
coordinates, to Bessel's equation in cylindrical coordinates, and to Legendre's equation in 
spherical coordinates. It is this fact that is largely responsible for the intensive study that 
has been made of these equations and the functions defined by them. It is also noteworthy 
that two of the three most important situations lead to singular, rather than regular, Sturm- 
Liouville problems. Thus singular problems are by no means exceptional and may be of even 
greater interest than regular problems. The remainder of this section is devoted to an example 
involving an expansion of a given function as a series of Bessel functions. 


The Vibrations of a Circular Elastic Membrane. In Section 10.7 (see equation (7)) we 
noted that the transverse vibrations of a thin elastic membrane are governed by the two- 
dimensional wave equation 


a’( Uxx + Uyy) = ug. (1) 


To study the motion of a circular membrane, it is convenient to write equation (1) in polar 
coordinates: 


" 1 1 
a Urr F m7 T —5 u00 = ug. (2) 
r EF 


11.5 Further Remarks on the Method of Separation of Variables: A Bessel Series Expansion 


We will assume that the membrane has unit radius, that it is fixed securely around its 
circumference, and that initially it occupies a displaced position independent of the angular 
variable 0, from which it is released at time t = 0. Because of the circular symmetry of the 
initial and boundary conditions, it is natural to assume also that u is independent of 0; hence 
u is a function of r and t only. In this event the differential equation (2) becomes 


1 
ac + rt) = Utt, O<r< 1, t>0. (3) 
r 
The boundary condition at r = | is 
u(1,t) =0, t= 0, (4) 
and the initial conditions are 
u(r,0) = f(r), u(7r,0)=0, Oxrzl (5) 


where f(r) describes the initial configuration of the membrane. For consistency we also 
require that f(1) = 0. Finally, we state explicitly the requirement that u(r, t) must be bounded 
forO <r « I. 


EXAMPLE 1 


Find the solution of the two-dimensional wave equation (3) in the unit circle with boundary condition 
(4) and initial conditions (5). 


Solution: 
Assuming that u(r, t) = R(r)T(t), and substituting for u(r, t) in equation (3), we obtain 
R” +(1/r)R’ ] T" 


2 
R =ar °° (©) 


We have anticipated that the separation constant must be negative by writing itas —A? with A > 0.? 
Then equation (6) yields the following two ordinary differential equations: 


r? R" +rR' + APR = 0, (7) 
T" +7a°T =0. (8) 

Thus, from equation (8), 
T(t) = kı sin(Aat) + ky cos(Aat). (9) 


Introducing the new independent variable € = Ar into equation (7), we obtain 


dR dR 
2 2 
= +E— +E R =0, 10 
u a (10) 
which is Bessel’s equation of order zero. Thus 
(5) = c1 Jo(E) + c2¥o(§), (11) 


where Jo and Y, are Bessel functions of the first and second kinds, respectively, of order zero (see 
Section 11.4). In terms of r, we have 


R(r) = ci Jol Ar) + c3Yo9(Ar). (12) 
The boundedness condition on u(r, t) requires that R remain bounded as r — O°. Since 


Yo(Ar) — —ooasr — 0*, we must choose c = 0. The boundary condition (4) then requires 
that 


Jo(A) — 0. (13) 


1? By denoting the separation constant by —A?, rather than simply by —, we avoid the appearance of numerous 
radical signs in the following discussion. 
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Consequently, the allowable values of the separation constant are obtained from the roots of 
the transcendental equation (13). Recall from Section 11.4 that J9( A) has an infinite set of discrete 
positive zeros, which we denote by A1, Ax, À3. ... , An, ... , ordered in increasing magnitude. 
Further, the functions Jo9( A,r) are the eigenfunctions of a singular Sturm-Liouville problem and can 
be used as the basis of a series expansion for the given function f. The fundamental solutions of 
this problem, which satisfy the partial differential equation (3), the boundary condition (4), and the 


boundedness condition, are 


us(r,t) = Jo(A,r) sin(Anat), n=1,2,..., (14) 


var, t) = Jo(Anr) cos(Ayat), n=1,2,.... (15) 


Next we assume that u(r,t) can be expressed as an infinite linear combination of the 


fundamental solutions (14), (15): 


u(r, t) = 3 ous. t) + CnVn(r, t)) 


n=1 
oo 
= 3 endo Anr) sinat) + cr Jor) cos(Anat)). (16) 
n=1 
The initial conditions require that 
oo 
u(r,0) = M e Jour) = f(r), (17) 
n=1 
oo 
u(r, 0) = X AnaknJo(Anr) = 0. (18) 


n=1 


From equation (24) of Section 11.4, we obtain 


kn =0, r= 


1 
n rf(r)Jo( Aur) dr 
0 


i s WmL2. re (19) 
| r(Jo(Anr))? dr 
0 


Thus the solution of the partial differential equation (3) satisfying the boundary condition (4) 
and the initial conditions (5) is given by 


oo 


ulr, t) = 3 Cn Jor) cos( Anat) (20) 


n-l 


with the coefficients c,, defined by equation (19). 


Problems 


1. Consider Laplace's equation uxx + Uyy = 0 in the parallelogram 
whose vertices are (0, 0), (2, 0), (3, 2), and (1, 2). Suppose that the 
boundary condition u(x, 2) — f(x) is imposed on the side y — 2 for 
1 € x € 3, and that on the other three sides u = 0 (see Figure 11.5.1). 

a. Show that there are no nontrivial solutions of the partial 
differential equation of the form u(x, y) — X(x)Y(y) that also 
satisfy the homogeneous boundary conditions. 


1 
b. Let€ =x — 2^ 1] — y. Show that the given parallelogram 


in the x y-plane transforms into thesquare0 < £ < 2,0 9 <2 
in the £7-plane. Show that the differential equation transforms 


into 
5 
46 — Ugn + uy =Q. (21) 


How are the boundary conditions transformed? 
C. Show that in the €7-plane, the differential equation (21) 
possesses no solution of the form 


ul, n) = UCE) V (n). 


Thus in the xy-plane, the shape of the boundary precludes a 
solution by the method of the separation of variables, while in 
the €7-plane, the region is acceptable but the variables in the 
differential equation can no longer be separated. 
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| FIGURE 11.5.1 | The regions in Problem 1. 


2. Find the displacement u(r,f) in a vibrating circular elastic 


membrane of radius 1 that satisfies the boundary condition 
u(1,1) 20, t>0, 

and the initial conditions 
u(r,0) 20, u;(r, 0) = g(r), 


where g(1) = 0. Hint: The differential equation to be satisfied is 
equation (3) of this section. 


O<r<l, 


3. Find the displacement u(r,f) in a vibrating circular elastic 
membrane of radius 1 that satisfies the boundary condition 
u(1,1) 20, +t >0, 
and the initial conditions 
u(r, 0) = f(r), 
where f(1) = g(1) = 0. 
4. The wave equation in polar coordinates is 


1 
a 


ur(r, 0) = g(r), 


O0<r<l, 


ugg = Utt. 


1 1 
Urr + —uy + z 
r r 


Show that if u(r, 0,1?) = R(r)O(0)T(t), then R, O, and T satisfy 
the ordinary differential equations 
r? R" E rR' E (A1? — n?) R = 0, 
0" - 2^0 — 0, 
T"  Aa?T 2 0. 
5. Inthecircular cylindrical coordinates r, 0, z defined by 
y —rsin6, 


x —rcos, £-—2, 


Laplace's equation is (see Problem 16 in Section 10.8) 


1 1 
Urr + —Hu, + 3 400 F Uz = 0. 
r r 


a. Show that if u(r, 0, z) = R(r)O(0)Z(z), then R, O, and Z 
satisfy the ordinary differential equations 


r? R" +rR' + (dr? - n?) R = 0, 
©” +n°O =0, 
Z' - XZ «0. 
b. Show that if u(r, 0, z) is independent of 0, then the first 
equation in part a becomes 
r? R" +rR' Ar) R — 0, 
the second is omitted altogether, and the third is unchanged. 


6. Find the steady-state temperature distribution in a semi-infinite 
rod0 < z < oo,0 <r < 1, if the temperature is independent of 0 
and approaches zero as z — oo. Assume that the temperature u(r, z 
satisfies the boundary conditions 


u(l,z) = 0, 
u(r,0) = f(r), 


Hint: Refer to Problem 5. 
7. The equation 


z0; 


Garak 


Vxx + Vyy + ky =0 
is a generalization of Laplace’s equation and is sometimes called the 
Helmholtz”? equation. 
a. In polar coordinates the Helmholtz equation is 


1 1 2 
Ver + =V; + -5 V88 T k^v =0. 
r r 


If v(r, 0) = R(r)O (0), show that R and © satisfy the ordinary 
differential equations 


DR"LrRRR(OSD-AR-I0, 0" X06 =0. 

b. Consider the Helmholtz equation in the disk r < c. Find 
the solution that remains bounded at all points in the disk, that 
is periodic in 0 with period 27, and that satisfies the boundary 
condition v(c, 0) = f(0), where f is a given function on 
0 < 0 « 2m. Hint: The equation for R is a Bessel equation. 
See Problem 3 of Section 11.4. 


8. Consider the flow of heat in a cylinder O < r < 1, 
0 < 0 < 2m, —oo < z < oo of radius 1 and of infinite length. 
Let the surface of the cylinder be held at temperature zero, and let the 
initial temperature distribution be a function of the radial variable r 
only. Then the temperature u is a function of r and t only and satisfies 
the heat conduction equation 


1 
2c u) —u, 0<r<1, t>0, 
m 
and the following initial and boundary conditions: 


u(r,0) = f(r), 
u(1,t) =0, 


O<r<l, 


t> 0. 


13The German scientist Hermann von Helmholtz (1821-1894) was trained in 
medicine and physiology; early in his career he made important contributions 
to physiological optics and acoustics, including the invention of the 
ophthalmoscope in 1851. Later his interests turned to physics, especially 
fluid mechanics and electrodynamics. During his lifetime, he held chairs in 
physiology or physics at several German universities. 
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Show that 


oo 


242 
u(r,t) — 1 cy Jol Anr) e| Ant, 


n=1 
where Jo(A,,) = 0. Find a formula for c,. 


9. In the spherical coordinates p, 0, 6 (p > 0,0 < 0 < 2m, 
0 < @ < m) defined by the equations 


x-—pcosÜsinó, y-psinÜsinó, z=pcos¢, 
Laplace's equation is (see Problem 17 in Section 10.8) 
£^ ugs c 2pu, + (esc? $)ugg + uoo +(cotd)ug = 0. 


a. Showthatifu(p, 0, 6) = P(p)O(0)®(@), then P, O, and 
® satisfy ordinary differential equations of the form 


p? P" 4: 2p P' — u?P — 0, p» 0, 
9" -- A28 — 0, 0 « 8 « 2n, 
(sin? 9) ®” + (sind cos $) $' 
4 (u? sin? $ — A7)6 = 0, 0c <r. 


The differential equation for P is of the Euler type, while the one 
for ® is related to Legendre's equation. 

b. Show that if u(p, 0, à) is independent of 0, then the first 
equation in part (a) is unchanged, the second is omitted, and the 
third becomes 


(sin? 9) ®” + (sind cos 9) ®’ + (nu? sin? 9) = 0. 
C. Define a new independent variable s = cos o. Then show 
that the equation for in part b becomes 
a ae? n trey, xpessi 
s s =0, -Il<s<l. 
ds? ds |" : 


Note that this is Legendre's equation. 


10. Find the steady-state temperature u(p, o) in a sphere of unit 
radius if the temperature is independent of 0 and satisfies the boundary 
condition 


u(l @) = f(¢), 


Hint: Refer to Problem 9 and to Problems 22 through 29 of Section 
5.3. Use the fact that the only solutions of Legendre’s equation that 
are finite at both +1 are the Legendre polynomials. 


O<o <7. 


116 Series of Orthogonal Functions: Mean 


Convergence 


In Section 11.2 we stated that under certain restrictions, a given function f can be expanded in 
a series of eigenfunctions of a Sturm-Liouville boundary value problem, the series converging 


1 
to x) + f(x^)) at each point in the open interval. Under somewhat more restrictive 


conditions, the series converges to f(x) at each point in the closed interval. This type of 
convergence is referred to as pointwise convergence. In this section we describe a different 
kind of convergence that is especially useful for series of orthogonal functions, such as 
eigenfunctions. 

Suppose that we are given the set of functions $1, do, .. 
the interval 0 < x < 1 and satisfy the orthonormality condition 


. , Ọn, that are continuous on 


1 
f r(x) ói(x)0 j(x)dx = 63; (1) 


where r is a nonnegative weight function and 6;; is the Kronecker delta. Suppose also that we 
wish to approximate a given function f, defined on 0 < x < 1, by a linear combination of 
Q1, ... , Pn. That is, if 


n 
Sn(x) = M ^ aidi(x), Q) 
i-l 
we wish to choose the coefficients aj, ... , a, so that the function S, will best approximate 
f on O0 «x x < 1. The first problem that we must face in doing this is to state precisely what 
we mean by “best approximate f on 0 < x < 1." There are several reasonable meanings that 
can be attached to this phrase. 


1. We can choose n points x1, ... , x, in the interval 0 < x < 1 and require that S,( x) have 
the same value as f(x) at each of these points. The coefficients aj, ... , a, are found by 
solving the set of linear algebraic equations 


S adi) = fy, jd ah Q) 
i=l 


This procedure is known as the method of collocation. It has the advantage that it is very 
easy to write down equations (3); we need only to evaluate the functions involved at the 
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points x1, ... , Xn. If these points are well chosen, and if n is fairly large, then presumably 
S,(x) not only will be equal to f(x) at the chosen points but also will be reasonably 
close to it at other points as well. Collocation also has several deficiencies. One is that 
if one more base function $,,, is added, then one more point x,,, is required, and all 
the coefficients must be recomputed. Thus it is inconvenient to improve the accuracy of 
a collocation approximation by including additional terms. Further, the coefficients a; 
depend on the location of the points x1, ... , Xn, and it is not obvious how best to select 
these points. 


2. Alternatively, we can consider the difference | f(x) — S,(x)| and try to make it as small 
as possible. The trouble here is that | f(x) — S,(x)| is a function of x as well as of the 
coefficients a,, ... , an, and it is not clear how to calculate a;. The choice of a; that makes 
| f(x) — S,(x)| small at one point may make it large at another. One way to proceed is to 
consider instead the least upper bound!* of | f(x) — S,Cx)| for x in O < x < 1, and then 


to choose aj, ... , a, so as to make this quantity as small as possible. That is, if 
Ej(a;, m + dn) = lub | f(x) = Sn(x) > (4) 
0xxxl 
then choose a), ... , a, so as to minimize E,,. This approach is intuitively appealing and 
is often used in theoretical calculations. In practice, however, it is usually very hard, if not 
impossible, to write down an explicit formula for E,(aj, ... , an). Further, this procedure 


shares one of the disadvantages of collocation: upon adding an additional term to S,,(x), 
we must recompute all the preceding coefficients. Thus it is not often useful in practical 
problems. 


3. Another way to proceed is to consider 
1 
hlan sss) = f TOU) — Sa) dx. 6 
0 


If r(x) = 1, then J, is the area between the graphs of y = f(x) and y = S,(x) (see 
Figure 11.6.1). We can then determine the coefficients a; so as to minimize 7,. To avoid 
the complications resulting from calculations with absolute values, it is more convenient 
to consider instead 


1 
use a) f r(x) (f(x) — Sy(x))2dx 6) 
C 


as our measure of the quality of approximation of the linear combination S,(x) to 
f(x). Although R, is clearly similar in some ways to 7,, it lacks the simple geometric 
interpretation of the latter. Nevertheless, it is much easier mathematically to deal with R, 
than with /,,. The quantity R, is called the mean square error of the approximation Sn 
to f.If aj, ... , a, are chosen so as to minimize R,, then S, is said to approximate f in 
the mean square sense. 


I —nisthe area between the graphs of 
y = f(x) (blue) and y = S,(x) (red). 


The least upper bound (lub) is an upper bound that is smaller than any other upper bound. The lub of a bounded 
function always exists and is equal to the function's maximum if it has one. 


568 


CHAPTER 11 Boundary Value Problems and Sturm-Liouville Theory 


To choose dj, ... , a, so as to minimize R,,, we must satisfy the necessary conditions 
OR, 
da, ^9 De ue (7) 
Writing out equation (7) and noting that 0S,,(x; a1, ... , an) /Oa; is equal to d;(x), we obtain 
OR, l 
"m 2 f rGOQLfG) — $,G)]6;(G)dx = 0. (8) 
i 0 


Substituting for $,(x) from equation (2) and making use of the orthogonality relation (1), we 
find that 


1 
di x r(x)f(x)ó;(x)dx, i-l...,n. (9) 
0 


The coefficients defined by equation (9) are called the Fourier coefficients of f with respect 
to the orthonormal set ó;, 42, ... , 6, and the weight function r. Since the conditions (7) 
are only necessary and not sufficient for R, to be a minimum, a separate argument is required 
to show that R, is actually minimized if the a; are chosen by equation (9). This argument is 
outlined in Problem 5. 

Note that the coefficients (9) are the same as those in the eigenfunction series whose 
convergence, under certain conditions, was stated in Theorem 11.2.4. Thus S,(x) is the nth 
partial sum in this series and constitutes the best mean square approximation to f(x) that is 
possible with the functions $1, ... , n. We will assume hereafter that the coefficients a; in 
S,(x) are given by equation (9). 

Equation (9) is noteworthy in two other important respects. In the first place, it gives a 
formula for each a; separately, rather than a set of linear algebraic equations fora;, ... , a, as 
in the method of collocation, for example. This is due to the orthogonality of the base functions 
Q1, ... , Q5. Further, the formula for a; is independent of n, the number of terms in S,(x). 
The practical significance of this is as follows. Suppose that, to obtain a better approximation 
to f, we desire to use an approximation with more terms — say, k terms, where k > n. Itis then 
unnecessary to recompute the first n coefficients in S; (x). All that is required is to compute, 
from equation (9), the coefficients a,,,. ... , ay arising from the additional base functions 
Pntis --- > Px. Of course, if f, r, and the ¢,, are complicated functions, it may be necessary 
to evaluate the integrals numerically. 

Now let us suppose that there is an infinite sequence of functions $1, ... , n, ... , that 
are continuous and orthonormal on the interval 0 < x < 1. Suppose further that as n increases 
without bound, the mean square error &, approaches zero. In this event the infinite series 


o aii) 


i=1 


is said to converge in the mean square sense (or, more simply, in the mean) to f(x). Mean 
convergence is an essentially different type of convergence from the pointwise convergence 
considered up to now. A series may converge in the mean without converging at each point. 
This is plausible geometrically because the area between two curves, which behaves in the 
same way as the mean square error, may be zero even though the functions are not the same 
at every point. They may differ on any finite set of points, for example, without affecting the 
mean square error. It is less obvious, but also true, that even if an infinite series converges at 
every point, it may not converge in the mean. Indeed, the mean square error may even become 
unbounded. An example of this phenomenon is given in Problem 4. 

Now suppose that we wish to know what class of functions, defined on 0 < x < 1, canbe 
represented as an infinite series of the orthonormal set ¢;, i = 1,2, ... . The answer depends 
on what kind of convergence we require. We say that the set $41, ...,@n, ... is complete 
with respect to mean square convergence for a set of functions £ if, for each function f in F, 
the series 


f(x) = M adi), (10) 


i=l 


with coefficients given by equation (9), converges in the mean. There is a similar definition 
for completeness with respect to pointwise convergence. 
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Theorems having to do with the convergence of series such as that in equation (10) can 
now be restated in terms of the idea of completeness. For example, Theorem 11.2.4 can be 
restated as follows: The eigenfunctions of the Sturm-Liouville problem 


-(pG)y) +4(x)y = Ar(x)y, O< x <1, (11) 
a y(0) +a2y (0) =0, Bry) + B2y/(1) =0 (12) 


are complete with respect to ordinary pointwise convergence for the set of functions that are 
continuous on 0 < x < 1 and that have piecewise continuous derivatives there. 

If pointwise convergence is replaced by mean convergence, Theorem 11.2.4 can be 
considerably generalized. Before we state such a companion theorem to Theorem 11.2.4, 
we first define what is meant by a square integrable function. A function f is said to be 
square integrable on the interval 0 < x < 1 if both f and f? are integrableP on that 
interval. The following theorem is similar to Theorem 11.2.4 except that it involves mean 
convergence. 


Theorem 11.6.1 


The eigenfunctions 9; of the regular Sturm-Liouville problem (11), (12) are complete with respect 
to mean convergence for the set of functions that are square integrable on 0 < x < 1. In other words, 
given any square integrable function f, the series (10), whose coefficients are given by equation (9), 
converges to f(x) in the mean square sense. 


Itis significant that the class of functions specified in Theorem 11.6.1 is very large indeed. 
The class of square integrable functions contains some functions with many discontinuities, 
including some kinds of infinite discontinuities, as well as some functions that are not 
differentiable at any point. All these functions have mean convergent expansions in the 
eigenfunctions of the boundary value problem (11), (12). However, in many cases these series 
do not converge pointwise, at least not at every point. Thus mean convergence is more naturally 
associated with series of orthogonal functions, such as eigenfunctions, than ordinary pointwise 
convergence. 

The theory of Fourier series discussed in Chapter 10 is just a special case of the general 
theory of Sturm-Liouville problems. For instance, the functions 


6,(x) = V2sin(nrx),n —1,2, ... (13) 
are the normalized eigenfunctions of the Sturm-Liouville problem 
y"+Ay=0, y(0) =0, y(1) =0. (14) 


Thus, if f is a given square integrable function on 0 < x < 1, then according to Theorem 
11.6.1, the series 


f(x) = So bmbm(x) = V2 bn sinimnx), (15) 
m=1 m=1 
where 
1 1 
bm =f f(x) bm(x)dx = vaf f(x) sin(mrx)dx, (16) 
0 0 


converges in the mean. The series (15) is precisely the Fourier sine series discussed in Section 
10.4. If f satisfies the further conditions stated in Theorem 11.2.4, then this series converges 
pointwise, as well as in the mean. 


15For the Riemann integral used in elementary calculus, the hypotheses that f and f? are integrable are independent; 
that is, there are functions such that f is integrable but f? is not, and conversely (see Problem 6). A generalized 
integral, known as the Lebesgue integral, has the property (among others) that if f? is integrable, then f is also 
necessarily integrable. The term square integrable came into common use in connection with this type of integration. 
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Similarly, the Sturm-Liouville problem 
y"+Ay=0, y(0) =0, y(1) =0 (17) 


has normalized eigenfunctions ¢9(x) = 1, $,(x) = v2 cos(nn x), n -—1,2,.... Thus this 
problem is naturally associated with the Fourier cosine series. 


EXAMPLE 1 


Let f(x) = 1 fr 0 < x < 1. Expand f(x) using the eigenfunctions (13), and discuss the 
pointwise and mean square convergence of the resulting series. 


Solution: 


The series has the form (15), and its coefficients bm are given by equation (16). Thus 


1 1 22 
ba n F(x) bm(x)dx = "Ji sin(mz x)dx = ——(1 — cos(m7)), (18) 

0 0 m 

22 

2 LT 

-Vazen d m mo 
mm 
0 meven. 


and the n" partial sum of the series is 


n n 
]— 4 1 
Si(x) = V2 bm sin(mm x) = 25 cone) sin(mz x) = = » = sin(mz x). 
m=1 m=1 mz1,3;..., n 


(19) 
The mean square error associated with the nth partial sum of the series is then 


1 
n- | (f(x) — Sy(x))7dx. (20) 
0 


By calculating R, for several values of n and plotting the results, we obtain Figure 11.6.2. 
This figure indicates that R, steadily decreases as n increases. Of course, Theorem 11.6.1 asserts 
that R, — O as n — co. Pointwise, we know that S,(x) > f(x) 21asn— oo for0< x < 1; 
further, S,(x) has the value zero for x = 0 or x = 1 for every n. Although the series converges 
pointwise for each value of x, the least upper bound of the error does not diminish as n grows larger. 
For each n, there are points close to x = 0 and x = 1 where the error is arbitrarily close to 1. The 
situation is similar to the Gibbs phenomenon observed in Example 1 in Section 10.3; see Figures 
10.3.3 and 10.3.4. 


0 2 4 6 8 10 12 14 16 18 20 n 


Dependence of the mean square error R, on n in Example 1. Note 
that because bak = 0, Rox = Ro s 
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Theorem 11.6.1 can be extended to cover self-adjoint boundary value problems having 


periodic boundary conditions, such as the problem 
y" + Ay =0, 


y(-1) — y(1) = 9, 


(21) 


y(-D -»y() =0 


considered in Example 4 of Section 11.2. The eigenfunctions of problem (21) are 


1 
$o(x) = 25 o,(x) = cos(mmcx) for n = 1,2,..., and v,(x) = sin(nmx) for 
n=1,2,....If f is a given square integrable function on —1 < x < 1, then its expansion 
in terms of the eigenfunctions ¢, and v, is of the form 
oo 
fa) = $+ 2C coss x) + b, sinn), (22) 
where 
1 
an = I f(x) cos(namx)dx, n=0,1,2,..., (23) 
-1 


b, 


1 
n fGOsin(nrx)dx, n=1,2,.... 
-1 


(24) 


This expansion is exactly the Fourier series for f discussed in Sections 10.2 and 10.3. 
According to the generalization of Theorem 11.6.1, the series (22) converges in the mean for 
any square integrable function f, even though f may not satisfy the conditions of Theorem 


10.3.1, which ensure pointwise convergence. 


Problems 


1. Extend the results of Example 1 by finding the smallest 
value of n for which R, < 0.02, where R, is given by equation (20). 


2. Let f(x) =x for0< x « 1, 

and let @m(x) = 4/2sin(mmx). 
a. Find the coefficients b, in the expansion of f(x) in terms of 
ó1Q). PA), .... 
b. Calculate the mean square error R, for several values of n 
and plot the results. 
c. Find the smallest value of n for which R, < 0.01. 


3. Follow the instructions for Problem 2 
using f(x) —-x(1— x) for0« x < 1. 
4. In this problem we show that pointwise convergence of a 
sequence S,(x) does not imply mean convergence, and conversely. 
a. Let $,(x) = nJxe te /2, 0 < x < 1. Show that S, (x) — 0 
as n — oo for each x in 0 < x < 1. Show also that 


1 
R, = n (0— S,(x)) ax = 5 (1 ze) 
0 


and hence R, — oo as n — ov. Thus pointwise convergence 
does not imply mean convergence. 

b. Let S,(x) = x" forO < x < 1, and let f(x) = 0 for 
0 < x < 1. Show that 


1 
= m 2 = 
Rn EJ (fx) — 5560) dx = 4 IT 


and hence S,(x) converges to f(x) in the mean. Also show 
that S,(x) does not converge to f(x) pointwise throughout 
0 < x < 1. Thus mean convergence does not imply pointwise 
convergence. 


5. Suppose that the functions ¢œ1,...,@n satisfy the 
orthonormality relation (1) and that a given function f is to be 
approximated by S,(x) = ci61(x) + --- + Ca@n(x), where the 
coefficients c; are not necessarily those of equation (9). Show that 
the mean square error R,, given by equation (6) may be written in the 
form 


1 n n 
Rp = r(x) Pdr -X a; + Soci — aij)’, 
0 1 i=1 


i= 
where the a; are the Fourier coefficients given by equation (9). Show 
that R„ is minimized if c; = a; for each i. 
6. In this problem we show by examples that the (Riemann) 
integrability of f and of f? is independent. 


x. Ocxc-zl, 
a. Let = =, 
et fe» a x= 0; 


1 
Show that f f(x)dx exists as an improper integral, but 
0 


1 
n f?(x)dx does not. 
0 


1, x rational, 


b. Let f(x) = { 


—]l, x irrational. 


1 1 
Show that T f?(x)dx exists, but 7 f(x)dx does not. 
0 0 


Hint: Recall that when a (Riemann) integral exists, upper and 
lower Riemann sums on a partition of 0 < x < 1 converge to the 
value of the integral as the length of the largest subinterval in the 
partition decreases to zero. 
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7. Suppose that it is desired to construct a set of polynomials fo(x), 
SiC), falx), .... f(x), ... , where f(x) is of degree k, that are 
orthonormal on the interval 0 < x < 1. That is, the set of polynomials 
must satisfy 


1 
CSi fi) zi Fix) fi x) dx = 6 jx. 
0 


a. Find f(x) by choosing the polynomial of degree zero such 
that ( fo, fo) = 1. 

b. Find f(x) by determining the polynomial of degree one such 
that ( fo. fi) =O and (fi, fi) — 1. 

c. Find f(x). 

d. The normalization condition ( fk, fe) = 1 is somewhat 
awkward to apply. Let go(x), g1(x). ... , gx(x), ... be the 
sequence of polynomials that are orthogonal on 0 < x < 1 and 
that are normalized by the condition g,(1) = 1. Find go(x), 
21 (x), and go(x), and compare them with fo(x), fi(x), and 
fax). 

8. Suppose that it is desired to construct a set of polynomials 
Po(x), Pi(x), ... , P(x), ... , where P(x) is of degree k, that are 
orthogonal on the interval —1 < x < 1; see Problem 7. Suppose 
further that P(x) is normalized by the condition P,(1) = 1. Find 
P(x), P(x), Po(x), and P3(x). Note that these are the first four 
Legendre polynomials (see Problem 19 of Section 5.3). 


9. This problem develops some further results associated with 
mean convergence. Define R,(aj,....a,), S,(x), and aj by 
equations (2), (6), and (9), respectively. 

a. Show that 


1 n 
2 
R= I r(x) f2(x)dx — Soa 
0 : 
i=l 
Hint: Substitute for S,(x) in equation (6) and integrate, using the 
orthogonality relation (1). 
n 


1 
b. Show that 5 a? < f r(x) f*x) dx. This result is known 
i-l g 


as Bessel’s inequality. 
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oo 
c. Show that ) a? converges. 


i=1 
oo 


1 
d. Show that lim R, zi r(x) f2(x)dx — X ds 
noo 0 i=l 
oo 


e. Show that »» ai 9i(x) converges to f(x) in the mean if and 
i=l 
only if 


oo 


1 
(x) f?(x)dx = a 
f FAX x X Soa 


i=1 


This result is known as Parseval’s equation. 


In Problems 10 through 12, let ¢),¢2,...,@n,... be the 
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JA vn 
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Answers to Problems 


Chapter 1 


Section 1.1, page 8 


7. 


vou de ks 


y > 3/2as t > co 

y diverges from 3/2 as t > oo 

y -1/2ast — oo 

y diverges from —1/2 as t > co 

y 22-3y 

y-y-2 

y = 0 and y = 4 are equilibrium solutions; y — 4 if initial value 


is positive; y diverges from 0 if initial value is negative. 


8. 


y = O and y = 5 are equilibrium solutions; y diverges from 5 if 


initial value is greater than 5; y — O if initial value is less than 5. 


9. 


y = 0 is equilibrium solution; y — 0 if initial value is negative; 


y diverges from 0 if initial value is positive. 


10. 


y = O and y = 2 are equilibrium solutions; y diverges from 0 if 


initial value is negative; y — 2 if initial value is between 0 and 2; y 
diverges from 2 if initial value is greater than 2. 


11. 
12. 
13. 
14. 
15. 
16. 
17. 


18. 
19. 
20. 


21. 


22. 
23. 
24. 
25. 


Q9 
(c) 
(g) 
(b) 
(h) 
(e) 
a. da/dt 2 3—3x107*a;aing, tinh 
b. a 10^ g; no 
dc = 3 x107 — 3 x 10c, with initial condition c(10) = 
a(0)/ 105 = ag/108 
dV /dt = —kV?/5 for some k > 0. 
du/dt = —0.05(u — 70); u in °F, tin min 
a. dq/dt = 500 — 0.44; q in mg, tinh 
b. q —> 1250 mg 
a. my! = mg - kv? 
b. v ymg/k 
c. k=2/49 
y is asymptotic to f — 3 as t — co 
y > oco, 0, or —co depending on the initial value of y 
y > œ or —oo or y oscillates depending on the initial value of y 


yf) = 0 and then y fails to exist after some t, > 0 


Section 1.2, page 15 


1. 


a. y= 5+ Qo- 5)” 

b. y = (5/2) + Oo - 5/27” 

c. y= 5 + (yo — Se 

Equilibrium solution is y = 5 in (a) and (c), y = 5/2 in 
(b) solution approaches equilibrium faster in (b) and (c) than 
in (a). 

a. yz 54(yy 5e 

b. y= 5/2 + (yo — 5/2)e” 


10. 


12. 


13. 


14. 


€. y= 5 + (yy - 5)e” 

Equilibrium solution is y = 5 in (a) and (c), y = 5/2 in (b); 
solution diverges from equilibrium faster in (b) and (c) than in 
(a). 

a. y=ce“+b/a 

b. (i) Equilibrium is lower and is approached more rapidly. 

Gi) Equilibrium is higher. (iii) Equilibrium remains the same and 
is approached more rapidly. 

a. y, = (b/a) 

b. Y'=aY 

a. y (f) 2 ce” 

b. y= ce” +b/a 


y=ce" t bla 

a. T =21n18 = 5.78 months 

b. T = 21n(900/(900 — pọ)) months 
€. pg = 90001 — e76) = 897.8 

a. T=51n502 19.56 s 

b. 718.34 m 

b. v = 49tanh(r/5) m/s 

e. x = 245Incosh(r/5) m 

f. Tz948s 


Q(t) = CV exp(-(t — t))/(RC)) 

Q'-30-10*9, Q(0)-0 

. O(t) = 10* (1 — eno Ji t in h; after 1 year 

Q S 9277.11 g 

c. Q'- -30/104, Q(0) = 9277.77 

d. Q( = 9277.77e3/! | t in h; after 1 year Q & 670.07 g 
e. T z2.60 yr 


a. r = 0.02828 day! 

b. QW = 100e~0.028287 
c. T2245d 

a. u- T + (uy — T)e* 
b. kr =1n2 

a. Q(t) = CV(1— e-"/RO) 
b. Q0 5 CV=0, 

C. 

a. 

b 


Section 1.3, page 22 


1. 


Second order, linear 
Second order, nonlinear 
Fourth order, linear 


Second order, nonlinear 


r--2 
r=2,-3 
r=0,1,2 
r=-1,-2 
r=1,4 


Second order, linear 
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17. Fourth order, linear 


18. Second order, nonlinear 


Chapter 2 


Section 2.1, page 31 
l.c. y=ce* +t/3 — 1/9 +e; y is asymptotic to 1/3 — 1/9 
as t oo 
2.c. y ce? + Pe? [3 
3. c. y=ce™ *14 0e7/2; 
4. c. y = c/t + 3cos(2t)/(4t) + 3 sin(21)/2; y is asymptotic to 
3 sin(21)/2 as t > oo 


y oast>o 


yolast- oo 


5. c. y= ce? —3e'; y > oo or -oo as t > co 
6. c. y 2—te ct; y > oo, 0, or —oo as t > oo 
7. c. y= ce™ +sin(2t) — 2cos(20); y is asymptotic to 


sin(2t) — 2 cos(2t) as t > oo 


8. c. y ce"? - 37? — 12t 24; y is asymptotic to 31? — 12t+ 24 
as f > oo 
9. y-3e' +2(t— De? 
10. y - (? - De7?J2 
1l. y=(sind/? 
12. y=(@#-14+2e%/t, tz0 
13. b. y= E cos f + 3 nee (a+ +e” dy = -$ 
C. y oscillates for a = ag 


14. b. y= (2+aBr + 4)e”’ — 
ag = -2/(3z + 4) 
Cc. y 0fora- ag 

15. b. y z te™ + (ea - De*/t; ag = 1/e 
c. y 0ast—5 Ofora- ap 

16. b. y= (é—e-asin1)/sint; ay — (e— 1)/sin1 
Cc. y lfora-ag 

17. (ty) = (1.364312, 0.820082) 

18. yg = —1.642876 

19. a. y= 


12 as t > oo 


b. t= 10.065778 
20. yo = —5/2 
21. yo —16/3; y > —o as t > œ for yọ = —16/3 
29. See Problem 2. 
30. See Problem 4. 


2e771/2) /(32 +4); 


12+ B cos 2t+ = sin 2t— 188 els. 


; yoscillates about 
65 65 65 


Section 2.2, page 38 
1l. 3?-28-2c6 y#0 
2. y7! +cosx =c ify Z0; 


3. 2tan(2y) — 2x — sin(2x) = c if cos(2y) z 0; also 
y = +(2n + 1)x/4 for any integer n; everywhere 


also y = 0; everywhere 


y = sin(ln |x| + c) if x Z 0 and |y| < 1; also y 2 +1 
yte #0 


everywhere 


4 
5. yx? +2(e? -eM) =c; 
6. 3y+y -2 zc 
7. y=kx 

8 y=+V2 +c 
9 


. a. y-I/(à -x-6) 
c. —2<x<3 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 
18. 
19. 
20. 
21. 


22. 


23. 


25. 
26. 
27. 
28. 
29. 
30. 
31. 


ep Pp Op 


cp Op 


y 
y 


2 = 


PS Fs Ss 5 2 


y=-V2x -2x +4 


-]«x«2 
= (2(1 —xe* — 172 
—1.68 < x < 0.77 approximately 
r 2 2/(1—21n60) 
0 «60 « ye 


-(3-2 Ix) 


I< 25 
1.01 


=—- +- 
2= Ty 


x» VI 
y-25/2- x39 — e + 13/4 


—1.4445 « x « 4.6297 approximately 


-1/2 


4x2 — 15 


y- (z- arcsin (3 cos? x))/3 
|x — z/2| < 0.6155 
-3y-x-x 4220, |x] <1 
—4y— 3) =-1, 
x=-2 

x = In(2) 
yo4ify>0; y=O0ify) =0; 
T = 3.29527 

yo4ast oco 

T — 2.84367 
3.6622 « yy « 4.4042 


T ad — A 
a 


y 2 -o if yy <0 


Injay+b| +k; az0, ay+b#0 


ly + 2x1? MM 
arctan (y/x) — In |x| = c 
xb-yl-coz0 
ly-x| = cly 3x; also y =x 
ly + x| ly + 4x]? = c 

IxPp? -5y = ¢ 


ex = p? -2°| 


Section 2.3, page 47 


1. 
2. 


3. 


t = 1001n 100 min = 460.5 min 


Q(t) = 120y(1 — exp(-1/60)); 


p 


opPOoPTPDOS 


k= 


Prp SS 


120y 
63, 150 e—1/50 4 25 — 625 
2501 2501 5002 


level = 25; amplitude = 25 y 2501/5002 = 0.24995 
130.41 s 
(In 2)/r yr 


9.90 yr 
8.66% 


k(e re 1) /r 

k = $3930 

9.77% 

$3086.64/yr; $1259.92 

t = 146.54 months 
$246,758.02 

0.00012097 yr! 

Qo exp(—0.000120971), tinyr 
13,305 yr 


sint 


cost+ 


Q() = = 


x3 — 1| < 16/33 or -1.28 < x < 1.60 
y--1/0G?/2 * 2x 1); 


y = —3/2 + 42x — e* + 13/4; 


10. a. c = 2.9632; no 
b. t= 10In2 % 6.9315 
Cc. rt = 6.3805 
11. b. y, & 0.83 
A 1n(13/8) 
1n(13/12) 
13. a. u(t) = 2000/(1 + 0.0487)! ? 
C. t & 750.77 s 


min & 6.07 min 


14. a. u(t) 2 ce" + Tg + kT, (kcoslæt) + o sin(wt))/(k? + o) 


b. R&911°F; rcz351lh 

c. R-KT,/VKE +; t= (1/o) arctan (@/k) 
15. a. c=k+P/r+ (co — k- P/rje™!/V; 

lim c =k+P/r 

b. T-(VIn2)/r; T =(Vln10)/r 

c. Superior, T = 431 yr; Michigan, T = 71.4 yr; Erie, 

T = 6.05 yr; Ontario, T = 17.6 yr 


16. a. 50.408 m 
b. 5.248 s 
17. a. 45.783 m 
b. 5.129 s 
18. a. 48.562 m 
b. 5.1945 
mg kvo mvo 
19. a. -- Inj 1 ; 
"rn n( i) k 
k 
f= "i1 + zu) 
mg 
20. v = —mg/k + (vg + (mg/k)) exp(-kt/m) 
v = vo — gt; yes 
v=0fort>0 


21. vy = 2d? g(p — p')/9u 

e = Axa? g(p — p')/3E 

x — utcosA, y= —gt? /2 + utsinA +h 

—16L? /(u? cos? A) + LtanA +3 2 H 

0.63 rad € A < 0.96 rad 

u = 106.89 ft/s, A = 0.7954 rad 

v = (ucos A)e ", w= —g/r + (usinA + g/r)e "* 

x = (ucosA)(1 — e~")/r, 

=-gt/rt+(usinA+g/r)(l—e")/r+h 
d. u= 145.3 ft/s, A = 0.644 rad 

24. d. k= 2.193 


23. 


a 
b 
c. 
a 
b 
22. b 
d 
e. 
f. 
a 
b 
X 


Section 2.4, page 57 


= 


0«t«3 

z/2«t«3z[/2 

—co«t«-2 

l«t«z 

P-y <1 

1-245 »0or1-2 +y «0, t#0, y#0 
Everywhere 

y#0, y#3 

y = tJ», — 4? if yo # 0; 
10. y 2 (1/yo - P)! if yo #0; 
interval is |t| < 1/ J/yo if yo > 0; 
11. y = yo/ Aj 2tys + 1 if yo #0; y = 0 if yọ = 0; 
interval is —1/2y5 < t < 00 if y; #0; —co < t < œ if y; =0 


Se SS oe ee 


It] < lyol/2 


y = 0 if yọ 20; 
-œ <t < œify <0 


18. 


22. 


23. 


24. 
25. 


26. 


y= 


27. y=e for0<t<1; 
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y=+4/ Z In(1 + 8) + y2; -(0 — exp 2332/2)? <t < co 


y > 3 ifyo > 0; y =0 ify 20; y > —% if yy < 0 


y > —oœ if yọ «0; y> Oif y 20 


y> 0ify <9; y> æ ify > 9 


y > —oo if yy < y, & —0.019; otherwise y is asymptotic to 


. y, (0 is a solution for t > 2; 
. f is not continuous at (2, —1) 


sp ppp 


. n= 0: y= — 


No 


. Yes; set ty = 1/2 in equation (19) in text. 


ly] € 4/37 = 1.5396 

y(t) is a solution for all t 
1 pog 

y2c——ct—— 

M) HO) Sig 


1 t 


H(s)g(s)ds 


H(s)q(s)ds_ where p(t) = exp ( D P(t) a) 


H(t) Jig " 


t 
n=liy= ex] q(t) nod) 
to 


y=r/(k+cre") 


y= +(e/(o + cee’) ) 


1/2 


1 


y= see) for0 <t<1; 


1G — Le fort >1 


y=e OD fort >1 


Section 2.5, page 67 


7 
8. 


9. 


y = —a/b is asymptotically stable, y = 0 is unstable 


y = 1 is asymptotically stable, y = 0 and y = 2 are unstable 


y = 0 is unstable 


y 
c. 


= 0 is asymptotically stable 


y = Oo + (1 — yokt)/(1 + (1 — yo kt) 


y = —1 is asymptotically stable, y = 0 is semistable, 
y = 1 is unstable 


y = -l and y = 1 are asymptotically stable, y = 0 is unstable 


y = 2 is asymptotically stable, y = 0 is semistable, 
—2 is unstable 


y = 0 and y = 1 are semistable 


16. a. y= 0 is unstable, y = K is asymptotically stable 


17. 


18. 


19. 


20. 
21. 


22. 


b. 


Concave up for 0 « y € K/e, 


concave down for K/e € y « K 


cop TP Pp Ao oS foe 


y = Kexp((In(yg/K))e^") 
y(2) & 0.7153K & 57.6 x 106 kg 
T 22.215 yr 


. V* (k/(az))"? zh/(3as; yes 


k < a ra? 


Y = Ey, = KE(1 — (E/r)) 

Y,, = Kr/4 for E = r/2 

X12 = KU x y1 - (4h/rK))/2 

y = Q is unstable, y = 1 is asymptotically stable 
y = yo/Oro + — yoe™) 

y = yge P 

x = Xp exp(-ayo(1 — e 7/8) 

Xo exp(—-ayo/ f) 


576 Answers to Problems 


23. 


24. 


25. 


26. 


27. 


b. z- 1/(v- (1 - ve^) 

c. 0.0927 

a,b. a= 0: y = 0 is semistable. a > 0: y = vais 
asymptotically stable and y — -vya is unstable. 


a. a < 0: y = O is asymptotically stable. a > 0: y = O is 
unstable; y = ya and y = —4/a are asymptotically stable. 


a. a«0: y= 0 is asymptotically stable and y = a is unstable. 


a = 0: y = Q0 is semistable. 


a > 0: y = 0 is unstable and y = a is asymptotically stable. 


: mE B pa(e*a-Pt — 1) 
a. lim x(1) =min(p,g); x(t) = qe P Lp 


2 
! pat 
b. ! Dp f m—BÉ— 
pues pi xO pat 1 


Section 2.6, page 75 


E 


N = =| S| REA REA RE 
SS e END) SS 


21. 


> Mo SN SS On ecd. qe» 


xb-óx-y!-2y-c 

Not exact 

x —Xxy-2x-2y) -3y-c 
ax? + 2bxy + cy? =k 

Not exact 

e? cos2x +x - 3y 2 c 
ylnx + 332 -2y-2c 
x)b-yzc 


y= (x+ V238- 3x2) /2, Ix] < 2873 


y = (x — (243 +x — 8x — 16)!/)/4, x > 0.9846 


b=3; xy*+2y=c 

b=1; eY4x=c 
xi-21In|y|-y?-2 c; alsoy=0 
xie*siny 2 c 

ua =e (3Q0yey)e*zc 
Hax)=e*; y=ce +1 +e” 
uO) =y; xycycosy-siny-c 


u(y) =e” /y; xe?” —In|y| =c; alsoy=0 


Section 2.7, page 82 


1. 


Se x ON. A 


1.2, 1.39, 1.571, 1.7439 

1.1975, 1.38549, 1.56491, 1.73658 
1.19631, 1.38335, 1.56200, 1.73308 
1.19516, 1.38127, 1.55918, 1.72968 


1.1, 1.22, 1.364, 1.5368 

1.105, 1.23205, 1.38578, 1.57179 
1.10775, 1.23873, 1.39793, 1.59144 
1.1107, 1.24591, 1.41106, 1.61277 


1.25, 1.54, 1.878, 2.2736 

1.26, 1.5641, 1.92156, 2.34359 
1.26551, 1.57746, 1.94586, 2.38287 
1.2714, 1.59182, 1.97212, 2.42554 
0.3, 0.538501, 0.724821, 0.866458 
0.284813, 0.513339, 0.693451, 0.831571 
0.277920, 0.501813, 0.678949, 0.815302 
0.271428, 0.490897, 0.665142, 0.799729 


Converge for y > 0; undefined for y < 0 


APFS RRE rPe Ep SS 


Converge for y > 0; diverge for y < 0 


Converge for |y(0)| < 2.37 (approximately); diverge otherwise 


Diverge 


9. 


11. 


12. 


a. 2.30800, 2.49006, 2.60023, 2.66773, 2.70939, 2.73521 
b. 2.30167, 2.48263, 2.59352, 2.66227, 2.70519, 2.73209 
2.29864, 2.47903, 2.59024, 2.65958, 2.70310, 2.73053 
2.29686, 2.47691, 2.58830, 2.65798, 2.70185, 2.72959 


1.70308, 3.06605, 2.44030, 1.77204, 1.37348, 1.11925 
1.79548, 3.06051, 2.43292, 1.77807, 1.37795, 1.12191 
1.84579, 3.05769, 2.42905, 1.78074, 1.38017, 1.12328 
1.87734, 3.05607, 2.42672, 1.78224, 1.38150, 1.12411 


—0.166134, —0.410872, —0.804660, 4.15867 
—0.174652, —0.434238, —0.889140, —3.09810 


sDPROPDPP RO 


A reasonable estimate for y at t = 0.8 is between 5.5 and 6. No 


reliable estimate is possible at t = 1 from the specified data. 


13. 
14. 


b. 2.37 < ag < 2.38 
b. 0.67 < ay < 0.68 


Section 2.8, page 90 


1. 
2. 


3. 


dw/ds =(s +1)? c(w--2Y, w(0)20 


dw/ds =1-(w+3, w(0) = 0 
n okk 
a $,() = 2 


k=1 
c. lim $,( =e -1 
noo 


n 
a. $0) = Y, (- Dt fg + I! 
k=1 
c. lim $,(t) 2 4e"? + 2r-4 
noo 


n 


pe 
5. a. = ———————— 
a. $,0 2 133-075 
n pel 
6. a. t=- r 
$«0 2 3.508 GET) 
p p Ü 
Te 8. =z = — + —; 
a. (t) 3 y(t) z + 9 
B i 2111 115 
9 E— 
P30) 3*7.9*3-7.9-11 * @-9p-15 


10. 


.a. ói0-5 $0 = 1-7 


4 


ga 3110 E 
t)=t- = 
P30) 4°24 16-10 64 13 


Don r 8 
. 8a. ht, bY =t- + a Et Oe) 
PrP p p TD ur 7 
C= ta ar ae ar rere 
_, P DP TP 31t 7 
$40 —t- 4 t 3r Ur "up 690» 
" 
a. O i a es 
2 3 4 S 6 
épo-- LE ELE IL rog, 


Ê ff 3D A 
t)2-t— = O(t’), 
P30) 2*1 20 + 45 OO) 
2 4 5 6 
diners ae aT agg» 


Section 2.9, page 99 


1. 
2. 


3. 


y, = (-1)"(0.9)"¥o3 Yn > Oasn > co 


Yn 2 yo V (n * 2) -1)/2; y, > oo asn > oo 


_ yo ifn-4korn-4k- 1; 
Yn = =y% ifn = 4k — 2 or n = 4k — 3; 


y, has no limit as n — œ 


12. 
15. 


ee Sr ams 


= (0.5)" (yg — 12) + 12; 
$2283.63 
$258.14 
30 yrs: $804.62/mo; $289,663.20 total; 
yrs: $899.73/mo; $215,935.20 total 
$103,624.62 
9.73% 
b. u, > -œ asn > œ 


4.7263 
1.223% 
3.5643 
3.5699 


Y, 12asn > oo 


ons 


Miscellaneous Problems, page 100 


p= p pi pà = pat 
Se SS 


m= 
n 


YN WY YK FE FF om 
SNE SSS INC 


VY N 
pow 


EC SUE RE Rs 


l 
y 
. 8. y=tt(c—17! 
b 
c 


y = c/x? - x5 


2y +cosy—x-—sinx=c 
xX +xy-3y-y 20 
y2-34ce7* 


rytaytx=c 
y=x!(1-el) 
= (4+ cos 2 — cos x)/x* 
eytx+y? =c 
xln|x| +x! +y- 21n |y| = c; also y = 0 
x/34xyce-c 
x? + 2xy + 2y? = 34 
y = c/cosh?(x/2) 
e™ cosy +e” sinx 2c 


y= ce _ e 


X 
2 
y- zi e™ ds+3e* 
0 


2xy xy) -xr 2c 

e+e?” =c 

2xy) + 33y - Ax +y =c 

y = e” /(3t) * ce"! ft 

e 4 In |x| 2 c 

GO 4 y! 4 De 2c 
arctan(y/x) — In và +y? =c 


y [i «yf? 2c 
1 s? 2 


. y2t-2t(c- Py! 


-1 
. y=sint+ (ccos 1 - sini) 


27. a. v — (A - bv =b 

b. v= (bf u(t)dt--c)/u(t), — (t) = exp(-a? /2 — br) 
28. ysct +c Int 
29. y-c,lntt c; +t 
30. y= I/kIn|(k—0/(k | +c ifc; = > 0; 
y = 2/karctan(t/k) + c; ifc; = —K? < 0; 
y=-2r! +c, ifc 20; alsoy=c 

2 

31. y= t5¢- 2c))tte, +c; alsoy=c 
Hint: u(v) = v? is an integrating factor. 
32. y?-cttc 


33. 
34. 
35. 


36. 
37; 


Answers to Problems 


y = cı sin(t + c;) =k, sint + k, cost 
yin |y] -y +c1y +t = 65; 
e -(t*oy +c 


also y = c 


4 1 
= -(t+1}2-- 
e ) 3 


y-3lnt- SIn +1)  Sarctanr 2. 2 In2+ ža 


Chapter 3 


Section 3.1, page 109 


s 


10. 


Me (e X OG Ux de D» [9 


e +c e7" 


Il 
^ 


2t 


ll 
^ 


e t coe^ 
el? + cet 
+c ,e-> 
gn 4 cec 


exp((I + V3) tc, exp((1 — v5) 


Il 
^ 


e cu o a 
Il 
eo 


=e; y-ooast oo 

5 t 1 3t 

= -e'--e"; yo Oast>o 
"UE S 
=-l-e*; yo -last oo 


y = (2/33) exp((- 1+ V33y/4) 


e 1 = 33/4); y > co as f > oo 


11. E TAR LE e^; y 2 oo ast — oo 

12. y- — Fenn + Se ens y > —oo as t > oo 

13. y"4-y'-6yz0 

14. y- weiter minimum is y = 1 att =1n2 

15. y= —e' + 3e'/?; maximum is y = 9/4 at t = In(9/4), 
y-0att-1n9 

16. a=-2 

17. y > 0 for a «0; y becomes unbounded for a > 1 


5TT 


18. y > 0 for « < 1; there is no @ for which all nonzero solutions 
become unbounded 


19. 


20. 


21. 


a. y=(6+ f)e?! — (4 + pye* 
b. t, = In((12 + 3f)/(12 + 2)), 
B =6(1 + V3) = 16.3923 

tm > 1n(3/2, y, > oo 
y=d/c 

aY" + bY’ +cY¥ =0 


b>Oand0 «c < b*/(4a) 
c«0 
b «O0 and 0 « c « D?/(4a) 


"PP PRO 


Section 3.2, page 119 


9e CX Sx UU Re SS 


Jun 


_4 3 2 
Ym = 55 (6 + DP p) 


578 Answers to Problems 


9. 2<x<3n/2 
11. The equation is nonlinear. 
12. The equation is nonhomogeneous. 
13. No 
14. 31e" + ce? 
15. -4(tcost — sint) 


16. y, and y, are a fundamental set of solutions if and only if 
aib, — ab; #0. 


17. y(0- eU T ze. y) = -30% + ze 
18. a BA + se, 

y,(t) = Seep + se) 

19. Yes 

20. Yes 

21. Yes 

22. b. Yes 


C. {yi £), y4(0). and (y,(0,y4(0] are fundamental sets of 
solutions; {y,(t), y4(0)) and {y4(t), y5(t)} are not 

23. cte! 

24. ccost 

25. c/(1 - x2) 

27. 225 

28. p(t) = 0 for all t 

30. Ift is an inflection point, and y = dX) is a solution, then from 


the differential equation p(t)’ (to) + q(t) (tp) = 0. 


x 
32. Yes, y= gene d e Pg; + ce 
X) 


x 
t 
33. Yes, Ja af Ha. c, ; 
H(x) x d 


where ux) = ew(- JC + 23) 
x x 


34. Yes y= cx! + cx 

36. xu" +3xp' +0 -x-wuz0 

37. y" -xn=0 

38. The Legendre and Airy equations are self-adjoint. 


Section 3.3, page 125 


= 


e? cos3 — ie? sin 3 & —7.3151 — 1.0427i 

-1 

e cos(z/2) — ie? sin(z/2) = —e?i & —7.3891i 
2 cos(In2) — 2i sin(In 2) = 1.5385 — 1.2779i 
y = ġe cost + ce sint 

e cos( 57) t ce! sin( V57) 

y = c,ecost- cse sint 

y = cq€ ^?! cos(2t) + cze” sin(22) 

y = c,€*cos(t/2) + coe“ sin(t/2) 

eh + ce h 

€?! cos(3t/2) + ce? sin(3t/2) 


ee 4 p ARUP 
- 
ll 
eo 


E 
> 

Se 
Il 
o 


= pm 
So 
<% yY 
Il Il 

^ 


; sin(2t); steady oscillation 


= 
^ 
iz 


= —e-7sn2t; growing oscillation 


y= (1 + 23) cos f — (2 B v3) sint; steady oscillation 


= 
= 


15. y= V2 e—-7/9 cost + V2 e—t-7/4) sin t; decaying 
oscillation 
16. a. u= 2e'/6 cos(v23 1/6) 2 (2/23 es sin(v23 1/6) 


17. 


18. 


b. t= 10.7598 


u=2e/ cos( 34 1/5) + (7/v34) et sin(V34 1/5) 


" T — 14.5115 

a. y 22e" cos( 51) + ((@+2)/v5) e~ sin( 57) 
b. a = 1.50878 
c 
d 


T (x — arctan(2V5/(2 + a)))/v5 


< 0/5 


26. y= c cos(ln t) +c sin(In f) 
27. y=ct! +e? 
28. y=cÉ + et! 
29. yc cp? 
30. y2 cet ct? 
31. y= ct? cos(Inr) + c£? sin(In f) 
'(t) + 2p(t)q(t 
32. e q (0 + OW must be a constant 
2(-q()?? 
33. Yes, y=c,cosxtc)sinx, x= petra 
34. No 
35. Yes, y= cje? cos( V3 2/4) + ce I sin( 5/4) 
36. y=c exp( / en dt) tc, ex(- J er dr) 


Section 3.4, page 132 


E 


= = jat 
FS 


E 
o 


18. 
19. 
20. 
21. 
22. 


23. 


24. 


Mee Sup 3 duoc. E 


= cue + cote! 
= ce" + cute 


= ce"? + cg 


= cje” + cre? 


= cue + ce 


3 
y 
y 
y = ce! cos(3t) + cze sin(3t) 
J: 
3 
y 


= ce + cu IA 
y = e"! cos(t/2) + ce! sin(t/2) 
y = 22205 — Tenn, y > —oo as t > oo 
y = 2te*", y= coast oo 
yo Te) p Stet), y + Qast > oo 
y = 2e? + (b - 1)te'^; b=1 


a ye? 3 gU 
2 


b. ty = = yu = 5e? & 2.24664 

c. ye]. C + je? 

d. ty =4b/(1 + 2b) > 2as b > oo; 

ym = (1 + 2b) exp(-2b/(1 + 2b)) > œ as b > co 


y(t) =P 

y(t) =r? 

y(t) =f! Int 
y(x) = cos x 
y(x) = x7 /2 cos x 


x 
2 2 ee) 
y= c,e ?* rf e? P ds + ce óx^ /2 
0 


YA 2»,0 | yp G)exp (- f l pin aye 
to 50 


25. 
26. 
27. 
29. 
31. 
32. 
33. 
34. 


yo(t) = (Int 
y2(t) = cos g 


2 COS X 


yx) 2x 
b. yo + (a/b)y, 

y=c P c Int 
yoct 1? vct? Int 
y=et | +t lnt 
yop? co? Int 


Section 3.5, page 141 


1. 


t 2t 


y=ce+oe'—e 
y-ce + oe" — z +3t- 2? 
y = cje” + coe + 2e” — 3e?! 
3 3 
3t -t —t 2 5—t 
—c,€ ctc0 c —te «te 
4 ET 8 
y + e707 + 34 zl sin(2t) — 1 cos(2t) 
2 2 2 
y=ce'+ote'+Pet 
y =c; cost + c sint — p cos(2t) — 2 sin(2t) 
u = c, COS(M@pt) + c» sin(@gt) + (ex, — o^. cos(ot) 


u = c, COS(Wpft) + Cy sin(@gt) + 1/ (2a )t sin(cr) 


y= ce“. eos( y 15 1/2) + ce? sin( v 15 1/2) 


+ oe cie 
1 1 
1l. goes eee 
Ime gs 2 
7. 19 lo 1.3 
12. y= — sin) - — cos(2t) + -° — = + Ze 
aT ee’ AUR Waa 
13. y= Ate! - 3e + EBE +4 


15. 
16. 


17. 


18. 


19. 


20. 


21. 


y = 2cos(2t) — ; sin(2t) — arcos) 


y = e™ cos(2t) + Ta sin(2t) + te^! sin(2t) 


a. Y(t) = (Agf - A,P + Ast? + Ast - A4) 

+ (Bot? + Bit + B,)e™™ + Dsin3t + E cos 3t 

b. Ay =2/15, A, = —2/9, A, = 8/27, A4 = —8/21, 
A, = 16/81, By = —1/9, B, = —1/9, B, = —2/27, 
D = —1/18, E = —1/18 

a. Y(t) = e'(Acos2t + B sin 2t) + (Dot + D, )e” sint 
+ (Et + Ee? cost 

b. A = -1/20, B = -3/20, Dy = —3/2, D, = —5, 
Ey = 3/2, E, = 1/2 

a. Y(t) = Ae™ + (Bot? + Bt + B,)e™ cost 

+ t(Dot? + Dit + Dz)e™ sint 

b. A=3, By 2 2/3, Bj 20, Bj, =1, Dy =0, 
D,=1, D,=1 

a. Y(t) = (Ag? +A,t+A,) sinQt) 

+ (Bot? + B,t + B5)cos(2t) 

b. Aj =0, A = 13/16, A, =7/4, By = —1/12, 
B, =0, By = 13/32 

a. Y(t) = (Apt? + A,t + Aj)e' sin(2t) 

+ (Bot? + B,t + B3ye! cos(2t) + e"' (Dcos t + E sin t) + Fe! 
b. Ap = 1/52, A, = 10/169, A, = —1233/35,152, 
By = —5/52, B, = 73/676, B, = —4105/35,152, 
D=-3/2, E=3/2, F=2/3 

a. Y(t) 2 f(Agt + A, )e~ cos 2t + t(Bot + B, )e~ sin 2t 
+ (Dot + D, )e~! cos t + (Ept + Ej)e ?' sint 


22. 


23. 


24. 


25. 
28. 


Answers to Problems 


b. Ay 2 0, A, 2 3/16, By — 3/8, B, =0, 
Dy = -2/5, D, = —7/25, Ey = 1/5, E, = 1/25 
2 
b. w=-2+c,e% 
w 5 tee 
N 
y = c, cos At + cy sin At + £ (a,,/(A? — m?x*)) sin(mat) 
m=1 

B t O0xtzEz 
73-0 + 2/2) sint-@/2)cost+@/2)e™!, t» x 
No 


t 2t 


y= cje" +e -— xe 


Section 3.6, page 146 


1. 


See T 


7. 
2n 


8. 
9. 


10. 
11. 
12. 


13. 
14. 


15. 
18. 
19. 
20. 


21. 


24. 
25. 


26. 


Y(t) = e! 
2 
Y(n2- 3€" 
Y(t) = 22e? 
y = cı cost + c; sint — (cos t) In(tant + sect) 
y = c, cos(3t) + c; sin(3t) + sin(37) In(tan(3r) + sec(3t) — 1 
y = c,€? + cote! — e Int 


y = cı cos(t/2) + c; sin(t/2) + t sin(t/2) + 
cos(t/2)) cos(t/2) 


y 2 ce! + cte! — Le In(1 + 22) + te’ arctant 


y-c,€ t cg + a — eX79) e(s)ds 


YQ) = ; 4 P In(f) 
Y(t) = -2? 


Y = AG - pet 


Yo) = cx na? 
YQ) = -$un COS X 
Y) 2x17? I 13? sin(x — f)g(t)dt 
t 
b. y 2 yocost + y sint + f sin(t — s)g(s)ds 


to 


t 
y= (b-a)! [| [P — e] g(s)ds 
to 


t 
E f e sin u(t — s)g(s)ds 


to 


t 
y= | t-se g(s)ds 
to 
yc ct c£ -4PInt 


y=cltp)tee+ st Det 


Section 3.7, page 157 


1. 
2 
3. 
4 


oz 


u=S5cos(2t—6), 6 =arctan(4/3) & 0.9273 
u = Ņy13cos(zt — ô), ô= m t arctan(3/2) & 4.1244 


= > sin(14r) cm, tins; t= 2/148 


u- sin(s 21) - z cos(8 21) ft tins; 


4y/2 
82 rad/s, T = 2/(4V2) s, R= 411/288 = 0.1954 ft, 


8 = z —arctan(3/2) = 2.0113 


579 


580 Answers to Problems 


5. 


u= e! (2cos(4 61) + (5/v6) sin(4 61)) cm, 


tins; w= 44/6 rad/s, T,- 2/26 s, 
T,/T =7/(2V/6) = 1.4289, c 0.4045 s 


6. 
u = 3.87008 rad/s, u/cy = 3.87008/ 


Te 
8. 
11. 


12. 
14. 
16. 
17; 
18. 


19. 
20. 


21. 


22. 
24. 


u = 0.057198e79-5! cos(3.87008 t — 0.50709) m, t in s; 
15 = 0.99925 


Q - 1076 pe = er) C; tins 


y = 20/9 = 1.4907 
a. r= VA? + B?, rcosÓ =B, rsinü = —A 


8-264 (4n Dz/2,n 20,12, ... 


vo < —rug/ Qm) 
27/4/31 
k=6, v=+2V/5 
a. u(t) 2 e/m 
x (uo V4kn = Y? cos(ut) + mv, + yug) sin(yt)) //4kmn = 7? 


b. R? = 4m (kus + yugvo + mv?) / (4km — 7?) 
plu" + poggu =0, T =2raypl/ pog 
u= v2sin( V2r) 


clockwise 


u = (16/V127)e—/8 sin( V127 1/8) 


clockwise 


u= acos( Vm) + by/m/ksin( ykm) 
u=sint, A=1, T=22 

A =0.98, T = 6.07 

e =0.2, A=0.96, T = 5.90; 
T = 5.74 

f. e= -—0.1, A = 1.03, T2655; e= —0.2, 

A = 1.06, T = 6.90; e= —0.3, A = 1.11, T2741 


arr grp py 


€ =0.3, A = 0.94, 


Section 3.8, page 167 


1. 
2. 
3. 
4. 


2 sin(t/2) cos(131/2) 

2 cos(3at/2) cos(zt/2) 

2 sin(7t/2) cos(t/2) 

u” + 10u’ + 98u = 2 sin(t/2), 


u(0)—0, u’(0) = 0.03, 


uin m, tins 


5. 


6. 


mm COIT (16067 cos( 731) 
m ES e75! sin( /73 f) — 160 cos(t/2) + 3128 sin(t D 


b. The first two terms are the transient solution. 


d. w= 44/3 rad/s 
u = cos(8t) + sin(8r) — 81 cos(81)/A ft, t in sec; 


1/8, z/8, z/4, 3n /8 s 


ds 


8. 


11. 


13. 
14. 


a. = (30 cos(2t) + sin(21) ) ft, tins 


b. m=4slugs 
u= (v2/6) cos(3t — 37/4) m, tins 
F(t — sin t), O<t<a 
u= Fo((27 — t) — 3 sin t), z«tE2z 
—4F) sin f, 2m «t«oo 


a. u = (cost — cos(ot))/(o — 1) 
a. u= ((@ +2) cost — 3cos(ot))/(o* — 1) + sint 


Chapter 4 


Section 4.1, page 173 


1. 
2. 
3. 


—oo«ft«oo 
t»-1l,orO«r«l,ort «0 
.Q,-9z/2«x«-nz[2, -n/2«x«1, 


1«x«z/2, n/2«x «3ny[2, ... 


14. 


16. 
17. 
20. 
21. 


—coo«x«-2, -2«x«2, 2«x«oo 

Linearly independent 

Linearly dependent; fi + 3f,(t) — 2f4(t) 2 0 

Linearly dependent; 2f,(t) + 136A — 3f4(0) — 7f4(t) = 0 
1 

—6e 7?! 

6x 

6/x 


sin? t = Lal cos(2t) 
2 2 
a. ag(n(n— 1)(n —2) - D) c a (n(n — 1) -- 2) + +a," 


b. (agr" +ayr | +++ + a, Je" 
c. el e™, e”, e? yes, Wel, e", e”, e?'] #0, 


—oo«t«oo 
W(t) = ce? 
WA) =c/t? 


y 7 ce + cst + cte 


y c£ +e * c(t 1) 


Section 4.2, page 180 


VD exp(i(z/4) + 2mz)) 

2. 2exp(i2z/3 + 2mz)) 

3. 3exp(i(z + 2mz)) 

4. 2exp(i(11z/6) + 2mz) 

5. 1, TE +iv3), z(= " iv3) 

6. 21/4 e-2i/8 21/Ag7zi/8 

i (v3 +i)/v2, -(v3+i)/v2 

8. y-—cje! + cte + cet 

9. y- ce + cte! + cte 
10. y =c] ct ce + cate? 

_ , Vs t ft 

11. y=c,cost+c)sint+e c3 cos( 7 + cq sin 5 
+ e VPC, cos( =) + & sin(=))) 
12. y= cje + cote! * cse! + cye + cste™ + cse 
13. y= c] cst cse + ce + c5 cost + cg sint 
14. yc, coe * ce? c4 cost +c; sint 
15. y=e'((c, + cyt) cos t + (cz + cyt) sin f) 
+ e"((cs + cgt) cos t + (c; + cgt) sin t) 
16. y= (c; + cyt) cost + (c3 + c4t) sint 
17. y=cyet+ ge + ada 
18. y= cje” c c6? + m t go Oe) 
19. y=cye + ce + c3e™ cos(2t) + c4e ^ sin(2t) 
20. y 22—2cost- sint 


21. 
22. 
23. 


24. 
25. 
26. 


27. 
28. 


29. 


y- pev? sin(/ v2) = sel sin(1/ v2) 


y=2t-3 
= -Ze — 1 e = 1-2 16, 
3 10 6 15 
y= 2 ot fs 2A d cos t + —e'/? sint 


y 
y 2 "rg 5 C08 t= L sint 


y= 5 (cosh —cost)+ 5 Ginhr — sin ft) 


a. W(t) 2c, aconstant 
b. W(t) 2 -8 
c. W(n4 


b. u; 2 c, cost c; sint +c cos( Ver) tc, sin( Ver) 


Section 4.3, page 184 


l. y= cje + cote! + ce + je +3 

2. y=c e + ce + cs cost +c, sint 3r Ftsint 
3 - =è 1 1 =f 

©. y-cje TG COSE Fes Sine ode T4(t — 1) 
4. y=c rre E 
m ^ 3 48 16 
: z 1 

5. y=c COS Eek go SR caT COR Se eae ai c SORE 
6. yHe, tot c£ ce i 
secs cos( 31/2) t €6 sin( V31/2)) t a 

Teo y= Ža — cos(22)) + na 

8. y=(t-A)oost - (274) since sce 

9. y= wie Seine Lapi d ag. T9 uh 

20 40 520 
+ un cos(2t) — 2 sin(2t) 
65 130 

10. Y(t) -t(AgP + Ay? + Ast - A4) + Bre! 
11. Y(t) =t(Apt - Ape" + Bcost+Csint 
12. Y(t) = (AÊ *- Ait - A;) + (Bot + Bj) cost + (Cot + Cj)sint 
13. Y(t) = Ae! + (Bot + B,)e™ + te ' (C cost + D sint) 
14. kj2 ay k, = aga" +a"! b a, qo a, 


Section 4.4, page 188 


1. 
2. 


3. 
4. 


5. 


y = cı +c) cost + c; sint — Incost — (sin t) In(sec t + tan?) 


y =c] toe + ce" -— PP 


1 
= ce! + cse + cze” + — e" 
y 2 3 30 


5e 
ll 
^ 


" 1 
e! + cy cost + c3 sint — so cost 


; 1 
y=cye' + c cost + c sint — 5 (cost) In cost 


t 
+ (sin t) ln cost — stcost — xt sint + xe (e*/coss)ds 


6. 


= c, toe + ce™ + In 


-t 


to 

YO = y.) + y (t) t 

cselt)  cot(r) | | z / e™cse(s)ds + 
to 


csc(to) + cot (to) 


t 
e*csc(s)ds 
to 


Answers to Problems 581 


3 1 5 


7. Qa IL Ll cm D 
8. y= 3 + In(2)/2 — e *-7/? — In |sec(t)| 
t a rt 
pt i e 5 tan(s)ds + =f e tan(s)ds 
2 z/4 2 z/A4 
9. YQ) 2 x*/15 
t 
10. Y()- i (e™ — sin(t — s) — cos(t — s))g(s)ds 
to 
t 
11. Y= : I (sinh(t — s) — sin(t — s))g(s)ds 
to 
t 
12. Y= : "i et- sye(s)ds; Y(t) = —te! In |t| 
to 
Chapter 5 
Section 5.1, page 195 
1. p=1 
2. p=2 
3. p=% 
1 
4. p-- 
F3 
5. p=l 
6. p=3 
d (- yr 
Te ——————, p=% 
ri (2n + 1)! 
<x" 
8. 2 n" p = œ 
9. 1-(x-1) p=% 
10. 1-2(x-D-(x-1), p=% 
H. yocp" Ec psd 
n=1 
12. E p=1 
n=0 
13. Yo-nD"«-2"*, p= 
n=0 
14. a. y 21 2x € 332 8 (n 1x a es, 
b. y" = 27437 2x - 4? 332 E 5? A E (n2) (nt Dy - --- 
15. a. yl =a, * 2ayx 3a + faye + = + (nt Da, ux e 
= > na, x" | = Yin + Da, 4x", 
n=1 n=0 
y" = 2a, +6azx+ 12a4x? +2063 ++ + (n-2)(n- 1)a, x" + 
= 2 n(n — l)a,x"? = Yin +2) (n+ 1a, ox" 
n-2 n=0 
18. Yin + 2)(n + Da, 5x" 
n=0 
19. Yin + Da,x" 
n=0 
20. Y (n+ 2)(n + Dany + na, x" 
n=0 
21. aj Y (n Da, a, a 
n=1 
22. Y (n 2Yn-* Dany a)" 
n=0 
23. a, =(-2)"ay/n!, n=1,2,...5 age?* 


582 Answers to Problems 


Section 5.2, page 204 
1. a. 


dia = Onl + 20D) 
"^ xo c. x2 
bd. (9-7 I*2 t Dei -A On! = cosh x, 
Ó D x cU aH . 
»O-rtytgtnt Mg] = smh 
-3 
. a. an2 = gI 
(73) 2, C37, | C3* 
b, d. y =l, epe e + ILL T gps t 
id c 1 x" 1 3 
l(]—e* 
DU =i 


n=l 
«8. Any = a,/(n+ 2) 


2 x4 x6 em m^ 

bd. 6) 71 > x 
M odor TT ET 
x x x! eo 2ng ly2ntl 

=x+ ct x dup Sy) ee 
ugar. 2 ose 


. 8. (N+ 2)a,,5 — 


an+ — da = 0 
b. y@=1+ se 1 + La- Ip + La- Itt 


dis eA MES M 
«8. a4 = —k 24 n/n + An + 3)); 
kext k4x8 

=l1- —————— 
By 5169 3.4 3-4. 7-8 
% Lpyntl(p2,4ynl 
igy (1) (k^x^) l 
43-478 (4m + 3)(4m + 4) 
2.5 kx? 2 kx! à 
4-5-8-9-12-13 


_ Kx P 
4-5 4.5.8.9 


_ ind (—1)*! (g2ytyn* 
=! © Dias ea) 


yo (x) 


d, = a, =0 
iex 


Hint: Let n = 4m in the recurrence relation, m = 1, 2, 3, ... . 


.8. (n TOME Dania AUE Darit ay =N, n> l; 
a =-5% 
b ya) = 1-5 = 20 At 
1 1 
x) o x- ex p watt 
a Ay. =—a,/(nt1), n=0,1,2,... 
b 1 x x4 xÉ 
. y@)=1- 1 ub FEN M 
oo 
(-1)"x 2n 
= 1 ; 
Po 3-5- (2n— 1) 
x5 x? 
y2Q) =x- 2 Tlg $t 
as (-1)"x 2n4l 


e. 4-6- (2n) 


n=1 
. 8. a2 = —((n- 1) Gg +a, a, 4)/(n + 1) + 2), 
n- 1 2e Ay = —(a t a))/2 


b. GC MACC CAR 

»9)26-D-56- D^ Berl = Tæ- Ds 

. a. 3(n*-2)a,,5 — (n+ Da, E no 
x 5 6 

b,d. Tm 

park oe agr. ue 


3^.2.4.- (2n) 


-34.7.8.11.12 M 


2 16 
=x+ P eia 
nu e D 135° * 945" 


“4+ (2n) 2nl 
AM Era E 
10. a. (n+ (n+ Da, (n 34,20, n20,L2.. 
b,d. via) 21- ie tu - 
Tee eS Qnty ; 
2^ (2n)! 
x 
30) = 2-3 36 ~ 210 
Fer (pE Ont 2) an fs si 
2^(2n + 1)! 


11. a. 2(n +2) + Da, 5 t 3(n + 1)a,,, + (n+ 3)a, = 0, 
n=O, L2,... 


b. x99 21- 10-2» + go - 2)° + ge 2* + 


yal) == 2) ia- + Hee) + d 72 
12. a. y=24x+2 tle + ltte 
c. about |x| < 0.7 
13. a. ps4- de e Sa e Sat to 
c. about |x| < 0.5 
32 ls la 
14. a. y 2-342x- =x? - =x i Tee 
c. about |x| « 0.9 
1 1 1 
15. a. x09) 21—- 307 1} — pe 1) + ig D6 + 
1 1 1 
= -lj)-+ EE ee —1) 2E -1y 
yo) 2 (x —- 1) 1€ ) 50% ) +e y+ 


16. Hint: Consider using induction. 


18. a. y,@=1- I6 H-a ei a 
tey) =x- 42? E 
| Q- alae =10) 9 
b. 1, x, 1-22, P (anes x- $+ 4x 


C. 1, 2x, 432 — 2, 8 — 12x, 16x* — 48x? + 12, 
32x5 — 160x + 120x 


19. b. y2x- à/6-4 -- 


Section 5.3, page 209 


1. $"(0)—--1 q'" (0) — 0, $0) =3 
2. $"(1)-0, Q'" (1) = —6, $91) = 42 
3. $"(0- q""(0) = — $90) = —4a, 
4. p=%, p=% 
5. p=1, p=3, p=l 
6 p=1, p=V3 
7. a. p=% 

b. p= œ 

€. p—oo 

d. p= œ 

e. p=l 

Lo» 

g. p=% 

h. p=1 

i. p=1 

j. p=2 

k. p= V3 


5 


l p=1 
m. p = oo 
n. p=% 
o Q? — a2)o? 
8. a. y@)=1- Td - — o — 
(4? _ a2)? EN a*)a? á 
x9 
6! 
(Qm — 2. — a?) +++ (22 — ae y, 
(2m)! : 
- 32 mp 1= 2 
NUT 
(Qm — 1)? — a?) + (1 — a?) T 
(2m + 1)! 


b. y(x) or y?(x) terminates with x" as a = n is even or odd 
c n=0, y=1; n= 2, y = 1—2x?; 
3 


n=ly=x; 


Rey ae 


9 M c 
yw) e" + T20% + T80% + 
y(x) = x xt l é b ee 

: 12 180 504 ' 
1 
10. yx,06921- et + ut an Tee, 
yQ)2x dg iul baa poo 
2 12 20 60 i 
1 1 

11. =1 +x + — x4 + — 76 4... 

yy (x) +x + T LES T, 

=x+ T 

Y= x- gx + EOP + se neh 


12. Cannot specify arbitrary initial conditions at x = 0; hence x = 0 
is a singular point. 


2 n 
13. y=l+xt + ete 
2! n! 


xbox 6 m 
14. ee taa gae n 
15. y=1+x+Ż ter txt pers — 
E, y 
16. y=a (rene duum -) 
Óo d n 
ZOLL Fot ) 


18. — 3x, 1 2 102 (xe x x- 32, xm 
19. a. 1, x, (32 - D/2, (52 — 3/2, 

(35x* — 30x? + 3)/8, (6350 — 70x? + 1539/8 

C. Pi, 0; P$, +0.57735; P3, 0, x 0.77460; 

P4, 30.339098, 0.86114; P5, 0, x: 0.53847, 0.90618 


Section 5.4, page 218 
2 


1l. y= cx! + cx 

2. y= clx + 1717 + clx + 117/ 

3. y= cx + cx In |x| 

4. y= cx + cxln |x] 

5. y=c i] C529) 4 clx iC me va) 


3 3 
6. yc xp"? (S In m) + cx [72 a(S In m) 


7. ys cy + cx? In |x| 
y 


8. = cq (x - 2)? cos(2 In |x — 2]) + c4( — 2)? sin(2 In |x — 2|) 
9, y= 2/2 — x1 
10. y= 2x7!/? cos(2 Inx) — x7!/? sin(2 In x) 
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11. y= 2x? - 7x7 In |x| 
12. x=0, regular 
x= 1, 


13. x=0, regular; irregular 


14. x=0, irregular; x= 1, regular 


15. x=0, irregular; x= +l, regular 


16. x=1, regular, x =-1, irregular 
17. x=0, regular 
18. x=1, regula; x= —2, irregular 


19. x=0, 3, regular 
20. x=0, regular 


21. x=0, irregular 

22. x=0, +nz, regular 

23. x=0, irregular, x =-+tnz, regular 
24. a «1 

25. p»0 

26. y -2 

27. a. a « land f »0 


a 

b. « «land f 20, ora=l1andf>0 
Cc. a» landf»0 

d. «» landf 20, ora=l1andf>0 
e a-landf»0 


2 
30. yoa(i- ZIEL) 
33. Irregular singular point 
34. Regular singular point 
35. Regular singular point 
36. Irregular singular point 


37. Irregular singular point 


Section 5.5, page 223 


a 
1. b. r2r- 0 20 M NR 
bio n= enn D — i] 
r l rr =0 
(0T UO x 4 6 
u-— "C — OD" REA 
Ben ( 2:5 FASO 2.4.6.5.9.13 
deese (De + 
2”n!5 -9 . 13 -- (An + 1) 
x? x X 
: =1- C MER. NN 
d. x9 2.8 3.07 3a 37d 
(12? . 
ZnI3-1-11-- (4n - 1) 
"E TER 1 1 
2. = a=- D Tp— us Ta 
(n*rP*-$g 


e 
= 
T 
— 
| 
a 
Dun 
w 
— 
| 


584 Answers to Problems 


d. yo =x reso) 
— eee ee 
mne 
£— HG) ** 


md - 38 - 3) (m= 4) 


Hint: Let n = 2m in the recurrence relation, m = 1, 2, 3, .... 


20 M an-1 i 
3. b. rr-1)=0; a,= YU EST. 
rn-lr,z0 
- x x Mo M T 
is no) =a(1- 121 * 251 arD T ) 
2 p. MN LEM TTE 
4. b. ^06; On = pe ry =r, =0 
& x dts Eee a 
” a ^ Qn (n? 
2 9d Ze Ün-l C. = =— 
5. b. r^ -2z06; a, CHEN ri v5, T, V2 
c ye 2 32 
"LES NEUE EN 
1142/2) 210 4 2V2)0 + 21/2) 
(-1)" ens) 
etis ax DOKN: (n+ 21/2) 
d. y) = x? 
x| l=} £ + 
10-2y2) 20-222 - 2V2) 
a) 
n\(1 — 2V2X2 — 272) + (n - 22) 
6. b. ?20; (n+ra, =a, 1; n =m =0 
x" 
c. noiret RC tat g" 
7.a. r=0; 1 =0,%=0 
a(a +1) 
b. y,@)=1+ (x — 1) 


EE e aUe D. yl eel pe 
Do (x - D^- CT) 
y Hat DO -2 7 ala + D) - 


2^(n)? 


- (n(n — 1) — aa + 1) 


x (x—-1)" + 


8. a. 


n= > ry =Oatbothx = +1 


where P, 


—D"P 
1/2 _ 4) 
b. x6) = x-1] (Ès aaa 
(1 + 2a) -- 


-(2n — 1 + 2a)(1 — 2a) +++ (2n — 1 — 2a), y4(x) 
oo (-1)"Q, , 
EA nll-3-5-- 2n- peo 1)" where 


Q, 2 (1 a) (n 1+ aY(-o)(1 — a) = (n — 1 — a), 
9. b. P=0; n=0 nz0 iom 
n 
Dau DÀ uU C20 724 
c. yy @)=1+ ay? * Qn 
4, €200-7 2-(-1- 3) T" 
(n! 


For A = n, the coefficients of all terms past x" are zero. 
12. e. (n-1? - Db, = —b 


5,2» it is impossible to determine b}. 


Section 5.6, page 229 


1l. a. x=0; 
b. rír-1)20; r2L7,20 
2. a. x26; 
b. 2-3r-220; rn22,n-1 
3. None 
4. a gs 4 
b r(r- =) =0; sear r,-0 
a. x 2 —2; 
b r(r-2) =0; rn 1 =0 
4 4 
5. a. x=0; 
b. 2-120 r(|—irj--i 
6. a. x=1; 
b. ?4r20; r 20, r, 2-1 
7. a. x 2 —2; 
b. 2-(5/Ar20; r 25/A4r,20 
8. a. x=2; 
b. r?-2r=0; rnz2,r20 
a. x= —2; 
b. 2-2r20; rnz2,r,-20 
9. b. ri Z0, n= 
c. yi) = Lex 5? +x + ; 
ya) = y (x) Inx - 2x = I = 
10. b. 7, =1, rn =0 
Cc. yy) =x- 4x? +28 - Tes, 
yx) = —6y,(x) lnx + 1 2 33x? += Xe 
11. b. r, ZL r 20 
Cc. yy) — plg ia riu. 
ic 12 144 f 
332, 7 35 
-y (O lnx+ 1- ee L c9 
Jala) = a 9 ac — ys 
12. b. r, =1, s 
1 
k Elda L 
bn n 24 cei 
=-- I + ey oer 
yy (x) 33100 nx+x 90* 
13. b. ner =0 
c. xe 2e-D^(1-2 ia- D+ A a- ES 
d. p= 
14. c. Hint: (n — 1)(n—2) - (1 - a - f)(n — 1) - af 


(n— 1-4 a)n-—1-4 f) 
d. Hint: (n — y)n —1— y) - (1c a 9 ByY(n — y) - af 
=(n-y+a)\(n-y +P) 


Section 5.7, page 239 


oo 


) (—1)"x" 
5 HILL nn D' 
1 = H, +H, 1 nn 
y = —y, (x) Inx + "i = » pe 1) J 
e (= 1)"x n 
s dies n=0 (n! 
oo (-1*H, 


2 


Pm 


% (—1)"2" ; 

3. n= my 
e CD2H, | 
yx) TERES — m 
4. 
= (-1)” x \2m 
— ;3/2 

yx) =x 1+ (3) 


m=1 m\(1 + 2+ 5) (m =) 


oo 


ya) = x14 


wet ma 30-3 em-3 


Hint: Let n = 2m in the recurrence relation, m = 1, 2, 3, .... 


For r= ar te 0 and a; is arbitrary. 


Chapter 6 


Section 6.1, page 247 


1l. Piecewise continuous 


2. Neither 
3. Continuous 
4. a. F(s)=1/s?, s»0 
b. F(s 22/8, s»0 
c. F(s)-nl/s*!, s>0 
s 
5. F(s) = Page s>0O 
s 
6. F(s) = Suy s> lb] 
b 
7. F(s)= T s > |b| 
b 
8. F(s) = ae’ s>0O 
s 
9. F(s) = Sap? s>0O 
b 
10. F(s) = ————_.., 
0 (s) G-a s>a 
ll. Fs) = — 
(S) (s—af +b?’ din 
1 
12. F(s) = TI. s>a 
_ A 2as 
13. F(s- rap np s>0 
n! 
14. F(s) = (say s>a 
2a(3s? — a?) 
15. Fo) = ed s>0 
16. F(= EZ 
s 
17. F(= E 
52 
m =s —2s 
18. F(9- 1— 2e -te 
s 


19. Converges 
20. Converges 
21. Diverges 
23. d. F(3/2) = J/z/2; (11/2) = 9454/7/32 
Section 6.2, page 255 
1. f= ; sin(2t) 
2. f(t) =2re 


(-1)” ey 
2 


Answers to Problems 


fO= ze - ze 

f(t) = 2e™ cos(2r) 

f(t) = 2 cosh(2n) — : sinh(2t) 

f(t) = 3 — 2 sin(2t) + 5 cos(2r) 

f(t) = —2e~*' cost + 5e? sint 

y= Le +4e~*!) 

y = 2e* — e” 

y=e'sint 

y 22e! cos( 37) = (2/v3)e sin( V57) 


y = 2e™ cos(2t) + xe sin(2t) 
2 


y=tť -Pe + zre 
y =cosht 
y = ((a — 5) cos(at) + cos(20)) /(o* — 4) 


y= ze — e cost + 7e! sin f) 


TS 


Kj l-e 
Y =. a 
Wa qug 
1—-e* 
Y = 
= 344) 
1 22e + e725 
ppc ie 
ead) 


F(s) = 1/(s — a)? 

F(s) = 2b(3s? — b*)/(s* +b’? 
F(s) =n!/(s — ay"! 

F(s) = 2b(s — a)/[(s — a +’? 


. a Yli+s*Y=s 


b. sY" + 2sY' — (s? + a(a + 1)Y =-1 


Section 6.3, page 262 


b. f(t) = —2u3(t) + 4us(t) — u(t) 

b. f(t) = 1 —2u,() + 2uy(t) — 2u(0) + uzt) 

b. f(0) = 1- u(0[e- 72 — 1] 

b. f() = t u()Q — 0 + us(06 - 0 -u(( — D 
F(s) = 2e/s$ 


—zs —2zs 
F(s = = - 0 + zs) 
S S 


F(s) = Le + 2675 — 6e) 
F(s) = s? (Q1 — s)e?5 — (1 + se’) 


. ft Be 

. f(-2 puo (e? -e 0») 

. f(0 = 2uy(0)e'? cos(t — 2) 

. f) =u, + u(t) — u3(t) — ug) 
. ft) 20D" 

. f se? cost 


. f= ceP(enh =1) 


F(s) = eo = e). s>0 


F(s) =s! (1 —e™ +e -— e), s>0 


ie =} Şener I. Bley 
sa loe 
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586 Answers to Problems 


25. LFO = + 1/s -, s>0 Section 6.5, page 273 
H js Ag 1l. a. y 2e cost + e^ sint + u,(t)e -? sin(t — z) 
26. E£(f() = ————, s»0 1 : 1 i 
Ut) . d E e=) s 2a y= gus sin(2(t — )) — gas sin(2(t — 27)) 
2 -s — ol. l _o-2(t-5) 4. p-(t-5) 
27. LFO) = 1-(1+s)e sm 3.a. y - z + ze + u5(t)( e +e ) + 
(1 — e) uso (5 + pe em = ee) 
28. cyo t... s>0 4. a. y= Linde eres Le- cos(4/21) + 
(1 + s2)(1 — e-75) i 4 
29. a. £(f(0) 25s1(1 e), s»0 — us, (tet) sin( V24 = 3n)) 
b. L(g} 25200 e), s»0 2 ' 
c. Lh} = E: -e?y, s>0 5. a. y= sint + uy,(t) sin(t — 27) 
30. b. cip] = —e* m 6. a. y zu, sint- 2/4) 
XT J T» 8. y= cosit snpele*emwe3etda 


+ us p(t)e € sin(t — 2/2) 
8. a. y 2u,(D(sinh(t — 1) — sin(t — 1))/2 
9. a. -eT/45(t-5-T), T=8a/V15 


Section 6.4, page 268 


1. b. y= 1 -— cosż + sint — u3, (t)(1 + cos f) 


; 1 : 
2. b. ye e^'sint-- zu,(t)(1 + e7? cos t + e 7? sint) 
2 10. a. y= ai u (te 00/4 »( VIS. D) 
-Lup O — e 29) cos t — g7 (727) sin t) v15 
2 : b. ti £2.3613, y, & 0.71153 
3. b. y= =(1 -u (D) sint — sin(27) 
7 i il C. y= a u (eD sin e D) 
4. b. v2 4+ te- e- + 12-20-10) _ - (1-10) 
Be yt Mn: ZU s ee) 5224569, y, = 0.83351 
5b. y= ait + fi 1 AG cost- 12 sin f) d. t 21*2/2225708 y =1 
a a 1l. a. k, = 2.8108 
+ u.s +24, Fortra reost 16 sin?) b. h = 2.395 
6. b. y= h(t) + u,(OR( — 2), = 
h(t) = Ba cos t + sin t + 4e? cos t + e7'/? sin f) 12. a. $4) = zg se ORG 4 +k) - Wax QR — 4 — E), 
17 h(t) = 1 — cost 
7. b. y= ug (4 a cos(2t — 2n)) b. bolt) = ug) sint — 4) 
1 : 4 C. Yes 
= rums See = 20 
iss 5 ges 6n)) 13. b. y= Yu, (f sin(t — kr) 
8. b. yz h(t) - u,C)h( — 7), k= 
20 
h(t) = (3 — 4 cos t + cos(20)/12 14. b. y= Y (- D**'u, (D sin(t — kx) 
h 4 = m 
9, f= z€ — toU (Ð — us, 4 0) + hus ua) jo 
h 15. b. y- y (Dsin(t — (2k — 1 
10. g(0- zu = (u0 — O) y p Uy 4), (t) Sin(t — ( )z) 
h 40 
-q-h- 24) (wu - ssa) 16. b. y= 2 (D uuu) sint — 1016/4) 
3 5 d 


21.2 V us 3V7, BIN E 
h(t) — 184 e a( 0 -ge e 3 t ? (5, nce 


d. k=2.51 
e. 7 = 25.6773 15 

"nb 18. b. y= 20 35 di oio, tno m 
b. 


= (us(t)h(t — 5) — us Hh — 5 — k))/k, 4 js 

1 i 1. = X sin Pq- (2k — 1)z) 
A(t) = —t — — sin(2t) 

4 8 


m Section 6.6, page 279 
13. b. y-1- cost 2 VED yO — cos(t — kz)) "SE 
k=1 


: . l,. . . 
3. sint * sint = jin: — tcosf) is negative when t = 2z, for 


15. b. ye 1—cost - Y (- Du, (1 — cos(r — kz) example. : 
e 4. F(s) = ——— 
s?(s? + 4) 
17. a. y=1 = cost +2 Y (Du yO — cosl — 11k/4)) 5. F= 1 


. F(s) = —W 
k=l (s + 1)(s? + 1) 


6. F(s)= 


s 
(s? + 1 


7. f= ERES sinc dc 
6 Jo 
t 
8. f= | e *-? cos(2r)dt 
0 


9. f= : f (t — r)e™™? sin(2r)dc 
0 


Tim 4 1) P(n+ 1) 


1 
10. e f wa- du = Ton tn+d 


$ 
11. y= L sin(ot) + - 4 sin(@(t — 7))g(r) dr 


[4 
12. y- ; i; e 9? sinQ(r — c))g(r) dt 
—t/2 


13. y 2e"? cost- xe sint 


t 
+ I e «n2 sin(t — ad = U,(T)) dt 
0 


t 
14. y=2et-—e 7 + ri jer" - ere) cos(at) dt 
0 


15. y= $ cos f — ; cos(2f) 


+ : li (2 sin(t — 7) — sin(t — 1))) g(t) dr 
0 


_ F(s) 
16. 9) = Lx 
17. a. $(0-2 HE sin(2t) — 2 sin f) 
18. a. (ft) = cost 
b. $"() +o — 0, $(0) = 1, Q'(0) =0 
19. a. AA = (1—2t - ?)e" 
b. p'O +2) + PA 22e*, d(0-1 4'(02-3 
20. a. $(0 = lj lignes v3, + +e sin v3, 
3 3 2 V3 
b. g"O+eO=0, 60)=0, #O)=0, Pd" O)=1 
21. a. (t) = cost 
b. Qr) — $(r) — 0, $0) = 1, Q'(0) = 0, 
$"(0--1 $"(0-0 
Chapter 7 
Section 7.1, page 284 
L xj 2x, x = —2x, — 0.5% 
2: x =X, X) --(1-025t?)x, — t^lx, 
3. x =X, X) = X3, X) = X4, x, =x] 
4. x =X, X = —4x, — 0.25x, + 2 cos(31), x,(0021 
xy(0) = -2 
5. x = Xp, X) = —q(tx, — px + g(t), 
xı(0)= uy  x,(00— Uy 


6. c. x, 2c,€ + ce! 
= ^ —3t 
d. x; =cye*-— ce 


"Ho 


7. a. x x -2x, 20 
11 2 11 4 
b. x = =e” — =e", yy = =e” — =e 
mmg 3 2778 3 
C. Graph is asymptotic to the line x, = 2x, in the first quadrant. 
8. a. x + 4x, =0 
b. x, = 3 cos(2t) + 4 sin(2t), 
c. Graph is a circle, center at origin, radius 5, traversed 


clockwise. 


X, = —3sin(2t) + 4 cos(2t) 


Answers to Problems 


9. a. x! +x) +4.25x, =0 
b. x, = —2e7*/? cos(2t) + 2e~/? sin(21), 
X= 2e7!? cos(2t) + 2e7'/? sin(2f) 
c. Graphis a clockwise spiral, approaching the origin. 
10. LRCI'" +Li'+RI=0 
y, =Yy gmjy,--(k + hy)y + oy, + FiO, 


my, = ky, — (ky + k)y, + FLO 


15. y =y 


3 1 3 
19. a. g^ ==- — — Q;, 0) 2 25, 
Q, 75-19 *i;9» 20) 
Q1 =3+ 1o, — lo, Q,(0) = 15 
2 10 $7 
b. OF =42, QE =36 
1 3 
e oxp--4utagge +10) = —17, 
1 1 
x = ou 7 5% x (0) = —21 
1 1 4 
20. a. oA = 3q; = 7521 + Too 2 OQ, (0) — p 
1 3 
Q =+ 392! = 1009» Q,(0) = Q9 
b. Qf —6(94, +), QË = 20Bq, +242) 
c. No 
10 20 


Section 7.2, page 293 


6 —6 3 
l. a. |5 9 -2 
2 3 8 
-15 6 -12 
b. 7 —18 -1 
—26 -3 —5 
6 -12 3 
c. |4 3 7 
9 12 0 
—8 -9 11 
d. | 14 12 —5 
5 —8 5 
1-i —] -- 2i 
2 a a Dn 
b 344i 6i 
* (1 4 6i 6 — 5i 
"m 7 «5i 
` 2+i 7+2i 
8+7i 4-—4i 
d. Du a) 
-2 1 
3. a. 1 0 - 
2 -3 
1 3 -2 
b. |2 -1 1 
3 -1 0 
-1 4 0 
c,d.} 3 -1 0 
5 -4 1 
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588 Answers to Problems 


4 3 —2i 2-i Section 7.3, page 303 
2 : 1+i —2 t 3i 1 7 1 
lL x =- =>, x«--- 
b 342i 1-i 3 3 3 
Nos; -2-—3i 2. No solution 
" 342i 24i 3. xj 2—ce x,-ccl x =c, where c is arbitrary 
Mi -2 — 3i 4. Xx =G X =c, X= —c, where c is arbitrary 
7 -11 -3 5. x20 x. =0, x20 
5. a. P = = 6. Linearly independent 
7. x» — 5x2 2x0 =0 
5 0 -i 
b. 7 4 8. Linearly independent 
-1 1 4 9, xD 4x) -x9?-0 
6 -8 —11 11. 3x) — 6xOX(r) + x (r) = 0 
c | 9 15 6 12. Linearly independent 
=) -1 2 
1 1 
= 1) i = 2) e 
7. a. 4i 14. 4,22, x»» (3) mn = (1) 
b. 12-8: 
i 15. A, 21-21 20 = ( S À;21-2i x»- ( s 
€ 242i 1-i Ici 
d. 16 16. 4 2-3, x? = (1) h% =-l1, x® = a 


17. 


) aene Ca 


0 
ES 18. 4 =1, x® =|-3|; A421-«2ix?- ] 
-i 


Ne 
Il 
N 
». 
Il 
Lom 
N D 
me ù% 


=3 ; 
10. |- 3 -1 
2 -1 0 1 -2 
1l. Singular 19. A,=1, x=] Of 422 x=] 1 
` -1 
1 1 1 
c a = 0 
: 1 i %=3, x? 2| 1 
12. | 0 = zum en 
2 4 
1 2 2 
ý 2 20. 421,xP2[|-2| a, =2, x2=|1|; 
13. Singular -1 2 
1 
"I Àj - 1 xO =| 2 
14. E: 
1111 
0101 
Section 7.4, page 308 
Te | Set 10e? : i 
16. a. |-e Tet 22? i l ae Æ "i 
8e! 0 -e 1. d. x()- 5x” + jx? si 3 , 
iat 3t 
22-2438 14de%— ef 38 426 — eM 2 at 
b. | 4?'—1—3e* 2-42e? + ef 6e%4 ef e e. 2. 
—2e —3 + 6 —14 6e?! — 2e! —3e9 + 3e! — 2e" : -le y ber 
do pi ret 2. d. x) = - 2x0 + 2x = : ! 
-3 2 
c 2e! —e™ —222t 5° t4 3° d 
—e! —3e*t 4e” e. 5e™. 
1 1 8 cos t — 8 sint 
Loo» =(e + 1) 3. d. x(t) = =x — 2xO = 
2 3 5 2cos t — 3 sint 
d. (e- D| 2 el —2(e- 1) e. —5. 
4. d. x) = Oe ( ) 
-1 3e! e+1 2 2 


e. 2. 


5. d. x = qx" +x” = = j 
t 3r! 
1 +3 
e. 2. 
(0 4 9x0 = (7€, 
6. d. x(r) = 2x? + 0x? = apt 
e. —3t. 
8. c. W= eew( [os pst) 
12. a. W() 2? 
b. x(? and x® are linearly independent at each point except 
t = 0; they are linearly independent on every interval. 
C. At least one coefficient must be discontinuous at t = 0. 
0 1 
P. 
ex e M 
13. a. W(t) = t(t — 2)e' 


b. x and x® are linearly independent at each point except t = 0 
and t = 2; they are linearly independent on every interval. 

C. There must be at least one discontinuous coefficient at t = 0 
and t = 2. 


0 1 
d. x’=| 2-2 rĉ-2 )x 
2-2t 2?-2t 


Section 7.5, page 318 


1. 


10. 


11. 


12. 


14. 


15. 


; e^ + c4| 1 fe 


1 
1 [e^ +c, apes 4 
2 
1 
2 
1 1 1 
—4 le! + co] -1 [e + c,| 2 [e 
-1 -1 1 


16. 
21. 


22. 


23. 


Answers to Problems 


3 D s 
E60 | +c 2 t 


a x =X, X), = —(c/a)x, — (b/a)x, 
2.55 3 -t/20 29 1 —t/4 
a x= 8 E e + 8-2 e 
c. T274.9 
a. x=c, an (- 2+V2)1/2 4 tc i 


(enm 


Section 7.6, page 327 


1. 


C4e 


13. 


14. 


— 2 cos(2t) me —2 sin(2t) 
dm sin2n) T ? cos(2f) 
is oe Scost P 5sint 
n 2cost+sint) ^ V —cost 4-2sint 
— cos f -— sint 
E 2cost+ sint 2 —cost+2sint 
CR —2 cos 3t E —2 sin 3t 
B cos 3t + 3 sin3t C2 | sin3t — 3cos3t 
2 0 0 
x = c| —3 Je’ *c;e|cos20 | + c4e'| sino) 
2 sin(21) — cos(2t) 
2 —V2sin(/21) 
x=c,|-2 e? + coe" cos( V f) + 


1 — cos( V2 9) — V2 sin( V2 f) 
vV2cos( 2 


Fi sin( V2 r) 


V2 cos(/2 t) - sin(/2 r) 
=e ee — 3sin ') 
x=e ; 
cost — sint 
cost —5sint 
—2cost — 3 sint 


a. r "t 
-r=-7t 

a T. 

ee oe 

a r=asti 

b. a=0 

a. r= (a+ a? -20)/2 
b. «æ = —y20, 0, 20 
a. r2 isV5a 

b. a 20, 25/12 

a. r=-l+,/-a@ 

b. 2-10 
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590 Answers to Problems 


15. 


16. 


N 


17. 


18. 
19. 


20. 


21. 


23. 


24. 


25. 


a. r=-—l + y25 + 8a 


Section 7.7, page 336 


b. a = —25/8, -3 -Zent 4 Set Zent Zen 
cos( V2 In r) sin( V2In r) 1. b. e()- 2 2 4 1 
x= or + Cot! aa gt + Ze?” le- le 
v2sin( ¥2Inr) -V2cos( V2nr) 3 3 3 
1 —t/2 1 -t —t/2 =t 
5cos(In r) 5 sin(Int) 35 Fae E eme 
X-c i t c5 i 2. b. t) = 
2 cos(In f) + sin(In f) — cos(In f) + 2 sin(In £) 1 ete li lap " lj 
1 1 4 4 2 2 
a. r-—-—-zrti,-- : : 
4^" 4 cost+2sint —5sint 
1 1 3. b. O®H= 
a. xin žb 10 sin t cost —2sint 
: e™ cos(2t) —2e" sin(2t) 
b. o —— or) bart sin(t/2) à. BY x 
V 4 sin(t/2) —4 cos(t/2) = sin(21) et cos(2r) 
3. 
c. Usec 22, cy = <q ewer to part (b). loi " 5 i ly E lou 
d. lim /(r) 2 lim V(f) 20; no 5. b. =| 2 2° 2 2 
teow 100 3 3 3 1 
— legt + x ett 2g. Le 
b I [| cost „| snt 2 2 2 
d 2 EIE — cos f — siní TIUS —sint+cost 6. b e) (“ cost + 2e™ sint —e™ sint ) 
Í) 
C. Usec, = 2and c; = 3 in answer to part (b). 3e "sint e™ cost — 2e™' sint 
d. lim /(r) = lim V(r) 20; no mud a -— 
100 100 —2e ^ + 3e7 —e" tte —e ^ Ee 
b. r= V k/m Fe 4e~ + 2e Že = x + 13 et Fe _ x + i et 
d. |r| is the natural frequency. 7 3 7 2 T 7 2 1 
3 3 Qe" 2e^ 2e 167 _ Dm D e? QE _ ae" _ p 
2 _ D. mox 2). 
€ n-2-l c= (3) r= —4, m= (1) 8. b. o0) = 
1 t 1 —2t 1 3t 1 t 1 —2t 1 t —2t 1 3t 
d. x, = 3c, cost + 3c, sint + 3c3 cos2r) + 3c, sint), a a7 Tas uk d ce we 
xj = 2c, cos t + 2c; sint — 4c, cos(2r) — 4c, sin(27) E se! " Qe Fë $e a Qe EU 
e. x| = —3c, sint + 3c; cos t — 6c, sin(2t) + 6c, cos(21), 02 $i s.1- "EP 1 e ds 
: : e eT + e e e e te? +e 
x, = —2c, sin t + 2c, cos t + 8c, sin(21) — 8c, cos(2t) 6 3 2 3 3 2 2 
0 0 1 0 9. x= (ie cos(2t) + (55): sin(2f) 
À 0 0 0 1 
a. = ; 7 
= =, qw 14. c. ZI "o jam 
-ou 
9/4 -13/4 0 0 a oy 9 
1 1 
: 1 1 Section 7.8, page 343 
b. r-i EV = if eo M: i 
2 t 2 £ 1 t 
i -i loa. x=c, Je teat iE (9 Je 
4 4 0 
1 1 
-3 -3 2.a x=a(3) +a (3)e- 1 
r= 28 ®=| 10i | mu =—-26 €® =] -10i R z 2 
15 15 
753! 5?! 3. a. x=c, (De «e (Dv (2e) 
cost sint 
cost sint -3 0 0 l 
€» 49 LL cel © 2 coer 4. x=c,| 4le' *c; 1ļe”+c|| 1 \te+] 0 te” 
—sint cos f 2 -1 -1 | 
4 5 Asi 5 1 1 0 
cos( 51) sin( $1) 5. x-c|1|?' *ce| Ole“ +c] lle 
1 -1 -1 
-3 cos( 31) -3 sin( Žr) 
+ C3 5 \ 11% 5 6. a. x= C ii a et 
—10sin(>1) 10( 51) T 
2 2 3 1 
. (5 5 Taa XS e'-6 te" 
= sin(>1) -> cos( =r) -1 1 
2 2 2 
10 8. a x=2())4+14( Pi 
e ġ= 7 c& =0, & = >=, C4 =0 period = 4z. nis f^ 1 


10. a 


11. 


12. 


14. 


ms 


17. 


18. 


0 0 
ed 1 te’ + 3| 0 |e” 
—33 —6 1 


+ (Je Qe (Q0) 


TL 


.é--(k- ky EY, k, +k, #0 
a. xO 2| 1 te 
-1 
0 1 
c. x?(pn =] 1 Ite? +] 1 fe?" 
-1 0 
0), 1 2 
d. xO@=] 1 z” +] 1 |e” +10 le” 
-1 0 3 
0 1 t+2 
e Wy=e*] 1 t-1 — P/24t 
-1 -t -P[243 
(0) 1 -3 3 2 
f. T=] 1 1 O| T=] 3 —2 -2| 
-1 0 -1 1 1 
2 0 
J=|0 2 1 
0 0 2 
1 0 
a. x(t) =10 je, xO(nz| 2 Ie! 
2 —3 
-2 0 
b. x9) = | —4 Ite’ + |0 fe 
2 1 
10 -2t ]-2 -2t 
e. V(2e|0 2 -4t jor e'|l0—4 —4t 
2-3 2t+1 2 2 2t+1 
1 -2 0 1 -1/2 0 
f T-|o -4 o| T!=| 0 -1/4 of 
2 2 1 —2 3/2 1 
0 0 
J=|0 1 
0 0 1 


2 2) o, B 33N ., 
o gh F = io wp ! 


€ 
0 
d. x = expJox? 
1 
. exp(JA = e*10 
0 
1 
. exp(Jt) = e^'| 0 
0 


c. exp(Jt) = e^ 


Section 7.9, page 351 


2cost 4 sint 


Lm 8 


B xc ZEN 


2cost + sint 


js 


js 


Answers to Problems 


5sint 
— cost +2sint 


5sint " 
— cost + 2sinf 


(s) - CP ys (2) cost 


e? cos( £) 
2 


1 
13. a. v-«(' «(z 
6 5 2e! 
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592 Answers to Problems 


0 1 ay Oe A 
0 e _ 2 
14. a. x(t) = _t+2 t+2 X + (2) 16. En+1 (1 151,0(7, ) a /4 
2 t _ = S ANELZN 
15 ) 0 1 0 17. nel = (1 + (in + a(i) ) ) /4 
. a X(f)— x+ 4 
-q()  -pQ) a(t) 18. ea = (2- (4(7,) +27, ) exp( -7,6(7,) ) 
17. Gla, -a,)-4)/6=c,, (a, +a,+3)/2=c, = exp( - 21,0(1,)) i2 
19.a 12,10, 12 

b d$(t-1-4(1/5z)sin(5zt) 
Chapter 8 d 1.1, 1.1, 1.0, 1.0 

e h<1/vV/50z = 0.08 


Section 8.1, page 361 : 
! 2n. en -ler(i)e 


3. a. 1.59980, 1.29288, 1.07242, 0.930175 
b. 1.61124, 1.31361, 1.10012, 0.962552 22. a. 1.55, 2.34, 346, 5.07 
c. 1.64337, 1.37164, 1.17763, 1.05334 b. 1.20, 1.39, 1.57, 1.74 
d. 1.63301, 1.35295, 1.15267, 1.02407 c. 1.20, 1.42, 1.65, 1.90 
4. a. 1.2025, 1.41603, 1.64289, 1.88590 23. a. 0 
b. 1.20388, 1.41936, 1.64896, 1.89572 b. 60 
c. 1.20864, 1.43104, 1.67042, 1.93076 c. -92.16 
d. 1.20693, 1.42683, 1.66265, 1.91802 24. 0.224 4 0.225 
5. a. 1.10244, 1.21426, 1.33484, 1.46399 Section 8.2, page 366 
b. 1.10365, 1.21656, 1.33817, 1.46832 
c. 1.10720, 1.22333, 1.34797, 148110 2. a. 1.19512, 1.38120, 1.55909, 1.72956 
d. 1.10603, 1.22110, 1.34473, 1.47688 b. 1.19515, 1.38125, 1.55916, 1.72965 
6. a. 0.509239, 0.522187, 0.539023, 0.559936 Ce: 413316, 1138120; 9918, opea 
b. 0.509701, 0.523155, 0.540550, 0.562089 3. a. 1.20526, 1.42273, 1.65511, 1.90570 
c. 0.511127, 0.526155, 0.545306, 0.568822 b. 1.20533, 1.42290, 1.65542, 1.90621 
d. 0.510645, 0.525138, 0.543690, 0.566529 c. 1.20534, 1.42294, 1.65550, 1.90634 
7. a. —0.920498, —0.857538, —0.808030, —0.770038 4. a. 1.10483, 1.21882, 1.34146, 1.47263 
b. —0.922575, —0.860923, —0.812300, —0.774965 b. 1.10484, 1.21884, 1.34147, 1.47262 
c. —0.928059, —0.870054, —0.824021, —0.788686 c. 1.10484, 1.21884, 1.34147, 1.47262 
d. —0.926341, —0.867163, —0.820279, —0.784275 5. a. 0.510164, 0.524126, 0.542083, 0.564251 
8. a. 2.90330, 7.53999, 19.4292, 50.5614 b. 0.510168, 0.524135, 0.542100, 0.564277 
b. 2.93506, 7.70957, 20.1081, 52.9779 c. 0.510169, 0.524137, 0.542104, 0.564284 
c. 3.03951, 8.28137, 22.4562, 61.5496 6. a. —0.924650, —0.864338, —0.816642, —0.780008 
d. 3.00306, 8.07933, 21.6163, 58.4462 b. —0.924550, —0.864177, —0.816442, —0.779781 
9. a. 0.891830, 1.25225, 2.37818, 4.07257 C. —0.924525, —0.864138, —0.816393, —0.779725 
b. 0.908902, 1.26872, 2.39336, 4.08799 7. a. 2.96719, 7.88313, 20.8114, 55.5106 
c. 0.958565, 1.31786, 2.43924, 4.13474 b. 2.96800, 7.88755, 20.8294, 55.5758 
d. 0.942261, 1.30153, 2.42389, 4.11908 8. a. 0.926139, 1.28558, 2.40898, 4.10386 
10. a. 1.60729, 2.46830, 3.72167, 5.45963 b. 0.925815, 1.28525, 2.40869, 4.10359 
b. 1.60996, 2.47460, 3.73356, 5.47774 9. a. 3.96217, 5.10887, 6.43134, 7.92332 
: 1.61792, 2.49356, 3.76940, 5.53223 b. 3.96218, 5.10889, 6.43138, 7.92337 
BOLE, ZARAI; fares, D1107 10. a. 1.61263, 2.48097, 3.74556, 5.49595 
11. a. —1.45865, —0.217545, 1.05715, 1.41487 b. 1.61263, 2.48092, 3.74550, 5.49589 
b. —145322, —0.180813, 1.05903, 1.41244 11. a. 0.590897, 0.799950, 1.16653, 1.74969 
c. —1.43600, —0.0681657, 1.06489, 1.40575 h.:0550906. 0090928. L16663. 172900 
d. —1.4190, —0.105737, 1.06290, 1.40789 T ibt e oi aio 
12. a. 0.587987, 0.791589, 1.14743, 1.70973 Tu Ke 2 oe 
b. 0.589440, 0.795758, 1.15693, 1.72955 eo aa 
c. 0.593901, 0.808716, 1.18687, 1.79291 an 
, . > c. « 0.00193389 
d. 0.592396, 0.804319, 1.17664, 1.77111 lel < 


14. a. e,,, = QI? /3)expQt,), 
< 4.926041? on 


14. e, = 2A) -D, les (: +2 max wor. O<r<l, 
Osts1 b. |e,| < 0.000814269 


41 = (4h? /3) exp(2z,), 


-OGI 2 le,41| € 9.852070? on 
15. en = ÊG) -t)^, lel (1 + 2 max DO) ; a Be i 


e,,4, = 2e%"h?, le, < 0.024,  |e4| < 0.045 b. |e,| € 0.00162854 


13. 1.595, 2.4636 


e1 eI, le €0.012, [e4] < 0.022 15 


16. 
17. 
18. 
19. 
21. 
22. 
23. 
24. 


h = 0.036 
h = 0.023 
h = 0.081 
h = 0.117 
1.19512, 1.38120, 1.55909, 1.72956 
1.62268, 1.33435, 1.12789, 0.995130 
1.20526, 1.42273, 1.65511, 1.90570 
1.10485, 1.21886, 1.34149, 1.47264 


Section 8.3, page 370 


2 


10. 


11. 


£P WE wp SP PP we Eg Tp SS WE 


1.19516, 1.38127, 1.55918, 1.72968 
1.19516, 1.38127, 1.55918, 1.72968 


1.62231, 1.33362, 1.12686, 0.993839 
1.62230, 1.33362, 1.12685, 0.993826 


1.10484, 1.21884, 1.34147, 1.47262 
1.10484, 1.21884, 1.34147, 1.47262 


0.510170, 0.524138, 0.542105, 0.564286 
0.520169, 0.524138, 0.542105, 0.564286 


—0.924517, —0.864125, —0.816377, —0.779706 
—0.924517, —0.864125, —0.816377, —0.779706 


2.96825, 7.88889, 20.8349, 55.5957 
2.96828, 7.88904, 20.8355, 55.5980 


0.925725, 1.28516, 2.40860, 4.10350 
0.925711, 1.28515, 2.40860, 4.10350 


3.96219, 5.10890, 6.43139, 7.92338 
3.96219, 5.10890, 6.43139, 7.92338 


1.61262, 2.48091, 3.74548, 5.49587 
1.61262, 2.48091, 3.74548, 5.49587 


0.590909, 0.800000, 1.166667, 1.75000 
0.590909, 0.800000, 1.166667, 1.75000 


Section 8.4, page 375 


1. 


COR CSP OTP Ppp STP Ose oe 


1.7296801, 1.8934697 
1.7296802, 1.8934698 
1.7296805, 1.8934711 


0.993852, 0.925764 
0.993846, 0.925746 
0.993869, 0.925837 


1.4726173, 1.6126215 
1.4726189, 1.6126231 
1.4726199, 1.6126256 


0.56428577, 0.59090918 
0.56428581, 0.59090923 
0.56428588, 0.59090952 


—0.779693, —0.753135 
—0.779692, —0.753137 
—0.779680, —0.753089 


2.96828, 7.88907, 20.8356, 55.5984 
2.96829, 7.88909, 20.8357, 55.5986 
2.96831, 7.88926, 20.8364, 55.6015 


0.9257133, 1.285148, 2.408595, 4.103495 
0.9257124, 1.285148, 2.408595, 4.103495 
0.9257248, 1.285158, 2.408594, 4.103493 


10. 
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3.962186, 5.108903, 6.431390, 7.923385 
3.962186, 5.108903, 6.431390, 7.923385 
3.962186, 5.108903, 6.431390, 7.923385 


1.612622, 2.480909, 3.745479, 5.495872 
1.612622, 2.480909, 3.745479, 5.495873 
1.612623, 2.480905, 3.745473, 5.495869 


0.5909091, 0.8000000, 1.166667, 1.750000 
0.5909091, 0.8000000, 1.166667, 1.750000 
0.5909092, 0.8000002, 1.166667, 1.750001 


oop OSPR ore 


Section 8.5, page 378 


1. 


6. 


a. 1.26, 0.76; 1.7714, 1.4824; 2.58991, 2.3703; 

3.82374, 3.60413; 5.64246, 5.38885 

b. 1.32493, 0.758933; 1.93679, 1.57919; 2.93414, 
2.66099; 4.48318, 4.22639; 6.84236, 6.56452 

c. 1.32489, 0.759516; 1.9369, 1.57999; 2.93459, 2.66201; 
4.48422, 4.227784; 6.8444, 6.56684 


a. 0.582, 1.18; 0.117969, 1.27344; —0.336912, 1.27382; 
—0.730007, 1.18572; —1.02134, 1.02371 

b. 0.568451, 1.15775; 0.109776, 1.22556; —0.32208, 
1.20347; —0.681296, 1.10162;  —0.937852, 0.937852 

C. 0.56845, 1.15775; 0.109773, 1.22557; —0.322081, 
1.20347; —0.681291, 1.10161; 0.937841, 0.93784 


a. —0.198, 0.618; —0.378796, 0.28329; | —0.51932, 
—0.0321025; —0.594324, —0.326801; | —0.588278, —0.57545 
b. —0.196904, 0.630936; |—0.372643, 0.298888; 
—0.501302, —0.0111429;  —0.561270, —0.288943; 
—0.547053, —0.508303 

C. —0.196935, 0.630939; —0.372687, 0.298866; 
—0.501345, —0.0112184; .—0.561292, —0.28907; 
—0.547031, —0.508427 


a. 2.96225, 1.34538; 2.34119, 1.67121; 1.90236, 1.97158; 
1.56602, 2.23895; 1.29768, 2.46732 

b. 3.06339, 1.34858; 2.44497, 1.68638; 1.9911, 2.00036; 
1.63818, 2.27981; 1.3555, 2.5175 

c. 3.06314, 1.34899; 2.44465, 1.68699; 1.99075, 2.00107; 
1.63781, 2.28057; 1.35514, 2.51827 


a. 1.42386, 2.18957; 1.82234, 2.36791; 2.21728, 2.53329; 
2.61118, 2.68763; 2.9955, 2.83354 
b. 1.41513, 2.18699; 1.81208, 2.36233; 2.20635, 2.5258; 
2.59826, 2.6794; 2.97806, 2.82487 
C. 1.41513, 2.18699; 1.81209, 2.36233; 2.20635, 2.52581; 
2.59826, 2.67941; 2.97806, 2.82488 


For h = 0.05 and 0.025: x = 10.227, y = —4.9294; these results 


agree with the exact solution to five digits. 


7. 
8. 


1.543, 0.0707503; 1.14743, —1.3885 
1.99521, —0.662442 


Section 8.6, page 386 


1. 
2. 
3. 
4. 


b. p(t) — (®© = 0.001e! — oo as t — co 

b. pO 21n(e//2 —e)); hO = In(1/ü — 9) 

a., b. h = 0.00025 is sufficient. c. h = 0.005 is sufficient. 
y= 4e-10t + ?/4 

Runge - Kutta is stable for h = 0.25 but unstable for h = 0.3. 
h = 5/13 = 0.384615 is small enough. 

y=t 

yz? 


Pp pepp 
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Chapter 9 


Section 9.1, page 397 
Loa He EP SH n-5f4 5-00 
b. saddle point, unstable 
a. 7,22, = (0,375, r 24, 49 = 1T 
b. node, unstable 
a eL! = (1,3); 2H 1,2" = (1, D7 
b. saddle point, unstable 
a nene = (1,1) 
b. improper node, asymptotically stable 
a. rpn =li é, ¿0 =(2+i,1)" 
b. spiral point, asymptotically stable 
6. a rpn =i; é, éO = Qi) 
b. center, stable 
a. ror = 12i; LE = (1,1 FDT 
b. spiral point, unstable 
a. r =-1,€9 = (1,0); n 21/4, 49 = (4, 3)" 
b. node, asymptotically stable 
a. rjr; = 23i; é, €% = (2, -1 x 3i)? 
b. center, stable 
a 
b 


10. a. r =r 5-1; £9 = (,07, ® = (0, 1)" 


. proper node, asymptotically stable 
11. Xo = l, yy = l; = v2, ry = — V2; saddle point, unstable 
12. xy =-2,y9=1; rom =-ltV2i; 
spiral point, asymptotically stable 
13. xo =y7/6,¥ =a/B; rir; = + fói; center, stable 
14. c? > 4km, node, asymptotically stable; c? = 4km, improper 
node, asymptotically stable; 
c? < 4km, spiral point, asymptotically stable 


Section 9.2, page 406 
1l. x24e,y = 2e 7, y 2 x? /8 
2. x 24e y 22e ,y 2 32x?; x 2 4e ^, y 0 


3. x= Vacos( abr). y = - Vb sin( abi); x Ja -yMb-l 


4. a.c. (- 
(0, 0), (proper) node, unstable 


S. a5 G (- 3/3, - : ) saddle point, unstable; ( 3/3, — : ) 
center, stable 


L 1), saddle point, unstable; 


6. a.,c. (0,0), node, unstable; 
(2, 0), node, asymptotically stable; 
(o. ;) saddle point, unstable; (—1, 3), node, asymptotically stable 


7. a8., C. (0,0), node, asymptotically stable; (1, — 1), saddle point, 
unstable; (1, —2), spiral point, asymptotically stable 
8. a.c. (0,0), spiral point, asymptotically stable; 
(1—- v0, 1+ 2), saddle point, unstable; 
d+ v2, 1- v2), saddle point, unstable 
9. a.,c. (0,0), saddle point, unstable; 
(2, 2), spiral point, asymptotically stable; 
(—1, —1), spiral point, asymptotically stable; 
(—2, 0), saddle point, unstable 
10. a., c. (0,0), saddle point, unstable; (—2, 2), node, unstable; 
(4, 4), spiral point, asymptotically stable 
11. a.,c. (0,0), saddle point, unstable; 
(2, 0), saddle point, unstable; 


(1, D), spiral point, asymptotically stable; 

(—2, —2), spiral point, asymptotically stable 

12. a.,c. (0,0), node, unstable; (1, 1), saddle point, unstable; 
(3, —1), spiral point, asymptotically stable 

13. a.,c. (0, 1), saddle point, unstable; 

(1, 1), node, asymptotically stable; (—2, 4), spiral point, unstable 
14. a. 42 - y? zc 


15. a. 42 +y =c 

16. a. (y- 22 (x - y) 2c 

17. a. arctan(y/x) — In Và y? =g 
18. a. 2y -3xy-2y zc 

19. a. y?/2-cosx 2 c 

20. a. x +y? -xt/12=c¢ 


Section 9.3, page 415 


1. linear and nonlinear: saddle point, unstable 


2. linear: center, stable; 
nonlinear: spiral point or center, indeterminate 


3. linear: improper node, unstable; 
nonlinear: node or spiral point, unstable 


4. a.,b.,c. (0,0); u 2 —2u + 2v, v! = 4u c 4v; r2 1 VIT; 
saddle point, unstable 
(—2,2); u’ = Au, v' = 6u + 6v; r= 4,6; node, unstable 
(4, 4; u’ 2 —6u + 6v, v! 2 —8u; r = —3 + y39i; spiral point, 
asymptotically stable 

5. a.b., c. (0,0); w =u v! 23v; r= 1,3; node, unstable 
(1,0); w = -—u-— v, v! 22v; r=-—l1,2; saddle point, unstable 


(0.3 su = "T - -Žu = 3v; p -Lh node, 


asymptotically stable 
(-1,2); w = u +v,v! = —2u — 4v; r = (-3 + V17)/2; saddle 
point, unstable 

6. a., b., c. (L1; w 2 =v, v! = 2u — 2v; r= —l +i; spiral 
point, asymptotically stable 
(1,1); w = -y,v 2 —2u — 2v; r 2 —l + y3; saddle point, 
unstable 

7. a.b., c. (0,0); u’ =—2u+4y, v/ = 2u+4v; r= (1x: /17/2; 
saddle point, unstable 


(2,1); w = —3u + 6v, v! = —4u; r = 3t lo spiral point, 
asymptotically stable 

(2, —2); w = —6v,v! = 2u, r = +V3i; center or spiral point, 
indeterminate 

(4,-2); u! = —8v,v/ = —2u — 4v; r = -1 + y5; saddle point, 
unstable 

8. a,b.,c. (0,0); uv =v; r=1,1; node or spiral point, 
unstable 

(-1,0); w = —u, v! = 2v; r2 —1,2; saddle point, unstable 


9. a., b., c. (0, +2nz),n = 0,1, 2,... 5 u! = y, v! 2-u; 
r = +i; center or spiral point, indeterminate 
(2, +(2n — Dz,n2 L2,3,..; W = -3y v = -u; r= +73; 


saddle point, unstable 

10. a., b., c. (0,0); u’ = u, v = v; r= 1,1; node or spiral 
point, unstable 

(1, 1); w 2u—2v, v = —2u +v; r 23,—1; saddle point, unstable 
1l. a,b.,c. 0,1); uw =-—u-—v,v' = u — 3v; r= -2, 2; 

node or spiral point, asymptotically stable 

(-L-1; u = u +v,v = u — 3v; r= -l+ V5; saddle point, 
unstable 


12. a.,b.,c. (0,0); w 2-2u-v,v'2u-v; 

r=(-3+ V3 i)/2; spiral point, asymptotically stable 
(—0.33076, 1.0924) and (0.33076, —1.0924); 

u’ = —3.5216u — 0.27735v, v’ = 0.27735u + 2.6895v; 

r = —3.5092, 2.6771; saddle point, unstable 

13. a., b., c. (00; wv =utyvw =-ut+v r=14i 
point, unstable 


spiral 


14. a, b.c. 2,2); u 2 -4wv = -ju dcus 


2 
r = (7 + N273)/4; saddle point, unstable 
(-2, —2); uw’ 24v, v = is — ls r = (—1 + N33)/4; saddle point, 


2 2 
unstable 
(-5.2); u = —4y,V = zu: r= +vV14i; center or spiral point, 
indeterminate 
(-5. -2); u! = 4, v! = -Zu r= +yŅy2i;, center or spiral point, 
indeterminate 


15. a, b, c. (0,0); u 22u-v v! 22u-4v; r = -1 + VT; 
saddle point, unstable 

(2,1); w = —3y, v! = 4u — 8v; r = —2, -6; node, asymptotically 
stable 

(-2,1); uw’ = 5yvy' = —4u, r = 225i; 
indeterminate 

(—2, -—4); u’ = 10u—5v, v = 6u; r=5+ 


18. b.,c. Refer to Table 9.3.1. 


center or spiral point, 


5i spiral point, unstable 


20. a. R=A, T 23.17 
b. R=A, T & 3.20, 3.35, 3.63, 4.17 
c ToaaA>0 
d. A=x 

21. a. v, = 4.00 

22. a. v, & 451 

27. a. dx/dt =y, dy/dt = —g(x) — coy 
b. The linear system is dx/dt — y, 


dy/dt = —g'(0)x — c(O)y. 
C. The eigenvalues satisfy r? + c(0)r + g'(0)=0. 


Section 9.4, page 426 


1. b.,c. (0,0); w = 2s y =2v; r= 5,2: node, unstable 
(0,2); u = ju v= -5" — 2v; r= > —2; saddle point, unstable 
(3.0): u = -2u — url = Ly; r= -4 i: saddle point, 


unstable 

(i 7 y i 4 27 21 7. 
=z); wW =-—-u-— =y, v =-—u- 2; 
25 2 5 20 5 

r = (—22 + N204)/20; node, asymptotically stable 


2. b,c. (0,0); w = Au, v=; r= > 2; node, unstable 
(0,4); w = =" v = -6u-2v; r= "m —2; node, asymptotically 
stable 
(2.0); u = = m 34 y = -— r= = = node, 

2 24 4 4 4 2 
asymptotically stable 
(L1); u’ =-u- Um v= -2u — n: r = (—3 + y 13)/4; 
saddle point, unstable 


3. b.c. (00; w =u, v = sy 
u 


(0,3); u = -5u v = 


asymptotically stable 


r= 1, z; node, unstable 


1 


Dipa ov r= -> node, 
2 2 


= 
3*9 


(1,0); u’ = —u — v, v! = T r=-l, 7 saddle point, unstable 
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4. b.c. 0,0); w =u, v = 2" r=1, >: node, unstable 


(0. >); u = Zu, y= Au v; r= 3: saddle point, 
unstable 
(1,0); u/2-u- js y= s r=-l, T; saddle point, unstable 


f 


Qo: Stee v Au- 3 eae 3/5 
node, asymptotically stable 


6. a. Critical points are x = 0, y = 0; x = €,/o,, y = 0; 
x=0, y= 6/0. 
x > 0, y > 6/0 as t > co; the redear survive. 
b. Same as part a except x > €j/6,, y > Oast — oo; the 
bluegill survive. 


7. a. X (B-yR)/( - io), Y = R- nB)/ — 172) 
b. X is reduced, Y is increased; yes, if B becomes less than 5 R, 
then x > 0 and y > Rast > œ. 


8. a. cje, — ae + 0: (0,0), (0, e/o»), (ei /o, 0); 
0,€5 — a5€, = 0: (0, 0), and all points on the line 
ox + ay =E 
b. 6,6; — a5€, > 0: (0, 0) is unstable node; 

(e; /o,, 0) is saddle point; 

(0, €2/02) is asymptotically stable node. 

©] € — a5€, < 0: (0, 0) is unstable node; 

(0, e2/02) is saddle point; 

(e; /o,, 0) is asymptotically stable node. 

C. (0,0) is unstable node; points on the line o,x + ay = e, are 
stable, nonisolated critical points. 


10. b. (0,0), saddle point; (0.15, 0), spiral point if y? « 1.11, node 
if y? > 1.11; (2, 0), saddle point 
c. y 2120 


J 3 3 J 3. 3 
11. b. (2- 4-28 à) (2+ 4- 5G 2j 


C. (L3)is an asymptotically stable node; 
(3, 3) is a saddle point 
d. a) = 8/3; critical point is (2, 4); 4 = 0, -1 


J 3 3 / 3. 3 
12. b. (2- 4-5% ja) (2+ 4— 5G, 2j 


c. (1, 3) is a saddle point; (3, 3) is an unstable spiral point 
d. aj = 8/3; critical point is (2, 4); 4 = 0,1 


13. b. ((3- v9 -4)/2 (324 - y9=4a)/2), 
((3+ v9=4a) /2, (3+2a + V9 -24)/2) 


c. (L3)isa saddle point; (2, 4) is an unstable spiral point 
d. ay = 9/4; critical point is (3/2, 15/4); A = 0,0 


14. b. ((3- v9=4a) 2, (3-2 - V9 =4a)/2), 
(3+ v9=4a)/2, (3+20+ V9 = 4a)/2) 


C. (1,3) is a center of the linear approximation and also of the 


nonlinear system; (2, 4) is a saddle point 
d. a = 9/4; critical point is (3/2, 15/4); 4 = 0,0 


15. b. P,(0,0), P (1,0) P3(0, a), P4y(2 — 2a, -1+4 2a). 
P, is in the first quadrant for 0.5 < a < 1. 
C. a =0; P} coincides with P}. a = 0.5; P4 coincides with P5. 
a = 1; P4 coincides with P3. 

1-—2x-y —x 

d. J= —0.5y a — 2y — 0.5x 
e. P, is an unstable node for a > 0. P, is an asymptotically 
stable node for 0 < a < 0.5 and a saddle point for a > 0.5. P} is 
a saddle point for 0 « a < 1 and an asymptotically stable node 
for a > 1. P4 is an asymptotically stable node for 0.5 < a < 1. 
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16. b. P,(0,0, P,(1,0), P4(0,0.75/a), 
P4((4a — 3)/(4a — 2), 1/(4a — 2)). 
P, is in the first quadrant for a > 0.75. 
C. a = 0.75; P, coincides with P4. 


1-2x—y “x 
d. J= ( —0.5y 0.75 — 2ay — - 


e. P, is an unstable node. P, is a saddle point. P4 is an 
asymptotically stable node for 0 « a « 0.75 and a saddle point 
for a > 0.75. P4 is an asymptotically stable node for a > 0.75. 


b. P,(0,0, P,(1,0), P4(0 a) P4(0.5,0.5). In addition, for 
a = l every point on the line x + y = 1 is a critical point. 
C. à = 0; P, coincides with P}. Also a = 1. 

| [1-2x-y =x 
d. J= —(2a — 1)y a — 2y — Qa — 1x 
e. P, is an unstable node for a > 0. P, and P; are saddle points 
for 0 < a < 1 and asymptotically stable nodes for a > 1. P, is 
an asymptotically stable spiral point for 
0 « a < 0.5, an asymptotically stable node for 0.5 < a « 1, and 
a saddle point for a > 1. 


17 


Section 9.5, page 433 


1. bc. (005 w = m v= ais 


ro 2 =! saddle point, 
2 2 2 2 


unstable 


(2.3): u = -iv v = 3u, r= +/3i/2; center or spiral point, 


indeterminate 

2. b., c. (0,0); wW = uv = i r= L-i saddle point, 

unstable 

(5 2) u = zi y -—-urcz- "m center or spiral point 
2 ; zd 4^ = Uy = t56 pP: p! , 

indeterminate 

3. b., c. (0,0); wW = uv = -iv r= L-i saddle point, 

unstable 

(2,0); u = —u — v, v! = s r=-l, T saddle point, unstable 


(5. >); u = -iu - D v= Du r = (—1 + V110/8; spiral 


point, asymptotically stable 
4. b, c. (0,0); u! = -uv = 
asymptotically stable 


(2.0): ul = 3e ya =-v; r=-l, 


(2,0); u’ = —3u — 5” v= 2^ r= -3, 7 saddle point, unstable 


(5. i) ul = -iu — x v= 3'5 r = (—3 + V391)/8; spiral 
point, asymptotically stable 


f. E node, 
2 


T saddle point, unstable 


5, as 

YA 

22 dB Xe. xoxo) RR A XR CAS NUN CX, e SUN 
LX Q3 Y 303 Wok v W0X $$ X0. SX 
Bee $0 4c No 03 X X 3 X0 X x NOM XX 
] oW WO UM UON Noo V NC OX Ne X NON. UNS ON 
b 3 X MX CX 078 X08 EN Ne, SS X S a 

Ree ea Be RR Re BR RON OR ON. AC EU, S 
LW XoX XO X 0X WX WX X XN A 
MN EE S Nie e RS S d dde e NE NS 
ho XN 05 We X XN CM NOM CS X ON ON, NUN NS OX 
to RON RR AN RR ON CN SS SAN 

JEN EL UE MM X C aA UE S TM a i t 
MO e MAS NS E. Do e A e IE. DM. c6 Ne Ds. e No eS 
N.N NRG, Sh NS UO NCC ON NON ONCOL S 
S SN AS ee Re NON UNO ONCOL SUN ALUNCUN 
VN SN WO WOONO WAND A a A 30S LUN UN 

OSF uw wq dog aa dg dy Jp ues dp b =O 
Eh gu acuta a ran n ru oc LIE n utro 
EP uM ub E 2 m e aes aT Ge M E cub cR T UP P NT 
ee ——— 
a —À ee el — — — — — — 

——————————————— 

0 0.5 1 1:5 2 X 


d. 


NE] 


1 1 1 1 
b., c. (0,0);u’ = -=u, v! = ——v;r = —-,-—<=; node 
, ü , , , $ ’ * 

2 4 4 2 

; 1 i j 1 
asynmptotically stable (0, 2 ;u = OVW = Tu r= 

0, 0; proper or improper node, or saddle point 
y 

= 4.6 M: M ^ M M M M k 

E AME oe dE * on c 

e eee A H s x M M & x y% 

k E. NP ox H \ N ` `N DS AA ^. 

M SS 5 \ M Ni Dj. N M ki 

^ = N x M N » * & \ 

A M N M N ` M N \ N N 

ea SE 21 "ELE 

| 051 i| ca anes 

1 a ‘ + £ 5 + L4 [4 [i 

E E. oF i Fi a " 4 4 4 4 

rd yh. wm 4 4 4 Z £ x , Di 

ac qi etn Lan £ r4 4 £ 4 A 4 

a qut C ‘ rd p e D 4 r4 4 

ge uam. M ‘ £ 4 4 x 4 x 4 

—— N + 4, PA z 4 z PA 

SOHAL 27 ee p 


e. 
YA 
\ N N ` 
YAAA x 
\ VWAW N 
\ AA N 
XN MAAR SR A 
Kv Wd Y QA 
Vin A AAS 
NNNM. NS X 
WK YQ YQ 
AN QX QK SAS 
SK SS 
SASS. SAK KRACK 
Kj ASS ARS SKS 
AA, Ne CNN, 
NNNM NOW 
p 4153 -— 
"P, VP PE" 
99 9 d T we 
L LLLA 
0 0.5 1 15 2 # 
6. b.c. 
t=0,7,2T....: H is a max., dP/dt is a max. 
t = T/4,5T/4,...: dH[dtisamin, Pisamax. 
t = T/2,3T/2,...: Hisamin, dP /dt is a min. 
t=37T/4,7T/4,...: dH/dtisamax, Pisa min. 
7. a. vc aJ (fa y) 


b. V3 

d. The ratio of prey amplitude to predator amplitude increases 
very slowly as the initial point moves away from the equilibrium 
point. 


a. 4zx/N3z 7.2552 
C. The period increases slowly as the initial point moves away 
from the equilibrium point. 


a. Tz26.5 
b. 7 23.7,T = 115 
c. T238,T2 11.1 


11. 


12. 


13. 


a. P,(0,0), P,(1/o, 0), P4(3, 2 — 60); P, moves to the left and 
P moves down; they coincide at (3, 0) when o = 1/3. 

b. P, is a saddle point. P; is a saddle point for o < 1/3 and an 
asymptotically stable node for o > 1/3. P3 is an asymptotically 


stable spiral point for o < c, = ( /7/3— 1/2 = 0.2638, an 
asymptotically stable node for o, < o < 1/3, and a saddle point 
foro > 1/3. 


a. P,(0,0), P:(a/o, 0), Pslc/y, (a/a) — (co /ay)]; 

P, moves to the left and P} moves down; 

they coincide at (c/y, 0) when o = ay/c. 

b. P, isa saddle point. P; is a saddle point for o < ay/c and an 
asymptotically stable node for o > ay/c. P} is an asymptotically 
stable spiral point for sufficiently small values of c and becomes 
an asymptotically stable node at a certain value c, < ay/c. P} is 
a saddle point for o > ay /c. 


a., b. P,(0,0) is a saddle point; P,(5,0) is a saddle point; 


P4(2, 2.4) is an asymptotically stable spiral point. 


14. 


15. 


16. 


b. same prey, fewer predators 
more prey, same predators 
more prey, fewer predators 


same prey, fewer predators 
more prey, fewer predators 
more prey, even fewer predators 


same prey, fewer predators 
more prey, same predators 
more prey, fewer predators 


R^pRhOpRO 


Section 9.7, page 452 


1. 
2. 
3. 


r=1, 0—t- to, stable limit cycle 
r=1, 0=-t+ tọ, semistable limit cycle 
r= l, 0—t- to, stable limit cycle; 


r=3, 0—t- tọ, unstable periodic solution 


4. 


r=1, 0 = —t + t, unstable periodic solution; 


r —2, 0 = —t + tg, stable limit cycle 


5. 
r=2n 0=t+t, n=1,2,3,..., 


6. 


r=2n-1, 0=t+%, n=1,2,3,..., stable limit cycle; 
unstable periodic solution 


r=2, 0 = —t + tọ, semistable limit cycle; 


r= 3, 0 = —t + tọ, unstable periodic solution 


ds 


14. 


15. 


16. 


17. 


a. Counterclockwise 

b. r=1, 0 = t+ 4%, stable limit cycle; 
r=2, 0— t + to, semistable limit cycle; 

r 23, 0 = t + tg, unstable periodic solution 


r= V2, 0 = —t + fg, unstable periodic solution 


a. w=0.2, T2629; w=1, T = 6.66; 
5, T = 11.60 


y, y! = —x + uy - uy [3 

H < 2, unstable spiral point; u > 2, unstable node 
2. 

0. 


Bo 
cn Il Lee 


16, T = 6.65 
2,A21.99, T = 6.31; 
A = 2.03, T = 6.39; 
| A & 2,60, T = 7.65; 

, A S 4.36, T= 11.60 


/ 


b. x = pxty, y = —x + py; A = pi; the origin is an 
asymptotically stable spiral point for 4 < 0 and an unstable spiral 
point for u > 0. 

c. r' 2r(u—- r2) 6 =-1 


LEE Mu 
MANoe 


a. The origin is an asymptotically stable node for  « —2, an 
asymptotically stable spiral point for -2 < 4 < 0, an unstable 
spiral point for 0 < u < 2, and an unstable node for p > 2. 


18. 


Answersto Problems 597 


a., b. (0, 0) is a saddle point; (12, 0) is a saddle point; 


(2, 8) is an unstable spiral point. 


19. 


20. 


21. 


a. (0,0), (5a, 0), (2, 4a — 1.6) 


b. r= -0.25 + 0.125a + 0.025/220 — 400a + 25a”; ay = 2 
b. A= (-6/4- b) + N((5/4 - by — 1)/2 


c. O«b « 1/4: asymptotically stable node; 
1/4 « b « 5/4: asymptotically stable spiral point; 
5/4 « b « 9/4: unstable spiral point; 

9/4 « b; unstable node 

d. by — 5/4 

b. k — 0, (1.1994, —0.62426); 

k = 0.5, (0.80485, —0.13106) 

C. ky = 0.3465, (0.95450, —0.31813) 

d. k=0.4,T = 11.23; k= 0.5, T & 10.37; 
k = 0.6, T = 9.93 

e. kı = 1.4035 


Section 9.8, page 460 


1. 


11. 


12. 


13. 
14. 


15. 


b. 4=4, é” = (0,0, D7; , 
dde £? = (209 - 81 +40r, 0) 


=A, €9 = (20. 9 + 81 F 40r, oJ 
c. 4 & 2.6667, & = (0,0, 1)"; 
Ay = —22.8277, EO = (20, —25.6554, 0)"; 
Az & 11.8277, EO = (20, 43.6554, 0)" 
C. 2; & —13.8546; A, A; = 0.0939556 + 10.1945; 
a. dV/dt = —2e(rxX? € y? + b(z — r? — br?) 


b. c= v0.5 : P (V2/4 - V2. v2); 
A=0, —0.05178 + 1.5242: 

c=1: P, = (0.8536, —3.4142, 3.4142); 
A = 0.1612, —0.02882 + 2.09437 
P4(0.1464, —0.5858, 0.5858); 

A = —0.5303, —0.03665 + 1.1542i 


a. P,(1.1954, —4.7817, 4.7817); 
A= 0.1893, —0.02191 + 2.40077 
P,(0.1046, —0.4183, 0.4183); 

A= —0.9614, 0.007964 + 1.0652i 
d. 7, £59 

a., b., c. c, & 1.243 

a. P,(2.9577, —11.8310, 11.8310); 
A = 0.2273, —0.009796 + 3.5812i 
P4(0.04226, —0.1690, 0.1690); 

A = —2.9053, 0.09877 + 0.99697 

c. T & 11.8 

a. P,(3.7668, 15.0673, 15.0673); 
A = 0.2324, —0.007814 + 4.0078i 
P,(0.03318, —0.1327, 0.1327); 

A = —3.7335, 0.1083 + 0.9941; 

b. T, = 23.6 


Chapter 10 


Section 10.1, page 468 


1. 
2. 


3. 
4. 


y- —sin(x) 
y= (cot( v2) cos( 2x) + sin( V2x))/v2 
y = 0 for all L; y = c; sin(x) if sin(L) = 


y = -tan(L)cos(x) + sin(x) if cos(L) # 0; no solution if 


cos(L) 2 0 


Ds 


No solution 
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y= (-z sin( 2x) + xsin( V22 ))/2 sin( V2) 
No solution 


6 
7 
8. y= c, sin(2x) + sinx 
9. y = cı cos(2x) + Z cosx 


10. y= 5 cosx 


11. y=-3x+ z 


12. y= -5x + La —e»)yclinx4 5° 
13. No solution 


2 
14 4,= (==). »,6) = sin( Ain = [4 eee 


2n- 1? 2n-1 
15. Aet 2 Js »,09 = cos( punc t3 e. 
16. Ay = 9, y(x) = 1; A, =W, y,(x) = cos nx; 
n= 1,2,3,... 
2 
| (Qn-Dz _ (2n — Dzx X. 
17. a= { 2L Ja xe = c05( JL ) 
n= 1,2,3,... 
18. Ay = 0, yoX) = 1; å, = (nz/L),, y, (x) = cos(nzx/L); 
n= 1,2,3,... 
2 
19. n=-( 237) $ y40 = sin( ZPE); 
n= 1,2,3,... 
20. An ze] t (nz/ ln Ly, y,(x) = xsin(nz In x/ In L); 
n21,2,3,... 


21. a. wr = G(R? — r/(4y) 
C. Qisreduced to 0.3164 of its original value. 


22. a. y= kQ — 2L + Dx)/24 
b. y= ka* — 2L + 12x?)/24 
c. y 2 k(* — AL --612x2)/24 


Section 10.2, page 476 
1. T=22/5 

2. T=1 

3. Not periodic 
4. T=2L 

S5. Ti 

6. Not periodic 
75. T2 

8. T-4 


9. f(x) -2L-xinL«x«2L; 
f(x) = -2L — xin -3L <x < —2L 


10. f@)=x-1linl <x <2; f) -x-8in8«x«9 
11. arr ae 

(1) . (naxx 
13. b. f@)= 15 na - sin (=) 


n=1 


14. b. dsl. 2 Y UD 


15. b. fœ) = E +5 


n=1 


( 2cos(Qn - 13) , CD" sinn) ) 
z(2n — 1)? n 

"n. v cos((2n — 1)zx) 

urs A Qn-1y 


16. b. fa) = 


17. b. f= 
3L Y 2Lcos((2n — l)zx/L) ——Ó 
4 4 (2n — 1yz2 na 
= 2 nz 2M. (nx 
18. b. m2 cos( 7 )+ (=) sin( 2 )) 
x sin( =) 
2 
4 ws sin(2n — Dax/2) 
mde CR 
nl 
20. b. f= yc - t sin(nzx) 


2 8 « (-1Y (=) 
= cos 
3 m = n2 2 


1 12 & cos((2n — 1)zx/2) 


22 b. fœ = 5 EP Ona 1 
PR (-1)" nax 
* x Acl n sin( “= ) 
B 1 wv (C-D'-5  (nax 
23. b. f(x) = x Y - ( z ) 


efa4l-CD] CD. 
+> / nig C nr ) sin“) 


9 wxf/I169[-1"-1] 27-1» 
24. b. fog y (SEU C) 


= nirt nig? 
NIX d 108(-1)" +54 . /nzx 
x cos( 7 )- à ww ES sin( 2) 
25. b. m=81 
26. b. m=27 


28. f (t) dt may not be periodic; 
for example, let f(t) = 1 + cost. 


Section 10.3, page 481 


4 ws sin(2n — Dax) 
i 2; 2n-1 


n=1 


( 


al sna) 
n 
—.L | AL xs cos(Qn = l)ax/L) 
2 mg (2n — 1)? 


n-l 


œ s qyn+l 
4. a. jeep T cos(nzx) 
T & 


5. a Kia SS Da — cos(Qn - Dx) 
bas 2 2n — 


6. a. f= E + Y«, cos(nzx) + b, sin(nzx)); 


n-l 
2(-1)" |j (nan), n even 
B c 1/(nz) — 4/(n?x?), n odd 


ay Y ( 1- D) eo s(nx) — =~ sina) 


b. n210; maxle| = 1.6025 at x = xz 
n -—20; max|e| = 1.5867 at x 2 +7 
n = 40; max|e| = 1.5788 at x = +7 
C. Not possible 


1 2 € ] — cos(nz) 

2 T Eo » — a cos(nzx) 

b. n= 10; max|e| = 0.02020 at x = 0, +1 
n —20; max|e| = 0.01012 at x = 0, +1 
n —40; maxle| = 0.005065 at x = 0, +1 

€. n=21 


99 g opyrtl 
9. a. f@ = 255 2 sin(nx) 


b. n = 10, 20, 40; 
c. Not possible 


10. a. fœ) = : + bi (£ scone) cos( 222) 
n=1 


8. a. f@= 


max|e| = 1 atx+1 


nn 


2cos(nz) . /nzx ) 
pe —— —5 sin( Ux) 
nz 2 


b. n= 10; luble| = 1.0606 as x > 2 
n= 20; luble| = 1.0304 as x > 2 
n = 40; luble| = 1.0152 asx > 2 
c. Not az 


11. a. fœ = -+= ++ Y (Ae COS nx 


2 — 2cos(nz) +n?x? cos(nz) . 
— M —— ——— ——-— sin(nzx) 
nn 
b. n= 10; luble| = 0.5193 as x > 1 
n= 20; luble| = 0.5099 as x > 1 
n= 40; luble| = 0.5050 as x > 1 


c. Not possible 


12. a. jo=-B 


sin(nzx) 


= 0.001345 at x = 20.9735 
= 0.0003534 at x = 20.9864 
= 0.00009058 at x = 40.9931 


b. n= 10; maxle| 
n = 20; max|le| 
n = 40; max|e| 
€. n-4 


13. y= (@sin(nt) — nsin(wt))/(@ (o? -n 21 o? 4n 


= (sin(nt) — nt cos(n))/ (27), o? =n? 


14. y- Y b, (o sin(nt) -n sin(c))/ (c (o? - n^)), 


n=1 


$2123. 
y= Y b, (msin(nt) ^ n sin(mt))/ (m (n? — n’)) 


n=1 


nim 
+ b,,(sin(mt) — mt cos(mt))/ (2m?), œ =m 


oo 


4 1 
13, y= P Pr um: 


x( : 
2n — 


16. y 2cosot 4 hog — cos ot) 
2a 


sin(a- DA- c sinon) 
1 [o) 


>) cos((2n — 1)zt) — cos(at) 
s (Qn — 1 (o? — (2n — 1?z?) 


n=1 


Section 10.4, page 487 
1. Odd 

Neither 

Odd 


Even 


LE et 


Even 


16. 


17. 
18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


27. 


28. 


29. 


Answers to Problems 


Neither 


fo) = 4 >) 1- e m cos (^7), 


n= 


(nx /2) — sin( /2) i 
4 >) nn sin(nz in( 7) 


n2 
n=1 


2 


—]yrti = 
. a ORFES y CD cos( ve) 


nan 2n—1 


a. f(x) = Y =(- cos(nz) + = sin( =) ) 


a. f() 21 

4 ws f sin(Qn — 1)x) 
= poy = 25 2n-1 ) 
a. fa) = 


a. fay=}-+ Y DO 
n=1 
a f= zm } oer 2 m 
a. fo) = LN Laun 
n=1 

a. fœ) = 
aa 215 (=)+ = (eos) -1)) 
a K n-l m 2 n2 2 
X COS z) 

2 


a. fQx)= 

Y 4n? z^(1 + cos(nz)) n 16(1 — cos(nz)) sin( 22) 

= n3 a? n?n? 2 
4 1 e lr 3 cos(nz) NIX 

a. ls o E cos ( 27) 
" 3 6 = 1 — cos(nz) AZX 

T s=7+ 5d n2 (F) 
EN 6 v H nzx 

ha) = m » n ( 3 ) 

b. ga) = 

|| xS Acos(nz/2) + 2nz sin(nz/2) — 4 nax 

4 t 2 nig? cos (777) 


_ we Asin(nz/2) - 2nz cos(nz/2) .. (naxx 
h(x) — £ |: sin( ) 
12cos(nz) + 4 


cos( zx ) 
nz? Dad 
oo 


1 i? zx? (8 +5 cosnz) + 32(1 — cos(nz)) 
h(x) = — 
(x) 2; PE 


b. sw-t 
n=1 


n=1 
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30. b. g(x) = = 
Y 6n? z?(2 cos(nz) — 


5) + 324(1 — cos(nz)) cos( AzX 


n^ g^ 
n=1 


itis Y (zm +2 " 144 cos(nz) + =) 


na nix 
n-l 


x sin( Ux) 
3 


40. a. Extend f(x) antisymmetrically from [0, L] into (L, 2L]; that 
x) = —f (x) for 0 € x < L. Then extend this function as 


is, f (2L — 
an even function into (—2L, 0). 


Section 10.5, page 495 


1. xX"—4X-0 T'+AT=0 

2. X'—AxXX 20, T'+ aT =0 

3. X" AX'-X)20, T'- AT =0 

4. (pG)X') + Ar@)X 20, T" c AT =0 

5. Not separable 

6. X" -r(x-A)X-20, Y'—4Y 20 

7. u(x f) = e^ sin(25x) — 725002! sin(55y) 
8. u(x t) = 2e7/16 sin(grx/2) — e7" sin(ax) 


+ 4e" sin(2zx) 


100 e 1- cos(nz) 22 . (nzx 
9. j= n2 22/1600 (=) 
u(x, t) = by 2 e sin 40 


n=1 


— Si 2 
10. u(x, = un sin(na/2) , 
1 


221/1600 sin( 22) 
-— n2 40 
11. ux, = 
100 Y cos(nz/4) — cos(5n2/4) | 1221/1600 sin( um) 
n 40 


n=1 


80 V (=+! =n? x2 . [NTX 
12. fm n2 221/1600 ( ) 
u(x, t) E 2 ae e sin 40 


13. t25, nz 16; 
17. d. t = 673.35 
18. d. t= 451.60 
19. d. t=617.17 


20. b. 125, x 23320; 1210, x 231.13; t= 20, 
x = 28.62; t= 40, x = 25.7351 = 100, x = 21.95; 
t = 200, x = 20.31 
e. t= 524.81 


200 — 1—cos(nz) _,2,2,2 . (nnx 
21. ġġ = > LO SISIA) pn x7 a /400 ( ) 
u(x, t) m E e sin 20 


t=20, n=8; t=80, n=4 


n=1 
35.91°C 
67.23°C 
99.96*C 


76.73 s 

152.56 s 

1093.36 s 

aw, — bw, + (c — bó)w = 0 
5=c/bifh £0 
26. X" --,?X =0, 

27. PR"-rR' - (2) — ,2)R 20, 9" + 720 =0, 
T' +T =0 


23. 


25. 


Fp Ppp PpP 


Y" + (2 -uDY =0, T!+022T =0 


Section 10.6, page 502 


1. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


Nego DN AN Aw geo 9 OBS 


oo 
2.2 : 
. a. u(x,t) 2 407 £ cpe 7D 771/3600 Sin 


3 


=10+= 
u se 
5c 
u = 30- = 
4° 
LT 
u=T(L+x)/(+L) 
u=T1+L—-x/d+L) 
a u(xft)- 
vs 70 cos(nz) + 50 —0.8652521/400 <: (m 
3 ————————e7 — 
di? nz TE 20 ) 
d. 160.29 s 
a. f(x)= 2x, 0€ x € 50; 
f(x) = 200 — 2x, 50<x< 100 


b. u(x,t) = 20 — s t »6 e71 14n? eiuf aop? sin( 775 


100 


n=1 


800 . (=) 40 
si 
2 


nz? 
d. u(50, f) > 10 as t > 00; 3754 s 
a. u(xt)-30-x-4 2 cpe"? #711900 sin( T5), 


C= L oa — cos(nz)) — n? z?(1 + cos(nz))) 
n3 


n 


oo 
a. u(xft)-— = + e ue cos( 222), 
n=l 
c= 0, n odd; 
= —4/(n? — 1), neven 


b. lim u(x, t) = 2/z 
teow 


a. u(xt)- ~ + ve cpe eI? 1/6400 o o (45 ) 


n-l 
_ 160 
2n 3n?z? 


(3 + cos(nz)) 


c. 200/9 
d. 1543s 


a. u(xt)- = + X c,e "7 1/900 cos ( 225), 


n=1 


Ch = > (sin(“ ) = sin() ) 


— 2 2521412 (2 1) 
n — l)zx 
b. u(x, t) = 2 gres ss le s (Enzo), 


n=1 


2 f} . ( Qn=1)ax 
JI fs) sin( Ez Pa 


— 22 (2n — 1)ax 
ie -(2n-1) 21/3600 + = 
a. {= > ——— | 
u(x, t) 2, Or! sa( 60 ) 
120 


Cy = Qn = yai cos(nz) + (2n — 1)z) 


C. x, increases from x = 0 and reaches x = 30 
when t = 104.4. 


(2n — 1)zx 
60 


n=1 
C= ga Dig cos(nz) — 


"^ Qn- Dix ene 


, 


J Cn = 


rH) sss 
19. u@)= aera! 
r(1- L= Lua) a<x<L, 
a &+(L/a)-1 


where € = &5A5/(k1A1) 

20. e. u(x) = d sin(J,x) 

21. ov" 4s(x)- X0) 2 T, v1)-2T,, 

w, = Wai w(0, t) = 0, w(L, t) 2 0, 

w(x, 0) = f(x) — v(x) 

22. a. v(x) = T, + (T; — T(x/L) + kLx/2 — kx? [2 
b. w(x, t) = 2n ce"? 71/400 sin( =). 


n=1 
_ 160 (cos(nz) — 1) 
ps "REPE 


23. a. v(x) = T, + (Ty — Tx/L + kLx/6 — k? /(6L) 
b. wx t) = by c,e ua sin( um), 


n=1 


_ 20 ( 3n? z? (3 cos(nz) — 1) + 60cos(nz) ) 


C= 
" 3 wr 
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8 c: 1 XN . (nnx nzat 
l. a. u( t) = » z sin( 7) sin( 77) ( 5 ) 
2. a. u(x,t) = 
2 Y a sin( 75) + sin(=27)) sin( 77) cos( 22%) 
n2 = n2 4 4 L L 
G2 
3. a. u&x t) = L — sin( 0) cos( 22%) 


n=1 


4. a. u(x t) — es Y Sine) sin) sig mex) 
Da 


n=1 Mt 


Scos ( nzat ) 
L 


5. a. u(x,t) = 8L E sin( 7) sia( 2 ) sin( 2) 
eee" am m 2 L L 


n=1 


6. a. u(x,t) = 
8L x5 sin(nz/4) + sin(3nz/A) . es . (ec) 
— Sint — | Sin 

L 


an e n? L 
n=1 
oo 
32L cos(nz) +2 . (nzxN . (nzat 
7. a. u(x t) = sin( ) sin( ) 
02 am^ eu nî L L 
8. a. u(x t) = 
oo . " 
AL sin(nz/2)sin(nz/L) . (zm) . (12) 
Biot ———$ sind — ) sin 
an? n2 L L 


n=1 


n=1 


_2 p . { Qn - Dax 
JI sosa SW Pa 
sS 1 Qn- 1)z 
10. a. u(x t) = = Tk sa( 1 ) 


; (ex 2 : (gezom) (= 23 
xsin| ———— cos 
2L 2L 2L 


1l. a. u) = 222 y Qn = Da + 3cosinz) 
a n-l (2n — 1)4 


: (Ss 2) (= x) 
Xxsin| ————— | cos| —————— 
2L 2L 
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14. b. $(x-at) represents a wave moving in the negative x direction 
with speed a > 0. 
15. Using g 2322 ft/s*: 
a. 249 ft/s 
b. 49.8nz rad/s 
c. Frequencies increase; modes are unchanged. 
21. PR" +R x (2p — ,DR-0, 0" + 20-9, 
T" + aT =0 
23. a. a, =ay1 + (2D? /n2 x?) (c y=0 
24. a. c= =; (2 sin( 7.) - sin( 225) - sin( 2 )) 


Section 10.8, page 520 


l. a. u(x, y)= 25 sin( 


Nt) ei 
a a 


Cn = : s0 sin( a E Jas 


b. uy) = 

4a N^ 1 sin(nz/2) . (=) ; (=) 
m — ———— —— sin( — | sinh{ — 
m2 = n? sinh(nzb/a) a 


n=1 


c= 
" smn Jy 
3. a. u(x, y)= 2 c\ sinh( ; 


O sinf MEX) sinh nz(b — y 
DE in m 3) sin (= " 


h(x) sin( e 
a 


"EE, LUN eds) P 
E mE iN Fy) sin p |^» 
2 
pO = AL E E sin( za 
iE sinh(nzb/a) a 
2 
b. = "—-— E 
“a nz sinh(nza/b) 
-2 (mr? — 2) cos(nz) + 2 
" pp sinh(nzb/a) 


5. u(n0)— T + Y r (e, cos(n0) + k, sin(n0)), 
=] 


n 2z a 2z 
c,=— | f(0)cos(n0)d0, k, = 
a Jo a Jo 
oo 


FCO) sin(n0) do 


6. a. u(r6) = » c," sin(n0), 


=l 


ri f(8) sin(n0) d 


= ian 
_ 4 1-cos(nz) 
b. Ch = zd a 
V 0 
pA 0)— LZ (= j! 
u(r, 0) — Le sin a 
c, = Q/aja "*l« p fO) sin( (= ne Jae 


- nax 
8. a. T mle sin( AE , ) 


n=1 
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10. b. ux y) = co + Y €; "s cos( 22), 


n=1 


_ 2/nx 
Cn = sina h, fo» cos( m e 


11. ulr, 0) = co + Y r" (c, cos(n8) + k, sin(n0)), 
n=1 


2a 
C, = l f g(0)cos(n8) dé, 
0 


nra”! 


1 2r 
k, = —_ | g(0) sin(n0) dé; 
nma 0 


2z 
necessary condition is f g(0)d0 = 0. 
0 


12. a. u(xy)- Sa sin(* 77: cos ( Z2), 


n=1 


2/a g . f nzX 
me cosh(nzb/a) Jo 80) sin( 75 Jas 
4a sin(nz /2) 
b. c, = 


13. a. 


ux, y) = 


^ 
© 
l 


"o n2x2 cosh(nab/a) 
6.9 = Yo sinh( DA i (22) 
n=1 2b 
7 2/b zh 
^» 7 sinh(Qn — Daa/2b) a fo» s (Ete Jo 
32b? 
C= 
RC m ve 3 sinh(2n — 1)aa/2b) 
Ys cos( 222 £) sinh( 2 =, 
2 
— ab T gx) dx, 
= 2/a j nzxx 
Cn = anhnab/g) Ja 8 cos( 72 Jax 
2 at E 24a + cos(nz)) 
B b (1 z 5) Lan ^ sinh(nzb/a) 
19. a. u(x, z)=b+ E 
_ 4aa Z cos((2n — I)zx/a)cosh((2n — l)zz/a) 
a2 £ (n= 1) cosh((2n — Dab/a) 
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1. Homogeneous 
Nonhomogeneous 
Nonhomogeneous 
Homogeneous 
Nonhomogeneous 


Homogeneous 
. a. $,0)- sin( 4, ). where 4/4, satifies va- 
— tan( Va z) 


No 

A, = 0.6204, A, = 2.7943 

A, & (2n — 1)*/4 for large n 

Pf, (x) = cos( 4). where VÀ, satisfies Va = cot( Vä) 
No 

A, = 0.7402, A, = 11.7349 

A, & (n — 1)?x? for large n 


$,0) = sin( Ax) VÀ, cos ( 4x). where 4/ 4, satisfies 
a= Dsin( V) = Zeos( V4) =0; 


IAW RYH 


peaoosp aos 


b. No 
C. A, = 1.7071, A, = 13.4924 
d. A, & (n — 1? z? for large n 
10. a. Forn = 1,2,3,...,¢,@) = sin(u,x) — 
An = =p, where y, satisfies 4 = tan p. 
b. Yes; Ag = 0, do(x) =1-x 
C. å; = —20.1907, A, = —59.6795 
= —(2n + 1 z?/A for large n 
12. uœ) = e? 
13. w(x) = 1/x 
14. u()-e 
15. uix)yed-xyl^ 
16. X"-AX-0, T"-FcT' -(k 4 Aa2)T =0 
17. a. s(x) = rd 
b. A, 2 iz?, p œ) = e sin(nzx); n= 1,2,3,. 
18. Positive eigenvalues A = 4,, where 4/4, satisfies ei = 2/3 
tan(3 vaL) corresponding eigenfunctions are @,(x) = 
e sin(3/,x). If L = 1/2, Ap = 0 is eigenvalue, do) = xe?* 
is eigenfunction; if L # 1/2, A = 0 is not eigenvalue. If L € 1/2, 
there are no negative eigenvalues; if L > 1/2, there is one negative 
eigenvalue 4 = —,2, where p is a root of y = 2/3tanh(3uL); 
corresponding eigenfunction is $. ; (x) = e~” sinh ux). 


Hp cos(u,x) and 


19. No real eigenvalues. 
20. Only eigenvalue is A = 0; eigenfunction is f(x) = x — 1. 


21. a. 2sin( V4) - Vicos( V2) =0 
C. A, = 182738, A, = 57.7075 
d. 2 sinh( 4/4) — y/u cosh( 4/1) =0, w=-A 
e. A & —3.6673 
24. a. 4, = ui, where p, is a root of sin uLsinh uL = 0, hence 
An = (nn /L)*: 
à = 97.409/L^, A, = 1558.5/L^, p, (x) = sin(nzx/L) 
b. 4, = x4, where y, is a root of 
sin(uL) cosh(uL) — cos(uL) sinh(uL) = 0 


A, & 2312/15, Ay = 2496.5] L^, 
jie = ous 
m X) n Hn”. sinh(u,L) sin Hy, 


C. 4, = i5. where p, is a root of 1 + cosh(uL) cos(uL) = 
0; AS = 12.362/L^, A, = 485.52/L4 


yx) = sin(x) — sinh(gu,x) 
sin(u,L) + sinh(u,L) 
i cos(u, L) + cosh(u,L) 


25. d. $,(x) = sin( /4,x). where 4, 
satisfies eos( VAL) — YVA,L sin( /A,L) =0 


e. 4& L1597/D2, A, = 13.276/L2 


(cos(u,x) — cosh(u,x)) 


Section 11.2, page 543 
L 440- V2sin((n- ;)n): =i D a 
2. p,a) = Va(cos(n - ;)n): RN 
3. WEL p = V2cos(nzx); n=1,2,... 
V2cos( VAn) 
(1 + sin? /2,) 2 
where 4, satisfies cos( 4.) =A, sin( 4, ) =0 


5. $,(x) = V2e'sinnzx); n=1,2,... 


4 $,(x) = 


22 


aC | a 


_yyi-l 
y ,2 eer, 


n= On- T 


8. a,= = =i — cos((2n — 1)z/4); n=1,2,... 
9. a= _2visin( (n= (2). n=1,2,... 


2 
(n - 5 x? 
2 1/2 
In Problems 10 through 13, a, = (1 + sin? vin) 
and cos 4/4, — A, sin 
y2sin Van 


10. a, = —————; n= 1,2... 
An Gy 
Va(2cos V4, - 1). 
11. a,= ; n=1,2,... 
Anan 
VA(1~ cos Vin) 
12. a, = ———— ————-; n= 1,2... 
Anan 
V2sin( /4,/2) 
135: 4, 2 —— —— —— ji re 1, 2... 
440, 


nn 


14. Not self-adjoint 
15. Self-adjoint 
16. Not self-adjoint 
17. Self-adjoint 
18. Self-adjoint 


21. a. If a; 2 O or b, = 0, then the corresponding boundary term is 


missing. 
25. a. 4 2 z?/D^; p% = sin(zx/L) 

b. A, = (4.4934)?/L?; 

P(x) = sin( 41x) m VA xeos( y% L) 

c. 12 = (22 /L^; pœ) 21— cos(2zx/L) 
26. - az (AP); $09) 21- Mino 
27.. as p — (v/D)X! + AX 20, X(0) = 

X'(L) 2 0; T' c ADT 20 


e. cx t= A a,e ^nP'g/0D gin(y, y), 


n=1 


where 4, = 42 + v? /(4D?); 


L 
4Dy? f e OD) sin(u,x) dx 
0 


a, 


2LDy2 + vsin'(u,L) 
28. a. u,+vu, = Du,» u(0, f) 20, u, (L, f) 2 0, 
u(x, 0) = —co 


b. u(x, t = x b, e ^P eD) gin(y, y), 


Xx? 


n=l 
where 4, = 42 + V? /(4D?); b, = 
8cyD? u? (e D (2D y,, cos(u,L) + v sin(u,L)) — 2Dp,) 


(v2 +. AD? P)QOLDy? + v sin (uL) 


Section 11.3, page 553 


(222 — 2)nz 


Answers to Problems 


a O Es cos(2n — 1)zx) 
3. y- 4 los — D?z? — 2)0n — 1yz? 


co es Vin- I) cos( V7, x) 
B aU »(1 t sin? VÀ) 


8 Y sin(nz /2) sin(nzx) 


(a? — 2) nig? 


n=1 


6-9 For each problem the solution is 
oo 


y à l0. 


nc] n 


1 
€, = f f 009,00 dx, 
0 


603 


HFA, 


where @,,(x) is given in Problems 1—4, respectively, in Section 11.2, 
and 4, is the corresponding eigenvalue. In Problem 8 summation starts 


atn = 0. 


10. a=-=, y= I 5 (x- 5) tesin(axx) 
11. No solution 

12. ais arbitrary, y = ccos(zx) + a/x? 

13. a=0, y 2csin(zx) — (x/2z) sin(zx) 
17. vax) =at+(b-ax 


18. wa=1- 2 


19. u&x, t)= (Us + (2 + 4e) sin( =) 


EY a re nmm) (n 3) 
Cy = evi n=1,2,... 
20. u(xn0- 


Ye m 
n=l n 


V2sin( 4, ) Va (1- cos( à.) 


T Ant) + ae wt) oe 
(1 + sin’ ( VA) ) 


n 


1/2 n 
JA +sin?( /A,)) 
and 4, satisfies cos VÀ, = VA, sin( V/A) = 


21. wxi=8 Y i d — e 9^) sin(nzx) 
o c,(e t 9-1/2 271) sin( (n — 2 TX 

22. u&x, t)= va, nnm ae e 
= 
2y2Qn — 1a + A /2(-1y 

Qn — 1yz2 
23. a. rGOw, = (pG)w), — qGOw, w(0, f) = 0, 

w(l,f) 2 0, w(x, 0) 2 f(x) — vx) 

e On-? xt an((25 — 1)zx) 

2n—1 


where c, = 


24. u(xn2x-2x41-4 E 


n= 


25. u(x,t) = —cos(zx) +e" 1/4 cos(3zx/2) 


Areas (un) 
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1 oo 
31-34. In all cases solution is y = | G(x, s)f(s) ds, 2. u(nt)- 2 k, Jo (A,r) sin(A,at), 
where G(x, s) is given below. à i io 1 
ETT 2 
— ( i“. d xx k, = af mansoa | rJó(,r)dr 
i Pon l5 x<s<l 3. Superpose the solution of Problem 2 and the solution (Eq. (21)) 
|[s2-2/2 O<s<x of the example in the text. 
32. Gx, s) = Us = 5)/2, x<s<l co » 
33. Go, a [ Cs) /cos(D, 0<s <x eae aie 
j > À sin(1 — s)cos(x)/cos(1), x<s<1 1 i 1 
c, = | oA nf dr / f rJo(A,r) dr, 
34. 93-0 ones f, à y 
(mam and 4, satisfies Jo(A) = 0. 
Section 11.4, page 561 7. b. v(r,0) = Ayer) 
oo C ud . 
1. = n À + J (krY(b,, sin(m@) + c,, cos(m0)), 
. 2 À,— wll Vex) 2 
i M b : * R6) in(m0) do 1,2 
T "LI sin(m ; m=1,2,... 
es jJ feo ( à. x)ax / f x À,x dx. - Z A 
V, satisfies Ja( V2) =0. on re A, f(8)cos(m0) d0; m=0,1,2,... 
oo 1 1 
2. d. y= miu + > fn JoQ/ À, 3): 8. €, = | rf(rMJg(A,r) aff rJ (A,r) dr 
H n=1 An UH 0 0 
1 co 
Co = Ji fo») dx; 10. u(p, s) = b» c, p" Pí(s), 
0 n= 
1 1 1 i 1 
ES | FIAn x) a/f xJ5 (4/4, x) dx, where c, = / f (arccos s)P, (s) «f [ P^(s) ds; 
0 0 -1 -1 
n-21,2,....5 Va, satisfies (va) =0. P, is the nth Legendre polynomial and s = cos(@). 
1 1 
3. d. a, = f x (às x)feo ax/ [ x ( yaz) dx Section 11.6, page 571 
0 0 
m 1. n=21 
Cn 
e y= Y. AA). 2. a. b, = (CD V2/(nz) 
= “2 # p 
7 c. n=20 
1 1 as 
c, = f FOI 0) dx 7 l (VT) dx 3. a. by, = 2V2(1 — cos(ma))/ (à x?) 
0 0 €. n=1 
Ly fn 7. a. f(x) 21 
4. b. y= Pj,4 00, where 0 
2 An H b. iG) 2 ¥30 - 2x) 
1 i c. fha) = V5(-1 + & - 6x2) 
C, - fP- 00 dx f Po, 0) dx d. g@=1, gi) =2x-1, ga) = 62 -6x4 1 


8. Pox) =1, PQ) =x, Po(x) = Gx? - 1)/2, 
Section 11.5, page 564 P3(x) = (5 — 3x)/2 


1. b. u(&2) 2 f(£-1) uG,0)=0, 0< E <2, 
u(0, n) = u(2,n)=0, O< yn <2 
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Abel, Niels Henrik, 117, 176 
Abel’s formula, 117, 132, 133, 174, 187 
for systems of equations, 307 
Acceleration of convergence, 482 
Adams, John Couch, 371 
Adams-Bashforth formula 
fourth-order, 372 
second-order, 371 
Adams-Moulton formula, 372 
first-order, 372 
fourth-order, 372 
second-order, 372 
Adaptive numerical method, 361, 366, 370 
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differential equation, 120 
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Advection, 544 
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Amplitude 
of simple harmonic motion, 151 
Amplitude modulation, 165 
Analytic function, 192, 206 
Angular momentum 
principle of, 23, 400 
Annihilators, 
method of, 184—185 
Aquifer 
flow in, 522, 544 
Archimedes, 158 
Augmented matrix, 291, 296 
Autonomous 
equation, 58 
system, 399 


B 
Backward differentiation formulas, 374—375 
first-order, 374 
fourth-order, 374 
second-order, 374 
Backward Euler formula, 357-358 
Bashforth, Francis, 371 
Basin of attraction, 403, 415, 423, 440-443 
Beat, 165, 269 
Bendixson, Ivar Otto, 448 
Bernoulli, Daniel, 68, 69, 489, 504 
Bernoulli, Jakob, 50, 58, 280 
Bernoulli, Johann, 50, 211, 271 
Bernoulli equation, 58 
Bessel, Friedrich Wilhelm, 230 
Bessel equation of order 
k, 561 
nu, 120, 196, 219, 230, 239, 533, 556 
one, 224, 236-239 
one-half, 234—236, 239 
zero, 224, 231—234, 239, 256, 522, 557, 563 
Bessel functions 
Jo(x), 232, 233, 239, 256, 557, 560, 561, 563, 
566 
asymptotic approximation to, 234 
Laplace transform of, 256 
zeros of, 240, 558, 564 
J\(x), 237, 239 
J_1/2(x), 236 
Jıj2a(x), 235 
Yo(x), 233, 557, 563 
asymptotic approximation to, 234 
Yi(x), 238 
orthogonality of, 240, 561 


Bessel inequality, 572 
Bessel series expansion, 564 
Bifurcation diagram, 69 
Bifurcation point, 69, 97, 408, 427—453, 455 
Hopf, 453 
pitchfork, 69 
saddle-node, 69 
transcritical, 69 
Boltzmann, Ludwig, 48 
Boundary conditions, 463 
for elastic string, 504, 512 
for heat conduction equation, 489, 497, 
498, 524 
for Laplace's equation, 515 
nonhomogeneous, 497-498, 552 
periodic, 542, 571 
separated, 535 
Boundary layer, 383 
Boundary value problems 
two-point, 463 
heat conduction equation, 488—503, 
522-552 
homogeneous, 464, 535, 545 
Laplace's equation, 514-522 
nonhomogeneous, 464, 545-556 
self-adjoint, 541-542, 559 
singular, 556—561 
Sturm-Liouville, 535, 541 
two-point, 468, 530 
wave equation, 504—514, 526-527, 554, 
562-564 
Brachistochrone, 50 
Brusselator, 453 
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Capacitance, 156 
Cardano, Girolamo, 176 
Cauchy-Euler equation, 211 
Cayley, Arthur, 286 
Center, 322, 395, 408, 412, 416 
Change of independent variable, 126, 240 
for Euler equation, 126, 214 
Chaotic solution 
of logistic difference equation, 97, 100 
of Lorenz equations, 457 
Characteristic equation, 105, 175, 251 
complex roots, 120, 177 
repeated, 178 
real and equal roots, 127, 178 
real and unequal roots, 106, 175 
Characteristic polynomial, 175, 251 
Chebyshev, Pafnuty L., 209, 435 
Chebyshev equation, 209, 224, 533, 561 
Chebyshev polynomials, 209, 561 
Chemical reactions, 70 
Collocation 
method of, 566 
Competing species, 417 
Complementary solution, 134 
Complete set of functions, 568 
Complex exponential function, 120-121 
Compound interest, 43 
Computer use in differential equations, 20 
Conjugate matrix, 287 
Continuous spectrum, 559 
Convergence 
of an improper integral, 241 
of a numerical approximation, 358 
of a power series, 189 


Convergence in mean, 568 
Converging solutions, 3, 12, 82 
Convolution integral, 147, 275-280 
Laplace transform of, 275-276 
Cosine series, 483, 540 
Cramer, Gabriel, 186 
Cramer’s rule, 186 
Critical amplitude, 65 
Critical damping, 154 
Critical point 
approached by trajectory, 396 
center of linear system, 322, 395, 408, 416 
definition of, 389, 399 
for first order equation, 60 
improper node of linear system, 340, 392 
isolated, 409 
node of linear system, 315, 323, 389 
proper node of linear system, 391 
saddle point of linear system, 313, 323, 390 
spiral point of linear system, 321, 323, 394 
Cycloid, 50, 280 


D 
D’Alembert, Jean, 127, 489, 504, 513 
Dal Ferro, Scipione, 176 
Damping force, 148-149, 400 
Decay, radioactive, 16 
Degenerate node, see Improper node, 392 
Diagonalization 
of homogeneous systems, 334-335 
of matrices, 333-334 
of nonhomogeneous systems, 345-347 
Difference equation, 91-100 
equilibrium solution of, 92 
first order, 91-100 
initial condition for, 91 
iteration of, 92 
linear, 91-94 
logistic, 94-100 
chaotic solutions of, 97, 100 
nonlinear, 91, 94-100 
solution of, 91 
Differential form, 33 
Differential operator, 110, 535 
Diffusion equation 
see Heat conduction equation, 524 
Dimensionless variables, 495, 512 
Dirac, Paul A. M., 271 
Dirac delta function, 271, 545, 556 
Laplace transform of, 271-272 
Direction field, 
for first order equations, 3, 4 
for systems, 310 
Dirichlet, Peter Gustav Lejeune, 515, 538 
Dirichlet problem 
for circle, 518-521 
for rectangle, 516-518, 520-521 
for sector, 521 
for semi-infinite strip, 521 
for semicircle, 521 
Discontinuous coefficients, 58 
Discontinuous forcing function, 58, 264-269 
Dispersion 
mechanical, 544 
Dispersive waves, 514 
Divergence 
of an improper integral, 241 
of a power series, 190 
Diverging solutions, 5, 11, 82 
Drag force, 2, 9 


Duffing, Georg, 407 
Duffing equation, 407 


E 
Eigenfunctions, 466, 531 
normalized, 538 
orthogonality of, 560 
series of, 540—541, 545, 560, 569 
Eigenvalues 
of linear homogeneous boundary value 
problems, 466, 531 
of Sturm-Liouville problem 
real, 467, 468 
of Sturm-Liouville problem 
existence, 536 
real, 536 
simple, 538 
when positive, 543 
Eigenvalues of matrix, 300-303 
algebraic multiplicity, 301 
geometric multiplicity, 301 
real, 303 
simple, 301 
Eigenvectors of matrix, 300-303 
generalized, 341 
linear independence of, 301 
normalized, 301 
orthogonality of, 303 
Elastic bar 
longitudinal vibrations of, 534, 554 
transverse vibrations of, 534 
Elastic column 
buckling of, 544 
Elastic membrane 
vibrations of, 513, 562-564 
Elastic restoring force, 527 
Elastic string 
boundary value problems for, 504-514 
derivation of wave equation, 526—527 
free at one end, 512 
general problem, 511 
infinite length, 513 
justification of solution, 509-510, 513 
natural frequencies of, 507 
natural modes of, 507 
nonzero initial velocity, 510 
propagation of discontinuities in initial data, 510 
wavelength of, 507 
Electrical networks, 16, 156-157, 255, 281, 
285-286 


global truncation error, 360, 362 
local truncation error, 359-361 
Euler- Fourier formulas, 471—472 
Euler - Máscheroni constant, 233 

Even function, 482 
Even periodic extension, 485 
Exact equations, 70-73 


necessary and sufficient condition for existence 


of solutions, 71 
for second-order equations, 119 
Exchange of stability, 69, 95 
Existence and uniqueness theorems, 19 
for first order equations, 51-55, 83-91 
proof of, 84-90 
for nth order linear equations, 169 
for second-order linear equations, 110 
for series solution of second-order linear 
equations, 207, 229 
for systems of first-order equations, 283 
Exponential growth, 59 
Exponential matrix, 332-333, 336 
Exponential order 
functions of, 244 
Exponents at the singularity, 221, 225 


F 
Falling object problem, 1-3, 12-13 
Fehlberg, Erwin, 370 
Feigenbaum, Mitchell, 100 
Feigenbaum number, 100 
Ferrari, Ludovico, 176 
First order ordinary differential equations 
applications of, 39-50, 58-70 
Bernoulli, 58 
direction field, 3, 4 
exact, 70-73 
existence and uniqueness theorems, 51-55, 
83-91 
proof of, 84-90 
general solution of, 11, 29, 56 
graphical construction of integral curves, 56 
homogeneous, 38-39 
implicit solutions, 56 
integrating factor for, 25, 28, 73-75 
interval of definition, 29, 36, 55-56 
linear, 24-32 
nonlinear, 51 
separable, 33-39 
series solution of, 209 
first-order Adams-Bashforth formula, 372 
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Frequency, natural 

of simple harmonic motion, 151 

of vibrating string, 507 
Frobenius, Ferdinand Georg, 220, 286, 388 
Frobenius 

method of, 220 
Fuchs, Lazarus Immanuel, 207, 388 
Fundamental matrix, 329-336, 341 
Fundamental modes of vibration, 326 
Fundamental set of solutions, 114, 115, 170, 306, 

307 

Fundamental solutions 

of heat conduction equation, 492, 500 

of Laplace's equation, 517, 520 

of wave equation, 506, 511 
Fundamental theorem of algebra, 175 


G 
Galois, Evariste, 176 
Gauss, Carl Friedrich, 175, 291 
Gaussian elimination, 291 
Gear, C. William, 374 
General solution of linear equations 
first order, 11, 29, 56 
nth order, 171 
second order, 106, 114, 123, 129, 134 
systems of first-order equations, 306 
Generalized function, 271 
Gibbs, Josiah Willard, 480 
Gibbs phenomenon, 480, 485 
Global asymptotic stability, 415 
Gompertz, Benjamin, 67 
Gompertz equation, 67 
Gradient of, 438 
Graphical construction of integral curves, 56 
Gravity, 2, 46 
Green, George, 554 
Green's function, 554-556 


H 

Half-life, 16, 48 

Harvesting a renewable resource, 68 

Heat conduction equation, 17, 489, 524 
bar with insulated ends, 498-501 
boundary conditions, 489, 497, 498, 524 
derivation of, 522-526 
with external heat source, 503 
fundamental solutions of, 492, 499 
nonhomogeneous boundary conditions, 

497-498 

nonhomogeneous source term, 525, 549-552 


Elliptic integral, 417 

Environmental carrying capacity, 61 
Epidemics, 68-69 

Equidimensional equation, 211 
Equilibrium solution, 3, 5, 12, 60, 92, 389 
Error 


in polar coordinates, 496 

smoothing of discontinuities in initial 
conditions, 496 

solution of fundamental problem, 490-492 

steady-state solution, 497 


first-order method, 360 

Fitzhugh, Richard, 453 
Fitzhugh-Nagumo equations, 453 
Fourier, Joseph, 469, 489 

Fourier coefficients, 568 


for Adams-Bashforth formula, 372 

for Adams-Moulton formula, 372 

effect of step size, 360, 378—380 

for Euler method, 359—362 

for Fourier series, 474—475, 480 

global truncation, 359, 360, 378-380 

for improved Euler method, 364, 367 

local truncation, 359 

mean square, 567 

for modified Euler method, 367 

round-off, 359, 363, 378-380 

for Runge-Kutta method, 368 

Escape velocity, 47 

Euler, Leonhard, 77, 233, 488, 489, 504, 514 

Euler equation, 126, 133, 211-215, 318, 328, 343, 
519 

change of independent variable, 126, 214 

Euler formula for exp(it), 121, 179 

Euler method, 76-83, 354-363, 376 

convergence of, 83 


Fourier series, 469—488 
acceleration of convergence, 482 
choice of series, 485—486 
convergence of, 478, 481 
convergence of partial sums, 474, 480 
cosine series, 483, 540 
error, 474—475, 480 
Euler-Fourier formulas, 471—472 
Gibbs phenomenon, 480, 485 
integration of, 471 
orthogonality of sines and cosines, 470-471 
Parseval equation, 481, 487, 514, 572 
periodicity of sines and cosines, 469—470 
sawtooth wave, 484, 488 
sine series, 484, 540 
specialized kinds, 488 
square wave, 479, 487 
triangular wave, 472, 474, 487 
Fredholm, Erik Ivar, 547 
Fredholm alternative theorem, 547 


transient solution, 497 
Heaviside, Oliver, 243 
Heaviside function, 257 
von Helmholtz, Hermann, 565 
Helmholtz equation, 565 
Hereditary systems, 276 
Hermite, Charles, 204, 286 
Hermite equation, 204, 219, 533 
Hermite polynomials, 204 
Hermitian matrix, 204, 303, 316, 333, 338, 541, 
554 
Heun, Karl, 364 
Hodgkin, Alan L., 427 
Hodgkin- Huxley equations, 427, 453 
Homogeneous algebraic equations, 295 


Homogeneous boundary value problems, 464, 530 


eigenfunctions, 466, 531 
eigenvalues, 466, 531 

singular Sturm-Liouville, 556-561 
Sturm-Liouville, 535, 545 
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Homogeneous differential equations, 
with constant coefficients, 103—109, 120-133, 
174-181 
systems, 309-345 
definition of, 104, 284 
general theory of, 110—120, 170—173, 304-309 
Homogeneous first order differential equations, 
38-39 
Hooke, Robert, 147 
Hooke's law, 147 
Hopf, Eberhard, 453 
Hopf bifurcation, 453 
Huxley, Andrew F., 427 
Huygens, Christian, 280 
Hypergeometric equation, 230 


I 
Identity matrix, 290 
Implicit numerical method, 357, 372, 386 
Implicit solutions, 56 
Improper integrals, 241-243 
comparison theorem for, 243 
Improper node, 340, 392, 412 
Improved Euler method, 363-367 
Impulse functions, 270-274 
Impulse of force, 270 
Impulse response, 278 
Indicial equation, 221, 225, 229 
Inductance, 156 
Initial conditions, 11, 57, 91, 104, 169, 283 
propagation of discontinuities for wave 
equation, 510 
smoothing of discontinuities for heat conduction 
equation, 496 
Initial value problem, 11, 35, 104, 283, 169 
Laplace transform of, 248-255 
Inner product 
for functions, 470, 536 
for vectors, 289 
Instability 
of critical point, 62, 92, 94, 396, 399, 402, 438 
of numerical method, 381—386 
of periodic orbit, 447 
Integral curves, 11, 56 
Integral equation, 84 
Volterra, 279 
Laplace transform of, 279 
Integral transform, 243 
Integrating factor, 25, 28, 73-75, 120 
Integro-differential equation, 279 
Laplace transform of, 279 
Interval of convergence, 190 
Inverse Laplace transform, 251 
as linear operator, 253 
uniqueness of, 251 
Inverse matrix, 290—292 
Invertible matrix, 290 
Irregular singular point, 216, 230 
Iteration 
of difference equation, 92 
method of, 84 


J 

Jacobi, Carl Gustav Jacob, 411 
Jacobian matrix, 411, 455 

Jordan, Camille, 286, 341 

Jordan form of matrix, 341—342, 344 
Jump discontinuity, 242, 478, 510 


K 


Kernel 

of convolution, 147 

of integral transform, 243 
Kirchhoff, Gustav, 156 
Kirchhoff's laws, 156, 285 


Kronecker, Leopold, 538 
Kronecker delta, 538 
Kutta, M. Wilhelm, 367 


L 
Lagrange, Joseph-Louis, 143, 192, 469, 504 
Lagrange's identity, 535, 558 
Laguerre, Edmond Nicolas, 224 
Laguerre equation, 224, 533 
Laplace, Pierre-Simon de, 243, 514 
Laplace transform, 241—280 
of convolution, 275-276 
definition of, 243 
of derivatives, 248-249 
of Dirac delta function, 271—272 
existence of, 244 
of initial-value problems, 248-255 
of integral equations, 279 
of integro-differential equations, 279 
inverse of, 251 
as linear operator, 246 
of periodic functions, 263 
of rectified sine wave, 264 
of sawtooth wave, 264 
of square wave, 263 
for systems of equations, 350-351 
translation formula, 259, 262 
of unit step function, 259 
Laplace's equation, 514—522, 564 
boundary conditions, 515 
Dirichlet problem, 515 
for circle, 518-521 
for rectangle, 516-518, 520-521 
for sector, 521 
semi-infinite strip, 521 
for semicircle, 521 
fundamental solutions of, 517, 520 
mixed problem, 521 
Neumann problem, 515 
for circle, 521 
for rectangle, 521 
in cylindrical coordinates, 565 
in polar coordinates, 518 
in spherical coordinates, 566 
Legendre, Adrien Marie, 117, 210 
Legendre equation of order alpha, 119, 196, 208, 
210, 217, 219, 224, 256, 556, 561, 566 
Legendre polynomials, 210, 561, 572 
Leibniz, Gottfried Wilhelm, 50, 58, 280, 487 
Length, of a vector, 289 
L'Hópital, Marquis de, 50 
Liapunov, Alexandr M., 435 
Liapunov function, 438 
Liapunov theorems, 438 
Liapunov's second method, 435-444 
Libby, Willard F., 48 
Liénard, Alfred-Marie, 417 
Liénard equation, 417, 443 
Limit cycle, 446 
Linear dependence and independence, 
of functions, 171, 174 
of vector functions, 300 
of vectors, 298-300 
Linear operator, 184, 246, 253 
Linear ordinary differential equations 
adjoint equation, 120 
change of independent variable, 126, 214, 240 
characteristic equation, 105, 175, 251 
complex roots, 120, 177 
complex roots, repeated, 178 
real and equal roots, 127, 178 
real and unequal roots, 106, 175 
complementary solution, 134 
definition of, 18, 103 
Euler equation, 126, 133, 211 
exact, 119 


existence and uniqueness theorems, 51, 110, 
169, 207, 229, 284 
first order, 24-32 
fundamental set of solutions, 114, 115, 170, 
306, 307 
general solution of, 11, 29, 56, 106, 114, 123, 
129, 134, 171, 306 
homogeneous equation with constant 
coefficients, 103-109, 120-133, 174-181 
integrating factor, 25, 28, 120 
interval of definition, 29, 51 
nonhomogeneous equation, 104, 133-147, 173, 
181-352 
ordinary point, 196, 206, 209, 219 
particular solution, 134, 173 
reduction of order, 131-132, 173 
self-adjoint, 120 
series solution of, see Series solution, 120 
singular point, 196, 206, 211-219 
systems of, see Systems, 219 
undetermined coefficients, 15, 135-142, 
181-185 
variation of parameters, 32, 142-147, 185 
Linearization, 
of a nonlinear equation, 18 
of a nonlinear system, 410 
Liouville, Joseph, 84, 535 
Lipschitz, Rudolf, 91 
Lipschitz condition, 91 
Locally linear systems, 407-417 
Logarithmic decrement, 157 
Logistic difference equation, 94—99 
Logistic differential equation, 60-63, 94, 417, 433 
Logistic growth, 59-63, 65-66 
Lorenz, Edward N., 454 
Lorenz equations, 454-460 
Lotka, Alfred J., 428 
Lotka-Volterra equations, 17, 428-435 


M 
Magnitude, of a vector, 289 
Malthus, Thomas, 59 
Máscheroni, Lorenzo, 233 
Mathematical model, 1, 39-40 
analysis of, 40 
comparison with experiment, 40 
construction of, 6, 13, 39 
Matrices, 286-304 
addition of, 287 
adjoint, 287 
augmented, 291, 296 
conjugate, 287 
diagonalizable, 333-334 
eigenvalues of, 300—303, 466 
eigenvectors of, 300-303, 466 
equality of, 287 
exponential, 332-333, 336 
fundamental, 329-336, 341 
Gaussian elimination, 291 
Hermitian, 204, 303, 316, 333, 338, 541, 554 
identity, 290 
inverse, 290-292 
invertible, 290 
Jordan form of, 341—342, 344 
multiplication by a number, 288 
multiplication of, 288-289 
noninvertible, 290 
nonsingular, 290 
row reduction of, 291 
self-adjoint, 303 
similar, 333 
singular, 290 
subtraction of, 288 
transpose, 287 
zero, 287 


Matrix functions, 292-293 
May, Robert M., 98 

Mean convergence, 568 
Mean square error, 567 
Millikan, Robert A., 50 
Mixing problems, 42, 44-45 
Mode 

natural (of vibrating string), 507 
Modified Euler formula, 367 
Moulton, Forest Ray, 372 
Multiplicity of eigenvalue 
algebraic, 301 

geometric, 301 

Multistep method, 371-375 


N 
Nagumo, Jin-Ichi, 453 
Negative definite function, 437 
Negative semidefinite function, 437 
Neumann, Carl Gottfried, 515 
Neumann problem, 515 
for circle, 521 
for rectangle, 521 
Newton, Isaac, 50 
Newton’s law 
of cooling, 9, 16, 48 
of motion, 2, 148, 285, 526 
Node, 315, 323, 389, 408, 412 
Nonhomogeneous algebraic equations, 295 
Nonhomogeneous boundary value problems, 464, 
545-556 
Fredholm alternative, 547 
solution 
by eigenfunction expansion, 545-554 
by Green’s function, 554 
Nonhomogeneous boundary value problems 
solution 
by Green’s function, 556 
Nonhomogeneous linear differential equations, 
104, 133-147, 173, 181-278, 284, 345-352 
Noninvertible matrix, 290 
Nonlinear ordinary differential equations 
autonomous systems, 398-460 
definition of, 18, 104 
existence and uniqueness theorems, 52, 84, 283 
first order, 51-57 
methods of solving, 33-39, 70-75 
linearization, 18, 410 
periodic solutions of, 428-435, 444-454 
y" = f(t, y), 101 
y" = f(y, y), 101 
Nonsingular matrix, 290 
Normalization condition, 538 
Nullcline, 424 
Numerical dependence, 384 
Numerical methods, 354—378 
Adams-Bashforth formula, 371 
Adams-Moulton formula, 372 
adaptive, 361, 366, 370 
backward differentiation formulas, 374—375 
backward Euler formula, 357-358 
comparison of, 375 
convergence of, 358 
effect of step size, 360, 378—380 
errors in, see Error, 354 
Euler, 76-83, 354-363, 376 
Heun, 364 
improved Euler, 363—367 
modified Euler, 367 
multistep, 371-375 
one-step, 371 
predictor-corrector, 372, 378 
Runge-Kutta, 367-370, 376 
stability of, 381-386 
for stiff equations, 382-384 


for systems of first-order equations, 376-378 
vertical asymptotes, 380-381 
Numerical stability, 38 1-386 


(0) 
Odd function, 482 
Odd periodic extension, 486 
One-step method, 371 
Orbital stability, 446 
Order of differential equation, 17 
Ordinary differential equation 
definition of, 17 
Ordinary point, 196, 206, 209 
at infinity, 219 
Orthogonality 
of Bessel functions, 240, 561 
of Chebyshev polynomials, 561 
of eigenfunctions of Sturm-Liouville problems, 
560 
of functions, 470 
of Legendre polynomials, 210, 561, 572 
of sine and cosine functions, 470—471 
of vectors, 289 
Orthonormal set, 538, 566 
Overdamped motion, 154 


P 
Parseval, Marc-Antoine, 48 1 
Parseval equation, 481, 487, 514, 572 
Partial differential equation 
see also Heat conduction equation, Laplace’s 
equation, Wave equation, 463 
definition of, 17 
Partial fraction expansion, 250, 253, 254, 256 
Particular solution, 134, 173 
Pendulum equation 
generalized nonlinear undamped, 443 
linear undamped, 18 
nonlinear damped, 400-402, 410-416 
nonlinear undamped, 18, 22, 23, 415-416, 
435-436, 439-440, 443 
period, 417 
Period 
of nonlinear undamped pendulum, 417 
of simple harmonic motion, 151 
Periodic boundary conditions, 542, 571 
Periodic forcing terms, 481 
Periodic functions, 469 
derivative of, 476 
fundamental period, 469 
integral of, 476 
Laplace transform of, 263 
linear combination of, 470 
product of, 470 
sine and cosine, 470 
Periodic solutions of autonomous systems, 
428-435, 444-454 
Periodicity of sine and cosine functions, 469-470 
Phase line, 60 
Phase plane, 158, 310, 389 
Phase plot, 158, 168 
Phase portrait, 310, 389 
Phase, of simple harmonic motion, 151 
Picard, Charles-Emile, 84 
Picard 
method of, 84 
Piecewise continuous function, 242, 477 
Pitchfork bifurcation, 69 
Poincaré, Henri, 388, 448 
Poincaré-Bendixson Theorem, 448 
Poiseuille, Jean Louis Marie, 468 
Poiseuille flow, 468 
Population dynamics, 58-66 
Positive definite function, 437 
Positive semidefinite function, 437 
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Potential equation, 514-522 

see also Laplace’s equation, 463 
Power series, properties of, 189-192 
Predator-prey equations, 17, 428—435 
Predictor-corrector method, 372, 378 
principle of conservation of energy, 280 
Proper node, 391, 412 


Q 


Quasi frequency, 153 
Quasi period, 153 


R 
Radioactive decay, 16, 48 
Radiocarbon dating, 48 
Radius of convergence, 190, 207 
Ramp loading, 267 
Rate function, 4 
Rate of growth (decline), 5, 59 
intrinsic, 60 
Rayleigh, Lord, 452 
Rayleigh equation, 452 
Rectified sine wave, 264 
Recurrence relation, 197, 220, 222, 225 
Reduction of order, 131-132, 173 
Reduction to systems of equations, 282 
Region of asymptotic stability, 403 
Regular singular point, 215—219 
at infinity, 219 
Resistance, electric, 156 
see also Damping force, 156 
Resonance, 162, 166, 269 
Riccati, Jacopo Francesco, 101 
Riccati equation, 101 
Rodrigues, Benjamin Olinde, 210 
Rodrigues' formula, 210 
Rosenzweig-MacArthur equations, 434, 453 
Róssler, Otto E., 460 
Róssler equations, 460 
Round-off error, 359, 362, 378-380 
Row reduction, 291 
Runge, Carl David, 367 
Runge-Kutta method, 367-370, 376 


S 
Saddle point, 313, 323, 390, 412 
Saddle-node bifurcation, 69 
Saturation level, 61 
Sawtooth wave, 264, 484, 488 
Schaefer model for fish population, 68 
Schródinger, Erwin, 271 
second-order Adams-Bashforth formula, 371 
Self-adjoint 
boundary value problem, 541—542, 559 
equation, 120 
matrix, 303 
Separable equations, 33-39 
Separated boundary conditions, 535 
Separation constant, 491, 499, 506, 516, 530, 563 
Separation of variables, 489 
further remarks, 562-566 
for heat conduction equation, 491, 499 
for Laplace's equation, 516 
in polar coordinates, 519 
for wave equation, 506 
in polar coordinates, 565 
Separatrix, 403, 415, 423 
Series of eigenfunctions, 540-541, 545, 560, 569 
Series solution 
existence theorem for, 207, 229 
first order equations, 209 
indicial equation, 221, 225, 229 
near an ordinary point, 195-210 
near a regular singular point, 219-230 
recurrence relation, 197, 220, 222, 225 


610 INDEX 


when roots of indicial equation 
are equal, 223, 227, 229, 231-234 
differ by integer, 223, 228, 229, 234-238 
Shift of index of summation, 192-194 
Similar matrices, 333 
Similarity transformation, 333 
Simple eigenvalues, 301 
Simple harmonic motion, 151 
Simply connected region, 71 
Simpson, Thomas, 368 
Simpson's rule, 368 
Sine series, 484, 540 
Singular matrix, 290 
Singular point, 196, 206, 211-219 
irregular, 216, 230 
regular, 211-219 
at infinity, 219 
Sink 
nodal, 389 
spiral, 394 
Slope field, see Direction field, 3 
Solution of ordinary differential equation, 19 
general solution of linear equations, 11, 29, 56, 
106, 114, 123, 129, 134, 171 
of systems, 306 
implicit, 56 
of systems of equations, 283 
Source 
nodal, 390 
spiral, 394 
spectral analysis, 476 
Spiral point, 321, 323, 394, 408, 412 
Spring-mass system, 147-155, 157-168, 255, 329 
three springs, two masses, 281, 285, 323-326, 
329 
two springs, two masses, 180 
Square integrable function, 569 
Square wave, 263, 479, 487 
Stability 
asymptotic, 62, 92, 94, 396, 401, 438, 446 
definition of, 400 
basin of attraction, 403, 415, 423, 440—443 
of critical point, 396, 399-400, 407, 438 
exchange of, 69, 95 
global asymptotic stability, 415 
Liapunov theorems, 438 
for linear systems, 396, 408 
for locally linear systems, 412 
of numerical method, 381—386 
orbital, 446 
region of asymptotic stability, 403 
semistable, 67, 447 
stable, 396, 402, 438, 447 
definition of, 399 
unstable, 62, 92, 94, 396, 402, 438, 447 
definition of, 399 
Stairstep diagram, 95 
Star point, 391 
Steady state solution, 49, 161, 497 
Stefan, Jozef, 48 
Stefan-Boltzmann radiation law, 48 
Step functions, 257 
Stiff differential equations, 374 
Stiff equations, 382-384 
Stokes, George Gabriel, 49 
Stokes’ law, 49 
Strange attractor, 457 
Stream function, 515 
Strutt, John William, see Rayleigh, Lord, 452 
Sturm, Charles-Francois, 535 


Sturm-Liouville boundary value problems, 535, 
545 
eigenvalues 
real, 536 
simple, 538 
nonhomogeneous, 545-549 
self-adjointness of, 542, 559 
singular, 556—561 
continuous spectrum, 559 
Successive approximations 
method of, 84, 336 
Superposition principle, 112, 285, 305, 511, 
562 
Sylvester, James, 286 
Systems 
of differential equations, 17 
autonomous, 399 
existence and uniqueness theorem, 283 
initial conditions, 283 
linear, 284 
nonlinear, 284 
numerical solution of, 376-378 
reduction to, 282 
solution of, 283 
of first-order linear equations, 304—352, 
388-398 
definition of, 284 
diagonalization, 334—335, 345-347 
Euler, 318, 328, 343 
existence and uniqueness theorem, 284 
fundamental matrix, 329—336, 341 
fundamental set of solutions, 306, 307 
general solution of, 306 
homogeneous, 284, 304—309 
homogeneous with constant coefficients, 
309-345 
homogeneous with constant coefficients, 
complex eigenvalues, 319—395 
homogeneous with constant coefficients, real 
and unequal eigenvalues, 309-319, 
389-390 
homogeneous with constant coefficients, 
repeated eigenvalues, 337-345, 391-392 
nonhomogeneous, 284, 345-352 
nonhomogeneous, Laplace transform of, 
350-351 
superposition of solutions, 305 
undetermined coefficients, 347-348 
variation of parameters, 348-350 
of linear algebraic equations, 295-298, 464 


T 
Tangent line method, see Euler method, 77 
Tautochrone, 280 
Taylor, Brook, 192 
Taylor series, 120, 192 
for functions of two variables, 367, 410 
Telegraph equation, 527, 533, 554 
Terminal velocity, 3 
Thermal diffusivity, 489, 524 
Threshold models, 64—66 
Torricelli, Evangelista, 47 
Torricelli's principle, 47 
Trajectories, 283, 310, 389, 396, 404-406 
almost linear systems, 412 
linear systems, 389-396 
locally linear systems, 412 
Transcritical bifurcation, 69 
Transfer function, 278 
Transfer matrix, 351 


Transient solution, 49, 161, 497 
Translation of a function, 259 
Transpose of a matrix, 287 
Triangular wave, 472, 474, 487 
Truncation error 

global, 359, 378-380 

local, 359 


U 
Undetermined coefficients, 15, 135-142, 
181-185 
for systems of equations, 347-348 
Unit step function, 257 
Laplace transform of, 259 


V 
van der Pol, Balthasar, 448 
van der Pol equation, 448-452 
Variation of parameters, 32, 142-147, 185 
for systems of equations, 348-350 
Vectors, 287, 477 
inner product, 289 
length, 289 
linear dependence and independence of, 
298-300 
magnitude, 289 
multiplication of, 289 
orthogonality, 289 
Velocity potential function, 515 
Verhulst, Pierre F., 60 
Verhulst equation, 60 
Vibrations 
of elastic bar, 534, 554 
of elastic membrane, 513, 562-564 
of elastic string, 504-514, 526-527 
natural modes of, 507 
of spring-mass system, 147—155, 157-168, 
255, 329 
of three spring, two mass system, 281, 285, 
323-326, 329 
of two spring, two mass system, 180 
Viscous damping force, 527 
Volterra, Vito, 428 
Volterra integral equation, 279 
Laplace transform of, 279 


W 
warping function, 515 
Wave equation, 17, 504, 554, 562-564 
boundary value problems for, 504-514 
derivation of, 526-527 
dispersive waves, 514 
fundamental solutions of, 506, 511 
general problem, 511 
in infinite medium, 513 
justification of solution, 509-510, 513 
nonzero initial velocity, 510 
in polar coordinates, 562, 565 
see also Elastic string, 463 
Wave velocity, 504, 512, 527 
Wavelength 
of vibrating string, 507 
Wronski, Jósef Maria Hoéné-, 113 
Wronskian, 113, 170 
Abel’s formula for, 117, 174, 306 
for systems of equations, 306 


Y 


Yield, maximum sustainable, 68 


